File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-1107_abstr.xml

Size: 1,029 bytes

Last Modified: 2025-10-06 13:45:17

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-1107">
  <Title>Evaluation of Several Phonetic Similarity Algorithms on the Task of Cognate Identification</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We investigate the problem of measuring phonetic similarity, focusing on the identi cation of cognates, words of the same origin in different languages. We compare representatives of two principal approaches to computing phonetic similarity: manually-designed metrics, and learning algorithms. In particular, we consider a stochastic transducer, a Pair HMM, several DBN models, and two constructed schemes. We test those approaches on the task of identifying cognates among Indoeuropean languages, both in the supervised and unsupervised context. Our results suggest that the averaged context DBN model and the Pair HMM achieve the highest accuracy given a large training set of positive examples.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML