XML Viewer - p03-1058

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/03/p03-1058_abstr.xml

Size: 1,135 bytes

Last Modified: 2025-10-06 13:42:54

<?xml version="1.0" standalone="yes"?>
<Paper uid="P03-1058">
  <Title>Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study Hwee Tou Ng</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> A central problem of word sense disambiguation (WSD) is the lack of manually sense-tagged data required for supervised learning. In this paper, we evaluate an approach to automatically acquire sense-tagged training data from English-Chinese parallel corpora, which are then used for disambiguating the nouns in the SENSEVAL-2 English lexical sample task. Our investigation reveals that this method of acquiring sense-tagged data is promising. On a subset of the most difficult SENSEVAL-2 nouns, the accuracy difference between the two approaches is only 14.0%, and the difference could narrow further to 6.5% if we disregard the advantage that manually sense-tagged data have in their sense coverage. Our analysis also highlights the importance of the issue of domain dependence in evaluating WSD programs.</Paragraph>
  </Section>
class="xml-element"></Paper>

Download Original XML