File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/96/c96-2202_concl.xml

Size: 941 bytes

Last Modified: 2025-10-06 13:57:42

<?xml version="1.0" standalone="yes"?>
<Paper uid="C96-2202">
  <Title>Word Extraction from Corpora and Its Part-of-Speech Estimation Using Distributional Analysis</Title>
  <Section position="6" start_page="1121" end_page="1121" type="concl">
    <SectionTitle>
5 Conclusion
</SectionTitle>
    <Paragraph position="0"> We have described a new method to extract words from a corpus and estimate their POSs using distributional analysis. Our method is based on the hypothesis that sets of strings preceding or following two arbitrary words belonging to the same POS are similar to each other. We have proposed a mathematically well-founded method to compute probability distrii)ution in which a string belongs to given POSs. The results of word extraction experiments attested the correctness of our hypothesis. Adding extracted words to the dictionary, the accuracy of a morphological analyzer augmented considerably.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML