File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/06/w06-2916_concl.xml

Size: 1,062 bytes

Last Modified: 2025-10-06 13:55:49

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-2916">
  <Title>Unsupervised Grammar Induction by Distribution and Attachment</Title>
  <Section position="8" start_page="271" end_page="271" type="concl">
    <SectionTitle>
8 Conclusions
</SectionTitle>
    <Paragraph position="0"> We have presented an incremental grammar induction system that uses heuristics to improve the efficiency of distributional learning. However, in tests over a large corpus, we have shown that it is capable of learning only a small subset of constituent structure. We have analyzed actual constituent-context distributions to explain these limitations. This analysis provides the motivation for a more structured learning method, which incorporates knowledge of verifiable constituent boundaries - the starts and ends of sentences. This improved system performs significantly better, with a 75% increase in recall over distributional methods, and a significant improvement at retrieving structures that are problematic for distributional methods alone.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML