File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/02/p02-1017_abstr.xml

Size: 1,040 bytes

Last Modified: 2025-10-06 13:42:25

<?xml version="1.0" standalone="yes"?>
<Paper uid="P02-1017">
  <Title>A Generative Constituent-Context Model for Improved Grammar Induction</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We present a generative distributional model for the unsupervised induction of natural language syntax which explicitly models constituent yields and contexts. Parameter search with EM produces higher-quality analyses than previously exhibited by unsupervised systems, giving the best published unsupervised parsing results on the ATIS corpus. Experiments on Penn Treebank sentences of comparable length show an even higher F1 of 71% on non-trivial brackets. We compare distributionally induced and actual part-of-speech tags as input data, and examine extensions to the basic model. We discuss errors made by the system, compare the system to previous models, and discuss upper bounds, lower bounds, and stability for this task.</Paragraph>
  </Section>
</Paper>