File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/02/p02-1018_abstr.xml

Size: 1,216 bytes

Last Modified: 2025-10-06 13:42:24

<?xml version="1.0" standalone="yes"?>
<Paper uid="P02-1018">
  <Title>A simple pattern-matching algorithm for recovering empty nodes and their antecedents</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Mark Johnson@Brown.edu
Abstract
</SectionTitle>
    <Paragraph position="0"> This paper describes a simple pattern-matching algorithm for recovering empty nodes and identifying their co-indexed antecedents in phrase structure trees that do not contain this information. The patterns are minimal connected tree fragments containing an empty node and all other nodes co-indexed with it. This paper also proposes an evaluation procedure for empty node recovery procedures which is independent of most of the details of phrase structure, which makes it possible to compare the performance of empty node recovery on parser output with the empty node annotations in a gold-standard corpus. Evaluating the algorithm on the output of Charniak's parser (Charniak, 2000) and the Penn treebank (Marcus et al., 1993) shows that the pattern-matching algorithm does surprisingly well on the most frequently occuring types of empty nodes given its simplicity.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML