File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1060_abstr.xml

Size: 888 bytes

Last Modified: 2025-10-06 13:44:27

<?xml version="1.0" standalone="yes"?>
<Paper uid="P05-1060">
  <Title>Multi-Field Information Extraction and Cross-Document Fusion</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In this paper, we examine the task of extracting a set of biographic facts about target individuals from a collection of Web pages. We automatically annotate training text with positive and negative examples of fact extractions and train Rote, Na&amp;quot;ive Bayes, and Conditional Random Field extraction models for fact extraction from individual Web pages. We then propose and evaluate methods for fusing the extracted information across documents to return a consensus answer. A novel cross-field bootstrapping method leverages data interdependencies to yield improved performance.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML