File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1060_abstr.xml
Size: 888 bytes
Last Modified: 2025-10-06 13:44:27
<?xml version="1.0" standalone="yes"?> <Paper uid="P05-1060"> <Title>Multi-Field Information Extraction and Cross-Document Fusion</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In this paper, we examine the task of extracting a set of biographic facts about target individuals from a collection of Web pages. We automatically annotate training text with positive and negative examples of fact extractions and train Rote, Na&quot;ive Bayes, and Conditional Random Field extraction models for fact extraction from individual Web pages. We then propose and evaluate methods for fusing the extracted information across documents to return a consensus answer. A novel cross-field bootstrapping method leverages data interdependencies to yield improved performance.</Paragraph> </Section> class="xml-element"></Paper>