File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-1632_abstr.xml

Size: 890 bytes

Last Modified: 2025-10-06 13:45:30

<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-1632">
  <Title>Using Linguistically Motivated Features for Paragraph Boundary Identification</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In this paper we propose a machine-learning approach to paragraph boundary identification which utilizes linguistically motivated features. We investigate the relation between paragraph boundaries and discourse cues, pronominalization and information structure. We test our algorithm on German data and report improvements over three baselines including a reimplementation of Sporleder &amp; Lapata's (2006) work on paragraph segmentation. An analysis of the features' contribution suggests an interpretation of what paragraph boundaries indicate and what they depend on.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML