File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/c04-1077_abstr.xml

Size: 830 bytes

Last Modified: 2025-10-06 13:43:21

<?xml version="1.0" standalone="yes"?>
<Paper uid="C04-1077">
  <Title>Corpus and Evaluation Measures for Multiple Document Summarization with Multiple Sources</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In this paper, we introduce a large-scale test collection for multiple document summarization, the Text Summarization Challenge 3 (TSC3) corpus. We detail the corpus construction and evaluation measures. A key feature of the corpus is that it annotates not only the important sentences in a document set but also which of those sentences share the same content. Moreover, we define new evaluation metrics that take redundancy into account and discuss the effectiveness of redundancy minimization.</Paragraph>
  </Section>
</Paper>