File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/04/c04-1110_concl.xml

Size: 2,757 bytes

Last Modified: 2025-10-06 13:53:57

<?xml version="1.0" standalone="yes"?>
<Paper uid="C04-1110">
  <Title>Semantic Similarity Applied to Spoken Dialogue Summarization</Title>
  <Section position="7" start_page="0" end_page="0" type="concl">
    <SectionTitle>
6 Concluding Remarks
</SectionTitle>
    <Paragraph position="0"> We introduced a new approach to spoken dialogue summarization. Our approach combines statistical, i.e. corpus-based, and knowledge-based techniques.</Paragraph>
    <Paragraph position="1"> It utilizes the knowledge encoded in the noun part of WordNet and applies a set of semantic similarity metrics to dialogue summarization. All semantic similarity-based summarization methods outperform RANDOM, LEAD and TF*IDF baseline systems. In the following, we discuss some remaining challenges and future research.</Paragraph>
    <Paragraph position="2"> More sophisticated data pre-processing. We plan 4Roughly speaking, the differences are most evident for compression rates between 20% and 30%.</Paragraph>
    <Paragraph position="3"> to incorporate the pre-processing components used by Zechner (2002) and evaluate their contribution to our task. Including an anaphora resolution component would also result in better Recall.</Paragraph>
    <Paragraph position="4"> Automatic word sense disambiguation. Switchboard conversational speech is highly ambiguous. Automatic disambiguation of noun senses to Word-Net concepts is important in order to integrate our approach into real-life summarization systems.</Paragraph>
    <Paragraph position="5"> Investigating other types of information in parallel.</Paragraph>
    <Paragraph position="6"> A clear desideratum will be assessing the overall coherence of the discourse, speaker info, turn type, information about non-nouns.</Paragraph>
    <Paragraph position="7"> Application to text and speech summarization. Our approach can be applied to written-to-be-spoken speech and text summarization. It will be interesting to investigate whether conceptual structures of texts (the input to our system) are comparable to the conceptual structures found in dialogues.</Paragraph>
    <Paragraph position="8"> Readability, coherence, and usability of the summaries produced. A close examination of summaries based on human comprehension will be interesting. It may be necessary to introduce filtering or other post-processing techniques improving the quality of summaries.</Paragraph>
    <Paragraph position="9"> Even without very sophisticated pre-processing of the dialogue data, our algorithm yields promising results. It was evaluated on the Switchboard data, which is a challenging evaluation corpus. Our vision is to adopt the summarization approach presented here in a system used for the automatic production of meeting minutes.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML