File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/w04-2313_abstr.xml

Size: 1,156 bytes

Last Modified: 2025-10-06 13:44:00

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-2313">
  <Title>Towards Automatic Identification of Discourse Markers in Dialogs: The Case of Like</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> This article discusses the detection of discourse markers (DM) in dialog transcriptions, by human annotators and by automated means. After a theoretical discussion of the definition of DMs and their relevance to natural language processing, we focus on the role of like as a DM. Results from experiments with human annotators show that detection of DMs is a difficult but reliable task, which requires prosodic information from soundtracks.</Paragraph>
    <Paragraph position="1"> Then, several types of features are defined for automatic disambiguation of like: collocations, part-of-speech tags and duration-based features. Decision-tree learning shows that for like, nearly 70% precision can be reached, with near 100% recall, mainly using collocation filters. Similar results hold for well, with about 91% precision at 100% recall.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML