File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/04/w04-3218_metho.xml

Size: 1,787 bytes

Last Modified: 2025-10-06 14:09:31

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-3218">
  <Title>Mining Spoken Dialogue Corpora for System Evaluation and Modeling</Title>
  <Section position="3" start_page="0" end_page="0" type="metho">
    <SectionTitle>
Call-type: Request(Account Balance)
</SectionTitle>
    <Paragraph position="0"> dialog level. Clustering at the utterance is for modeling the language people use in a speci c dialog context; clustering at the dialog level allows us to characterize the whole interaction between the users and a system. In the next section we describe the corpora data structure. In section 3 we describe the clustering algorithms.</Paragraph>
    <Paragraph position="1"> In sections 4 and 5 we report on experiments and results for utterance-based and dialog clustering, respectively.</Paragraph>
  </Section>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2 Dialog corpora
</SectionTitle>
    <Paragraph position="0"> The corpora is collected from an automatic call routing system where the aim is to transfer the user to the right route in a large call center.</Paragraph>
    <Paragraph position="1"> An example dialog from a customer care application is given in Figure 1. After the greeting prompt, the speaker's response is recognized using an automatic speech recognizer (ASR).</Paragraph>
    <Paragraph position="2"> Then, the intent of the speaker is identi ed from the recognized sequence, using a spoken language understanding (SLU) component. This step can be framed as a classi cation problem, where the aim is to classify the intent of the user into one of the prede ned call-types (Gorin et al., 1997). Then, the user would be engaged in a dialog via clari cation or con rmation prompts until a nal route is determined.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML