File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/04/p04-1044_concl.xml
Size: 2,573 bytes
Last Modified: 2025-10-06 13:54:03
<?xml version="1.0" standalone="yes"?> <Paper uid="P04-1044"> <Title>Combining Acoustic and Pragmatic Features to Predict Recognition Performance in Spoken Dialogue Systems</Title> <Section position="9" start_page="0" end_page="0" type="concl"> <SectionTitle> 8 Conclusion </SectionTitle> <Paragraph position="0"> We used a combination of acoustic confidence and pragmatic plausibility features (i.e. computed from dialogue context) to predict the quality of incoming recognition hypotheses to a multi-modal dialogue system. We classified hypotheses as accept, (clarify), reject, or ignore: functional categories that 7Following (Hinton, 1995), we leave out categories with expected frequencies < 5 in the kh2 computation and reduce the degrees of freedom accordingly.</Paragraph> <Paragraph position="1"> can be used by a dialogue manager to decide appropriate system reactions. The approach is novel in combining machine learning with n-best processing for spoken dialogue systems using the Information State Update approach.</Paragraph> <Paragraph position="2"> Our best results, obtained using TiMBL with optimized parameters, show a 25% weighted f-score improvement over a baseline system that uses a &quot;grammar-switching&quot; approach to context-sensitive speech recognition, and are only 8% away from the optimal performance that can be achieved on the data. Clearly, this improvement would result in better dialogue system performance overall. Parameter optimization improved the classification results by 9% compared to using the learner with default settings, which shows the importance of such tuning.</Paragraph> <Paragraph position="3"> Future work points in two directions: first, integrating our methodology into working ISU-based dialogue systems and determining whether or not they improve in terms of standard dialogue evaluation metrics (e.g. task completion). The ISU approach is a particularly useful testbed for our methodology because it collects information pertaining to dialogue context in a central data structure from which it can be easily extracted. This avenue will be further explored in the TALK project8.</Paragraph> <Paragraph position="4"> Second, it will be interesting to investigate the impact of different dialogue and task features for classification and to introduce a distinction between &quot;generic&quot; features that are domain independent and &quot;application-specific&quot; features which reflect properties of individual systems and application scenarios.</Paragraph> </Section> class="xml-element"></Paper>