File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/92/c92-3165_intro.xml
Size: 2,618 bytes
Last Modified: 2025-10-06 14:05:12
<?xml version="1.0" standalone="yes"?> <Paper uid="C92-3165"> <Title>Interactive Speech Understanding</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> It has been continuously mentioned thatt some kind of latnguage knowledge is essential in good-quality speech understanding. Until recently, however, most research has focused mainly oil word recognition atnd one of the excellent recognition systems built to date is Sphinx developed by Lee \[7\]. Although SI)hinx atttained atn excellent word accuracy of 96 % on at 997-word task, its sentence recognition accuracy drops slgnificatntly clue to its use of only at stattisticaJ trigra~l gratmiilal'. null There hatve been at few atttempts to integratte at speech recognition device with a nattural language understanding syste,n, ltatyes el al. \[3\] adopted technique of case fi'ame instantiation to patrse at continuously spoken English sentence in the form of at word lattice (a set of word catndldattes hypothesized by at st)eech recognition module) and produce at frame representation of the utterance.</Paragraph> <Paragraph position="1"> The case frame patrsing hats been pursued by Poesio et al. \[8\] and Giatchin et al. \[2\] for instance. Meanwhile, at compiler-oriented shift-reduce LR parsing technique hats been used for speech recognition recently due to its no-batcktracking tatl)le-drlven ei\[iciency \[12, iII, 6\]. Becatuse the parsing proceeds from left to right pruning lowl)robatl)ility t)atrtiatl-parses, the correct parse catn not be obtained if the parsing fails to find the correct path in the beginning. Moreover, it is sometimes difficult to handle tim very noisy input, esl)ecially the input with missing words. Thus an Lll. parser sometimes yields totally incorrect but syntactically-sound hypotheses or no hypotheses att all. This weakness is occasionally cited to demonstrate superiority of the pa.rsing method nsing much simI)ler bigram or trigratm grammars in which the re.covery in the middle of the input earn be done at eatse. In this paper, we describe at method of enllatncing the generalized 1,R (GLR) parsing towatrds interactive speech understanding. null Section 2 describes the enhatnced GLR parrslug. Section 3 describes the rol)ustness of the parser and presents an interatctive method to resolve the unclcatr I)ortion of the input and unknown words. Section 4 experiments the effectiveness of the technique in parsing spoken sentences. Finally the concluding rematrks atre given in Section 5.</Paragraph> </Section> class="xml-element"></Paper>