XML Viewer - h93-1094

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/93/h93-1094_metho.xml
Size: 4,177 bytes
Last Modified: 2025-10-06 14:13:26
<?xml version="1.0" standalone="yes"?>
<Paper uid="H93-1094">
  <Title>THIS WORK WAS SPONSORED BY THE DEFENSE AD- VANCED RESEARCH PROJECTS AGENCY. THE VIEWS</Title>
  <Section position="1" start_page="0" end_page="0" type="metho">
    <SectionTitle>
ROBUST CONTINUOUS SPEECH RECOGNITION
TECHNOLOGY
PROGRAM SUMMARY*
</SectionTitle>
    <Paragraph position="0"/>
  </Section>
  <Section position="2" start_page="0" end_page="0" type="metho">
    <SectionTitle>
PROGRAM GOALS
</SectionTitle>
    <Paragraph position="0"> The major objective of this program is to develop and demonstrate robust, high performance continuous speech recognition (CSR) techniques focussed on applications in Spoken Language Systems (SLS). The effort focusses on developing advanced acoustic modelling, efficient search techniques, rapid enrollment, and adaptation techniques for robust large vocabulary CSR. An additional Lincoln goal is to define and develop application of robust CSR to military and civilian systems, and to expedite effective technology transfer.</Paragraph>
  </Section>
  <Section position="3" start_page="0" end_page="0" type="metho">
    <SectionTitle>
BACKGROUND
</SectionTitle>
    <Paragraph position="0"> The Lincoln program began with a focus on improving speaker stress robustness for the fighter aircraft environment. A robust hidden Markov model (HMM) system was developed with very high performance under stress conditions. The robust HMM techniques were then extended to yield state-of-the-art performance on the DARPA Resource Management corpus, using a tied-mixture HMM CSR approach. null Recent work has focussed on the large-vocabulary Wall Street Journal (WSJ) corpus, with vocabularies of 5K, 20K, and up to 64K words. The HMM CSR has been converted to a stack-decoder-based control strategy to operate efficiently with good performance in these tasks.</Paragraph>
  </Section>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
RECENT ACCOMPLISHMENTS
</SectionTitle>
    <Paragraph position="0"> Recent accomplishments include: (1) development of the stack decoder and demonstration of its effectiveness on vocabularies up to 64K words; (2) development and integration of fast-match and detailed match; (3) further development of acoustic modelling techniques for the large vocabulary task; (4) a full set of evaluation tests in the November 1992 WSJ tests, including (e.g.) a 4.5% error rate on a</Paragraph>
    <Paragraph position="2"> time speaker adaptation techniques with substantial improvements due to adaptation from both speaker-specific and speaker-independent initial models; (6) participation in and contributions to development of the WSJ corpus,</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
*THIS WORK WAS SPONSORED BY THE DEFENSE AD-
VANCED RESEARCH PROJECTS AGENCY. THE VIEWS
EXPRESSED ARE THOSE OF THE AUTHOR AND DO NOT
REFLECT THE OFFICIAL POLICY OF POSITION OF THE
</SectionTitle>
    <Paragraph position="0"> U.S. GOVERNMENT.</Paragraph>
    <Paragraph position="1"> including providing baseline language models to all sites; (7) survey and study of opportunities for military and government applications of spoken language technology, and organization of a workshop focussing on technology transfer; and (8) continuing leadership of the DARPA spoken</Paragraph>
  </Section>
  <Section position="6" start_page="0" end_page="400" type="metho">
    <SectionTitle>
Language Coordinating Committee.
PLANS
</SectionTitle>
    <Paragraph position="0"> Plans for the current program include: (1) development of advanced acoustic modeliing techniques; (2) development and improvement of stack-decoder-based HMM for large vocabulary tasks, via development and integration of advanced acoustic models, acoustic fast match, and efficient search techniques; (3) development of technique for integration of stack-based CSR with natural language processors; (4) extension of run-time adaptation techniques to adapt acoustic parameters of the tied-mixture HMM to speaker channel, and environment; and (5) continued investigation of applications opportunities for spoken language systems.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML