XML Viewer - h89-1011

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/89/h89-1011_concl.xml

Size: 1,591 bytes

Last Modified: 2025-10-06 13:56:22

<?xml version="1.0" standalone="yes"?>
<Paper uid="H89-1011">
  <Title>Speaker Adaptation from Limited Training in the BBN BYBLOS Speech Recognition System</Title>
  <Section position="6" start_page="104" end_page="104" type="concl">
    <SectionTitle>
5. Conclusion
</SectionTitle>
    <Paragraph position="0"> Three improvements to the DTW-based speaker adaptation method have been combined to achieve a 45% overall reduction in recognition word error rate on development test data. The largest single improvement was due to the addition of a codebook derived from a set of cepstral derivative features. This improvement does not affect the estimation of the between-speaker transformation.</Paragraph>
    <Paragraph position="1"> This suggests that further improvements to the speaker-dependent prototype model can lead to significant improvements in the adapted model's performance.</Paragraph>
    <Paragraph position="2"> The performance of the system on new evaluation test data was 8.4% word error averaged over 12 speakers, using the standard word-pair grammar. The system used a total of 600 sentences from a single prototype speaker and and a training sample of 40 sentences from each of the 12 test speakers. The performance of the system is comparable to several speaker-independent systems trained on 4200 sentences from 105 speakers, and tested on the same data. This result suggests that speaker adaptation may be the most cost-effective solution for applications which must be brought up quickly and must accommodate changing task domains or test conditions.</Paragraph>
  </Section>
class="xml-element"></Paper>

Download Original XML