File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1055_abstr.xml
Size: 1,469 bytes
Last Modified: 2025-10-06 13:44:25
<?xml version="1.0" standalone="yes"?> <Paper uid="P05-1055"> <Title>Position Specific Posterior Lattices for Indexing Speech</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> The paper presents the Position Specific Posterior Lattice, a novel representation of automatic speech recognition lattices that naturally lends itself to efficient indexing of position information and subsequent relevance ranking of spoken documents using proximity.</Paragraph> <Paragraph position="1"> In experiments performed on a collection of lecture recordings -- MIT iCampus data -- the spoken document ranking accuracy was improved by 20% relative over the commonly used baseline of indexing the 1-best output from an automatic speech recognizer. The Mean Average Precision (MAP) increased from 0.53 when using 1-best output to 0.62 when using the new lattice representation. The reference used for evaluation is the output of a standard retrieval engine working on the manual transcription of the speech collection. null Albeit lossy, the PSPL lattice is also much more compact than the ASR 3-gram lattice from which it is computed -- which translates in reduced inverted index size as well -- at virtually no degradation in word-error-rate performance. Since new paths are introduced in the lattice, the ORACLE accuracy increases over the original ASR lattice.</Paragraph> </Section> class="xml-element"></Paper>