File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/w04-2205_abstr.xml
Size: 1,545 bytes
Last Modified: 2025-10-06 13:43:59
<?xml version="1.0" standalone="yes"?> <Paper uid="W04-2205"> <Title>Building and sharing multilingual speech resources, using ERIM generic platforms</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In the framework of projects ChinFaDial and ERIM we have developed in recent years several platforms allowing to handle various aspects of bilingual spoken dialogues on the web --mainly, spontaneous speech corpus collection through distant human interpreting.</Paragraph> <Paragraph position="1"> Current development of the core ERIM-Interp and ERIM-Collect platforms now includes multimodal user interaction, integration of some machine aids (such as speech turn logs through speech recognition, or tentatively speech machine translation, both based on server-grounded market products), and next, online aids to speakers and/or interpreters.</Paragraph> <Paragraph position="2"> First collected data should be made available on the web in fall 2004 (DistribDial) along with, as soon as available, a robust version of the collecting platform, in order to promote collaborative building, and sharing, of &quot;raw&quot; unannotated multilingual speech corpora.</Paragraph> <Paragraph position="3"> A variant of the ERIM environment is to extend to distant e-training in interpreting, possibly creating situations which should in turn, in our view, foster larger-scale data collection and sharing in open access mode.</Paragraph> </Section> class="xml-element"></Paper>