<?xml version="1.0" standalone="yes"?> <Paper uid="P03-2025"> <Title>Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> Although corpora have been an object of study for some decades, recent years have seen increased interest in their use and construction. With this increased interest and awareness has come an expansion of their application to knowledge acquisition, such as the acquisition of bilingual terminology. In addition, non-aligned comparable corpora have received special attention in bilingual terminology acquisition and lexical resource enrichment (Dejean et al., 2002; Fung, 2000; Koehn and Knight, 2002; Rapp, 1999).</Paragraph> <Paragraph position="1"> This paper presents a novel approach to bilingual terminology acquisition and disambiguation from scarce resources such as comparable corpora, together with phrasal translation through re-scoring techniques and evaluations on Cross-Language Information Retrieval (CLIR). CLIR consists of retrieving documents written in one language using queries written in another language. The approach is applied to a large-scale test collection, NTCIR, for the Japanese-English language pair.</Paragraph> </Section> </Paper>