<?xml version="1.0" standalone="yes"?>
<Paper uid="C00-1022">
  <Title>Exogeneous and Endogeneous Approaches to Semantic Categorization of Unknown Technical Terms</Title>
  <Section position="3" start_page="145" end_page="145" type="intro">
    <SectionTitle>
2 The Terminological Resources
</SectionTitle>
    <Paragraph position="0"> We use in this study a French/English bilingual terminology of the aeronautic domain. This hand-crafted database results from a multidisciplinary effort involving technical writers, translators, terminologists and engineers. In its current state, the database contains 12,267 French/English term couples, structured in 70 semantic categories. As already observed in several terminological databases, multi-word terms cover the larger part of the database (nearly 80%). The described terms are mostly nouns but the database also contains about 200 verbs and verbal phrases. Table 1 gives some examples of terms and a short description of the associated categories. These linguistic resources are integrated in a computer-aided translation environment used by technical writers.</Paragraph>
    <Paragraph position="1"> Semantic categories were originally introduced in order to distinguish the various senses of polysemous terms. Each term couple is annotated with one or more categories specifying the contexts in which the translation is recommended. An entry is associated with a term for each identified meaning. For example, the French term dépassement has at least two possible meanings, corresponding to two different translations: overflow in the Data Processing category (DPR) and out-of-flushness in the Aircraft Structure category (STR). As shown in the examples of Table 1, the assignment of semantic categories has been extended to monosemous terms.</Paragraph>
    <Paragraph position="2"> In our experiments of term categorization, only the French terms have been used.</Paragraph>
  </Section>
</Paper>