File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/92/c92-2088_intro.xml

Size: 5,272 bytes

Last Modified: 2025-10-06 14:05:11

<?xml version="1.0" standalone="yes"?>
<Paper uid="C92-2088">
  <Title>Lexi(:al Knowledge Acquisition ti'om Bilingual Col'por&amp;</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> It has become widely accel)ted that developing a large scale semantic dictionary is indispensable to future natural language research. ILL recent years, several research activities for compiling selnantic dictionaries tot natural language processing have been uudcrtaken One of the approaches in this research is attempts to compile dictionaries by band. Japan Electronic Dictionary Research Institute (El)R.) is now compiling conceptual dictionaries\[5\] by hand with the help of software tools. \[nformation-4echnology Promotion Agency (IPA), Japan, has also compiled IPA Lexicon of the Japanese Language for computers (II'AL)\[4\]. IPAL has 861 entries for basic Jalranese verbs. Cyc project attempts to assend)le a massive knowledge base covering human common-sense knowledge\[7\]. IIowever, this approach sailors from *The authol~ would like to t}mak the editorial staff of Kodazm|m for permission tO use the data of Jalmnese-12)nglidt dictionaa'y, arm also thank l)r. Shouichi YOKOYAMA, I,',TL, and Prof. l\[ozumi TANAKA and Dr. '\['akenobu TOKUNA(;A, Tokyo hmtitute of Teclmology, for providing us the data of Jal)ane~e-l~nglish dictionary. This work is partly supported by the Grants from Ministry of Education, #032,15103.</Paragraph>
    <Paragraph position="1"> probh'.Ins socb as a huge alnount of manila\[ labor, difficulties in extending tile dictionaries, unstable remilts, and so forth.</Paragraph>
    <Paragraph position="2"> Anothcr approach is to compile dictionaries using some teclxmques of lexical knowledge acquisition.</Paragraph>
    <Paragraph position="3"> One ~nch approach is to extract hierarclfical relations or it thesanrtm of conceptual items froln hunLall dictionaries in an automatic way. q)surrnnaru et el.</Paragraph>
    <Paragraph position="4"> studied to construct a t}LeSaLLrlIs of nominal concepts from noun detinitions\[t3\], qbmiara et al. also extracted snperordinatc-subordmatc relation between verbs from the defining sentences in IPAL\[12\]. lie sidcs these rcseasches, there are other several research activitics tbr lexical knowledge acquisition, which syntactically anMyze the sentences m large corpora and attcmpt to extract lcxical knowledge from statistical data \[3\] \[1\]. Most of the works undertake shallow analysis of texts and they extract only superticial lexical information.</Paragraph>
    <Paragraph position="5"> For the development of tile techniques of knowledge acquisition from natural language texts, it is very important to improve the httter approach of cornpiling semantic dictionaries by comimter l)rograuL~. Ilowever, there are at least two basic difficulties in this at)preach  1. Tire i~robh~m (ff syntactic ambiguities When analyzing a sentence., syntactic ambiguities often remain. So i~ is not easy to obtain correct parsed results automatically.</Paragraph>
    <Paragraph position="6"> 2. The, probh~rrr of polyue,my  it often happens that one word has several meanings and corre.sponrls to ,~cveral concepts. So it is not easy to associate one sm'fa~e word with olle correct conceptHal item.</Paragraph>
    <Paragraph position="7"> Our approach to solve these diiliculties is to make use of translatitm cxarnples in two distinct languages that have quite different syntactic structures and word mf~anings (such as English and Japanese), and to c(nnt~are analyzed results from each language, h| many (:asc~, the two languagcs }Lave different types of syntactic ambiguities, anti comparison of syntactic structures of both bmguagcs helps to resolve the ambiguities. Also, a pair of bilingually equivalent snrface words helps to a~'4ociate tile words with conceptual Ac~s DECOLING-92, NANTIiS, 23-28 AOIJT 1992 5 8 1 l'~oc. OF COL\]NG-92. NANTES. AUG. 23-28, 1992 words helps to associate the words with conceptual items, because the intersection of conceptual items that each surface word has could be considered as one conceptual item\[ll\] \[2\]. \[&amp;quot;or example, in tire case of the translation example given in Example 1, both syntactic and semantic ambiguities are resolved.</Paragraph>
    <Paragraph position="8"> Example 1 E: I hung my coat on the hook.</Paragraph>
    <Paragraph position="9"> J: ~:L (I) ;~ (topic) ~2~ (coat) ~ (ca.se-m~trker)  ~'5&amp;quot; (hook) lZ (case-marker) zi'$~'f: (hung)o 1. Syntactic disumbiguation  The English sentence in Example 1 is syntactically ambiguous because the prepositional phrase &amp;quot;on the hook&amp;quot; can modify both the verb &amp;quot;hung&amp;quot; aad tim noun phrase &amp;quot;my coat&amp;quot; using grammatical knowledge only. On the other band, in the Japanese sentence, the phrase &amp;quot;7)~ ~', Is_&amp;quot; can modify nothing but the verb &amp;quot;~t;tf:&amp;quot;. Thus, if knowledge about word equivalence pairs such as ( I, ~, ), (hung, ~'~t t: ), (coat, _t-.;a ), (hook, ~' g ) are available from bilingual dictionaries, the ambiguity of pp-attachment is resolved by syntactically matching the structures of the two sentences.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML