File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/88/c88-1042_abstr.xml
Size: 8,461 bytes
Last Modified: 2025-10-06 13:46:29
<?xml version="1.0" standalone="yes"?> <Paper uid="C88-1042"> <Title>If (CN has Ancestor ( A ((CAT QUES)) has First Son F ((CAT NP PRON)(WH l)))) Then < If (F ((P +))) Then < Return(True);> Else < Return(False); > ; > ; @ Who is competent? @ Who does John expect to seem old? @ What does John expect to seem old? If (CN has Ancestor B((CAT INFCL</Title> <Section position="2" start_page="0" end_page="205" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> An approach to tile Transfer phase of a Machine Translation system is presented, where the bilingual lexicon plays an active role, guiding Transfer by means of executable descriptions of word senses. The means for lexical sense specification are, however, general enough and can in principle apply to other system arthitectures, e.g. in tile Generation phase if Transfer is intentionally kept minimal. The active lexicon is the one and only systea~ component which is exposed to users and can serve to linguistically control Transfer effects. A unified approach to lexicon creation and maintenance is proposed, which contains means to gradually refine sense specification and tailor the definitions to specific text domains. The underlying linguistic principles, the nature of sense distinction required tot translation, and tilt: formal structure of the lexicon are discussed.</Paragraph> <Paragraph position="1"> Ideg httroduction While melbods of monolingual Analysis and Generation are also treated in other contexts, bilingual Transfer problems are hardly inw~stigated outside the context of Machine Translation. Research in Machine Translation can, in this case, make a specific contribution to Computational l,inguistics. The general issue here is tire formal representation of phrase structures and lexical units and the methodology for specifying transformations between these representations in two (or more) languages. The role of tile bilingual lexicon in the Transfer activity, attd its power to assist in the resolution of mapping problems, is a key element.</Paragraph> <Paragraph position="2"> In tills paper we present an approach to the formal representation of bilingual lexical knowledge and to the way this knowledge is incorporated into the translation process. In section 2 we describe the role and place of the bilingual lexicon in the translation process, present the concept of executable descriptions of word senses as lexical definitions, and discuss some aspects of practical usage. Our approach to the sense distinction required for translation, which is different from monolingual sense distinction, is discussed in section 3. In section 4 we make a few methodological comments, arguing that wha! is often portrayed as the ideal Transfer-based architecture, is not the only, and not necessarily the best way to achieve modularity and save work. Section 5 contains a formal definition of the lexicon specification language with some discussion of its features and the intended restrictions on the power given to the lexicographer. Finally, an additional example is given in detail in section 6.</Paragraph> <Paragraph position="3"> This work has been carried out as part of the MENTOR project, where several groups in European IBM Scientific Centers are collaborating on M(A)T research. The approach presented here has been developed and prototyped by the group in Ilaifa, Israel, as part of the proposal for tire design of Transfer-related operations. The examples below involve translations from English into I lebrew.</Paragraph> <Paragraph position="4"> We thank our colleagues Danit Ben-Ari, Esther Bentur and Maria Vilkuna for their contributions and comments.</Paragraph> <Paragraph position="5"> 2. &quot;lhe Role and Content of the Bilingt, al Lexicon \[Cullingford 87\] describes an MT system which is purely lexicon driven. Ills system follows the Conceptual Processing model, and is not Transfer-based, hence the emphasis there is on deep Analysis and Generation. Many other systems distinguish between I.exical Transfer and Structural Transfer, but they take different approaches to the actual separation of these two sub-processes. In the work reported here, an attempt was made to strictly separate lexicon-driven selection of target language equivalents from the global mapping of syntactic structures in the SIA into those of tile 'I'I~ (cf. \[Biewer 85\]). 'lhe lexicon lookup phase, which takes place before phrase structure transformations, gets as its input the internal data representation provided by the SL parser (PEG \[Jensen 86\], in out prototype). The terminal nodes (leaves of a parse tree) are searched in a pre-defined order for certain parts of speech.</Paragraph> <Paragraph position="6"> For each word in turn a target equivalent is selected from the bilingual lexicon and attached to the corresponding node in the parse. Features may also be added to other affected nodes.</Paragraph> <Paragraph position="7"> f lowever, no structural modifications are made. Structural transformations are carried out as an independent sub-process, upon completion of tire bilingual lexical phase, and are not discussed in this paper.</Paragraph> <Paragraph position="8"> Since in many cases, and ill fact for most verbs, several alternative translations exist, tile selection is done by texical differentiation rules. These rules refer to the syntactic environment of tile word in the parse tree and to a limited number of semantic features. The rules can access any node and attribute identified by the parser. Given that the rules are stated in terms of the SL phrase structure, it seems more natural to apply them as close to Analysis as possible. Nevertheless, the sense distinction cannot be done as part of SL Analysis itself, as in many cases it depends on factors which may vary from one TL to another.</Paragraph> <Paragraph position="9"> The sub-process of bilingual lexical substitution proceeds unidirectionally. No iterations take place for any given phrase. In some cases this may require extensive searching of' the phrase The filllowing abbreviation~ are u~ed throughout thi~ pal~er: SL =: Source Language, BL = BiLingual, TL = Target Language structure in a manner determined by the relevant differentiation rules.</Paragraph> <Paragraph position="10"> The differentiation rules which comprise an entry in the bilingual lexicon are stated in a special language, and are directly executable by a set of functions (LISP, in our implementation). Although the phrase attributes and features they can refer to are similar, to some extent, to what is done in certain other systems (see for example \[Nagao 86\]), it seems that action rules are more-flexible and powerful than static form-oriented structures. In fact, each statement in the set of rules which comprises a given lexical entry defines a correspondence between a syntactic environment (with semantic decoration) in the source language and a translation into the target language.</Paragraph> <Paragraph position="11"> Consider, for example, one of the four rules which comprise the entry for the verb &quot;avoid&quot; in the English-ilebrew lexicon. This particular rule identifies the case where the current verb node (CN) has a postmodifier noun phrase and the head noun of that noun phrase has the semantic attribute Animate (e.g.</Paragraph> <Paragraph position="12"> &quot;She avoids her old father.&quot;). In that case the Hebrew translation should be &quot;r~ pr3rEn~&quot;. In terms of the lexicon specification language and the attributes used by the PEG parser, the rule looks as follows: If ( CN has Postbrother ((CAT NP)(ANIMATE +)) ) Then < Put (HEB &quot;13 p~nnn&quot; );>; The put instruction attaches the tlebrew translation as a feature on the current node (CN is the default node assumed when no explicit node name is specified for put).</Paragraph> <Paragraph position="13"> Certain lexical rules can be simply and elegantly formulated in terms of functions which identify grammatical relations of various kinds. Thus, for example, the rule for translating the verb &quot;return&quot; can be specified in terms of the feature Passive, and the presence or absence of a direct object. The set of rules which follows makes the required distinction between the ergative (intransitive) sense and the transitive reading of the verb:</Paragraph> </Section> class="xml-element"></Paper>