File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/92/c92-4170_intro.xml
Size: 17,323 bytes
Last Modified: 2025-10-06 14:05:10
<?xml version="1.0" standalone="yes"?> <Paper uid="C92-4170"> <Title>THE ASSIGNMENT OF GRAMMATICAL RELATIONS IN NATURAL LANGUAGE PROCESSING</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1. Introduction </SectionTitle> <Paragraph position="0"> One of the main goals of an interpreter is to map the syntactic descriptions found in the sentence into the correct roles that the elements (described by the nominals) play in the situation at hand (described by the verb). For instance, we must be able to state that in 1) The cat ate the mouse the cat is the &quot;eater&quot; and the mouse is the &quot;eaten thing&quot;. Of course, if we only talk about roles and situations we miss some significant generalizations. In 2) The boy drank the water, if we say that the boy is the &quot;drinker&quot; and the water is the &quot;drunk thing&quot;, we disregard the evident similarity of the roles of &quot;eater&quot; and &quot;drinker&quot; in the two situations. The notion of deep case arises as the common ground underlying a number of &quot;apparently&quot; different roles. Upon this notion some frameworks, that stand at the core of semantic representation and natural language processing, are built (see \[Fillmore 68\], \[Bruce 75\] and ISomers 871).</Paragraph> <Paragraph position="1"> The hard task is to devise a mapping between the surface descriptions and these deep cases. The complexity of some syntactic phenomena, like passivization, subject and object raising, long distance dependencies, has led many researchers to pose an intermediate level between the linear string of words and the case system. The concept involved is that of &quot;grammatical relation&quot;, such as &quot;subject&quot;, &quot;direct object&quot;, &quot;indirect object&quot;. It is claimed, for example, that &quot;passivizatiou&quot; is universally (cross-linguistically) explained if one says that the &quot;object&quot; of an active sentence becomes the &quot;subject&quot; in the passive form, rather than by saying that the NP in the VP is moved to replace the NP in S (that is a direct mapping).</Paragraph> <Paragraph position="2"> In the latter case it is implicit that the partictdar language under examination has a Subject-Verb-Object structure (SVO), as it usually happens in configurational languages such as English. In the example 3a) Lo hanno visto gli amici di Piero (Him &tve seen the friends of Piero) 3b) E' stato visto dagli amici di Piero ((He) has been seen by l'iero's friends) the passive form does not obey tile law of direct mapping. The example is, however, easily accounted fbr by the relational theories. The passivization rule induces only changes of function: the SUBJ becomes the BYcomplement and the OBJ becomes the SUBJ.</Paragraph> <Paragraph position="3"> The importance of grammatical relations, taken as primitives for a universal grammar, is stated by a number of formalisms often collected under the label of Relational Grammar. The problem is to map the surface constituents into their correct roles. With languages as Italian, which stands in the middle between configurational and freely ordered languages \[Stock 891 some flexibility is required to accomplish this task. One possibility is to adopt 11 neutral syntactic structure, open to several alternatives in the interpretation process. The head & modifier approach seems to feature this kind of neutrality, and has effectively been used for dealing with free word order languages, like the Slavonic languages \[Sgall et al. 861 and Finnish \[Jappinen et al. 86\].</Paragraph> <Paragraph position="4"> The dependency formalism we have adopted is presented in \[Lesmo, Lombardo 91\]. An example is reported in fig.l, and concerns the sentence: 4) La ragazza ebe lavora al guardaroba fu persuasa da un cliente a comprare una enciclopedia (The girl who works at the wardrobe was persuaded by a customer to buy an encyclopedia).</Paragraph> <Paragraph position="5"> The daughter nodes that stand on the left of their head precede it in the linear order of the sentence, while daughter nodes on the right follow it. The arcs that link the nodes in the dependency tree are of three types: arcs of structural and logical dependency (D&S arcs, represented by bold arrows in the figure), arcs of only structural dependency (STR arcs, simple arrows in the figure), and arcs of only logical dependency (DEP arcs, dashed arrows in the figure). D&S arcs link two words that stand in a &quot;both structural and logical&quot; relation. STR and DEP split these two functions of arc: an STR individuatcs a purely superficial AcrEs DE COLING-92. NAN'~S, 23-28 ^o~r 1992 1 0 9 0 PROC. or: COLING-92, NANTF.S. AUG. 23-28, 1992</Paragraph> <Paragraph position="7"> cludes the grammatical relations that will be described below. The bold labels (e.g.</Paragraph> <Paragraph position="8"> agt, pat) refer to deep cases. The labels immediately below them refer to the initial stratum of grammatical relations. The lowest labels are the last stratum (surface relations). SUB-Goal stands for a GOAL relation expressed via a subordinate sentence.</Paragraph> <Paragraph position="9"> Cho-1 is a chomeur (see text), expressed in Itatian via a BY (DA) complement.</Paragraph> <Paragraph position="10"> attachment, DEP represents a deep dependency between two words that are structurally independent. DEP arcs enable us to represent long distance dependencies, the sharing of dependent nodes (i.e. multiple heads, see fig. 1) and to represent coordinative and comparative constructions without violating the adjacency principle \[Hudson 84\], that applies only to STR and D&S arcs l. An arc involving dependency (of DEP or D&S type) is labelled with the grammatical relation that exists between the two nodes that it links (the arrangement in strata is explained below).</Paragraph> <Paragraph position="11"> The goal of this paper is to show that the formalism of Relational Grammar can be integrated in a useful way in a gcneral NL interpreter, in particular if the surface stntctures are represented via the dependency formalism.</Paragraph> <Paragraph position="12"> The paper examines the problems associated with the use of RG in an interpretive (as opposed to generative) framework, where the phase of surface relation hypothesization is critical. The partial configurationality of Italian can be exploited as heuristic information aiding the interpreter in selecting the preferable initial 1 The adjacency principle intuitively states that a word B, that stands between the words A and C in the sentence, results in the santo position if we project Ihe related node.~ in the dependency tree onto a line.</Paragraph> <Paragraph position="13"> hypothesis. On tile contrary, the RG rules governing the nlappiug between strata airu at confirming the hypotheses: they are applied on the basis of the lexical and morphological infonnatioo associated with tile verb, where the lexicon provides tbe first stratum and possible constraints on nde applicability.</Paragraph> <Paragraph position="14"> 2. The assignment of grammatical relations We start this section by providing a short overview of the main ideas of RG. Such ideas are shared by many fornlalisms, bttt we. will mostly refer to the work described in \[Perlnlutter 83\] and \[l)erhuutter, Rosen 841, where it can be tound a comparison with other RG fonnalisms.</Paragraph> <Paragraph position="15"> Grammatical relations are arranged in a hierarchy and are usually referred to by numbers: 1, which is the highest, corresponds to SUBJECT, 2 to DIRECT OBJEC~I ', 3 to INDIRECT OBJECT. The key principle of RG is the promotion of relations to higher levels in the hierarchy. The passive Cml be described as a promotion of 2 to 1 (i.e. DIR-OBJ to SUB J), leaving the previous 1 element &quot;unemployed&quot;. The relation &quot;uuemployed&quot;, which is technically indicated by the corresponding French word AcrEs DE COLING-92, NANTES. 23-28 AO(n&quot; 1992 l 0 9 1 I)~OC. OF COLING-92, NAN'rES, AUG. 23 28. 1992 (a) (b) give Mary the book John give Mary the book John Fig.2 - The Relational Networks associated with the sentences &quot;John was given the book by Mary&quot; (a) and &quot;The book was given to John by Mary&quot; (b). chomeur, is assigned to an element that cannot be involved in any other promotion. Consider 5a) Mary gave the book to John, where Mary is the 1-element, the book is the 2-element and John is the 3-element. If we apply the rule for passivization described above, the book must be promoted to the 1 relation, Mary becomes a chomeur, while John is still the 3 element. The chomeur-1 element, i.e. a chomeur element that results from the &quot;unemploying&quot; of a l-element, assumes the surface form of a by-complement in English, thus yielding 5b) The book was given to John by Mary where Mary cannot be involved in any other promotion, because of its chomeur condition. A similar rule applies to double-accusative constructions, as shown in fig.2a.</Paragraph> <Paragraph position="16"> At the same level of promotional rules we can posit the lexical rules, that account for the determination of grammatical relations within subordinate untensed sentences, as in 4). Such information is stored within the lexical entry of the verb that governs the subordinate clause.</Paragraph> <Paragraph position="17"> For example, to promise forces the SUBJ of the subordinate clause to be the SUBJ element of the governing clause, as we can see in 6) Mary promised John to write him a letter, where the SUBJ of write is Mary, the same of promise. On the contrary, to persuade forces the SUBJ of the subordinate to be the OBJ element of the governing clause, as in 4).</Paragraph> <Paragraph position="18"> It must be noted that the lexical rules are related to the assignment of relations in the initial stratum, even if they are subsequently changed by promotional rules. For example, in 7) The girl was persuaded by a customer to buy an encyclopedia, the girl is the element which is still shared by the two clauses, even if it is the SUBJ now.</Paragraph> <Paragraph position="19"> The semantic interpretation process takes advantage of the functional analysis, i.e. the analysis in terms of grammatical functions: relational structures are easily mapped onto logical representations, because of the resemblance between a Relational Network and a Predicate-Argument structure: the initial stratum states which are the grammatical relations (actually the elements at the sentence level) that act as arguments of the predicate identified by P.</Paragraph> <Paragraph position="20"> From a computational point of view, syntactic and semantic clues must be taken into account, in order to map the grammatical relations onto the surface descriptions. The mapping is carried out incrementally, i.e. as soon as the nominal head of the complement is parsed: it is highly language-dependent, and considers features like inflectionality, configurationality and deep underlying structures. The mapping must also take into account the changes on the surface form that are induced by the rules on grammatical relations, discussed informally above 2.</Paragraph> <Paragraph position="21"> Unfortunately the mapping raises some difficulties. The bias of the rules at the RG level is of a generative kind. Rules start from the initial stratum, the one which is closer to the deep cases (arguments of the predicate), to produce the final stratum of the surface arrangement; on the contrary, in an interpreter of language, the task is to trace what rules have been actually applied (and in what order) to the initial stratum (and the subsequent strata) in order to achieve the surface realization of the grammatical relations. Useful heuristics are devised to identify the surface clues that evidence tile application of a particular nile. For example, a passivization is accompanied by a passive form of the verb.</Paragraph> <Paragraph position="22"> Starting from the surface descriptions, the 2 Of course, the application of such rules involves also changes in focus. Two expressions that result to be derivable from each other (3a and 3b). according to phenomena that are explained in terms of grammatical relations, are therefore not strictly equivalent, even if both of them involve the same roles to be played by the individuals in the ground sentence (or, better, in the sentence that has been claimed to be ground). Acids DE COLING-92, NANTES. 23-28 AO~r 1992 1 0 9 2 PRec. OF COLING-92. N^r,n'I~S. Auo. 23-28, 1992 interpreter is not always able to uniquely determine the assignment of grammatical relations on the basis of syntactic features.</Paragraph> <Paragraph position="23"> Only semantics allows a hearer to realize that in both cases the &quot;drinker&quot; is John and not the wine. Hence a flexible interaction between syntactic and semantic information (selectioual restrictions) must be devised.</Paragraph> <Paragraph position="24"> Since in our system the analysis is incremental, in the sense that parsing and interpretation are synchronous processes, as soon as the dependency tree is extended with a head of a substructure the semantic interpreter is triggered to interpret it: in the case of verbs, all the complements that precede the verb are interpreted when the verb is found, while each complement that follows the verb is interpreted as soon as it is attached to it.</Paragraph> <Paragraph position="25"> The association of the grammatical relations with the descriptions in the sentence is accomplished by the rules at the relational level (GR rules), which are divided into three groups: the first of them deals with the initial proposal of relations based on syntactic features (Proposal Rules - PR), the second concerns the movement across the strata (Stratal Rules ~ SR), and the third, of a lexical kind (then Lexical Rules - LR), accounts for the sharing of relations in unteused subordinate clauses, as in 4. The verbal lexical entry contains, among other information, its initial stratum of grammatical relations. Once the verb has been found, the GR rules are triggered iu order to find out the roles that are played by the elements that precede the verb in the input sentence (incremental interpretation). It is the actual input that determines which of the three groups must be applied. For example, if we have a single active sentence (without subordinate embedded clauses or passive forms), the Proposal Rules are triggered. SR and LR rules are activated only in presence of special features: a passive form (was eaten), for instance, activates the Passivization rule (belonging to SR), if we have the pair <SUBJ,OBJ> in the current stratum, while lexical rules are associated with verbs that govern subordinate clauses (e.g. t o persuade). The result of the application of one or more rules is the final stratum, against which the assignment of relations guessed by the PR group is nmtched.</Paragraph> <Paragraph position="26"> In the PR, the first feature that is taken into account is the syntactic form of the participants. SUBJ and DIR-OBJ require that the corresponding nominals are not preceded by a preposition, and prououns be inflected appropriately. For example, in 3a the pronoun I0 features an accusative case, thus a DIR-OBJ.</Paragraph> <Paragraph position="27"> If two nominal descriptions are not inflected and, hence, they cannot be associated with a particular relation via this marking, as in 9) 11 gatto mangio' ii topo (The cat ate the n~)use), the position of the nominals can be useful, since, in a partially configuratioual language such as Italian, grammatical relations are usually connected with the canonical positions of the SVO order: with a transitive verb, the SUBJ precedes the verb and OBJ follows it; witll iutransitive verbs, the position of the nominal without a preposition does not affect the grammatical relation assigned to, since it will be surely the SUBJ wherever it is. If the order too does not give an unambiguous assignment of grammatical relations, the last resources are the number agreement for the SUBJ relation and the semantic check. In situations such as 10) Le ragazze Giorgio le ha viste (The girls Giorgio \[them\] has seen) even if the nominal descriptions staud on the same side of a transitive verb, the latter agrees only with Giorgio in number. On the contrary, only semantics can solve a situation as 8; moreover, the semantic check can also reject an assignment made on the basis of the syntactic features that we have described. Consider, for example, the sentence 11) Un snsso calcio' il vitello (A rock kicked the calf) Even if the order rules assign an sasso the SUBJ relation and il vitello the OBJ, such an assignment is rejected on the semantic ground.</Paragraph> <Paragraph position="28"> Notwithstanding a system that works correctly cannot be based only on semantics, since a sentence like il sounds really strange to a native speaker, if we are not in a particular focussing situation.</Paragraph> </Section> class="xml-element"></Paper>