File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/98/p98-1025_metho.xml
Size: 19,119 bytes
Last Modified: 2025-10-06 14:14:55
<?xml version="1.0" standalone="yes"?> <Paper uid="P98-1025"> <Title>Deriving the Predicate-Argument Structure for a Free Word Order Language *</Title> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> > Theme </SectionTitle> <Paragraph position="0"> &quot; Thanks to Mark Steedman for discussion and material, and to the anonymous reviewer of an extended version whose comments led to significant revisions. This research is supported by TUBITAK (EEEAG-90) and NATO Science Division (TULANGUAGE). null whereas LFG accounts make use of the following (Bresnan and Kanerva, 1989): Agent > Beneficiary > Goal / Experiencer > Inst</Paragraph> </Section> <Section position="5" start_page="0" end_page="167" type="metho"> <SectionTitle> > Patient/Theme > Locative. </SectionTitle> <Paragraph position="0"> As an illustration, the predicate-argument structures of the agentive verb murder and the psychological verb fear are (Grimshaw, 1990, p.8): To abstract away from language-particular case systems and mapping of thematic roles to grammatical functions, I assume the Applicative Hierarchy of Shaumyan (1987) for the definition of prominence:</Paragraph> <Paragraph position="2"> Primacy of a term over another is defined by the former having a wider range of syntactic features than the latter. In an accusative language, subjects are less marked (hence primary) than objects; all verbs take subjects but only transitive verbs take objects.</Paragraph> <Paragraph position="3"> Terms (=arguments) can be denoted by the genotype indices on NPs, such as NP1, NP2 for primary and secondary termsJ An NP2 would be a direct object (NPacc) in an accusative language, or an ergativemarked NP (NPerg) in an ergative language. This level of description also simplifies the formulation of grammatical function changing; the primary term of a passivized predicate (PASS p) is the secondary term of the active p. I follow Shaumyan and Steedman (1996) also in the ordered representation of the PAS (1). The reader is referred to (Shaumyan, 1987) for linguistic justification of this ordering.</Paragraph> <Paragraph position="4"> (1) Pred... <Sec. Term> <Primary Term> Given this representation, the surface order of t Shaumyan uses T 1 , T 2, but we prefer NPI, NP2 for easier exposition in later formulations.</Paragraph> <Paragraph position="5"> constituents is often in conflict with the order in the PAS. For instance, English as a configurational SVO language has the mapping:</Paragraph> </Section> <Section position="6" start_page="167" end_page="169" type="metho"> <SectionTitle> (2) SS: S~O PAS: ~ NP2~&quot;&quot;..~P1 </SectionTitle> <Paragraph position="0"> However, in a non-configurational language, permutations of word order are possible, and grammatical functions are often indicated not by configurations but by case marking. For instance, in Turkish, all six permutations of the basic SOV order are possible, and Japanese allows two verb-final permutations of underlying SOV. The relationship between case marking and scrambling is crucial in languages with flexible word order. A computational solution to the problem must rely on some principles of parsimony for representing categories and types of arguments and predicates, and efficiency of processing. null In a categorial formulation, grammatical functions of preverbal and postverbal NPs in (2) can be made explicit by type shifting 2 the subject to S/(S\NP1) and the object to (S\NP1)\((S\NP1)/NP2). These categories follow from the order-preserving type shifting scheme (Dowty, 1988): (3) NP ~ T/(T~NP) or TVT/NP) To resolve the opposition between surface order and the PAS in a free word order language, one can let the type shifted categories of terms proliferate, or reformulate CCG in such a way that arguments of the verbs are sets, rather than lists whose arguments are made available one at a time. The former alternative makes the spurious ambiguity problem of CG parsing (Karttunen, 1989) even more severe. Multi-set CCG (Hoffman, 1995) is an example of the setoriented approach. It is known to be computationally tractable but less efficient than the polynomial time CCG algorithm of Vijay-Shanker and Weir (1993). I try to show in this paper that the traditional curried notation of CG with type shifting can be maintained to account for Surface Form+-~PAS mapping without leading to proliferation of argument categories or to spurious ambiguity.</Paragraph> <Paragraph position="1"> Categorial framework is particularly suited for this mapping due to its lexicalism. Grammatical functions of the nouns in the lexicon are assigned 2aka. type raising, lifting, or type change by case markers, which are also in the lexicon.</Paragraph> <Paragraph position="2"> Thus, grammatical function marking follows naturally from the general CCG schema comprising rules of application (A) and composition (B). The functor-argument distinction in CG helps to model prominence relations without extra levels of representation. CCG schema (Steedman (1988; 1990)) is summarized in (4). Combinator notation is preferred here because they are the formal primitives operating on the PAS (cf. (Curry and Feys, 1958) for Combinatory Logic). Application is the only primitive of the combinatory system; it is indicated by juxtaposition in the examples and denoted by * in the normal order evaluator (SS4). B has the reduction rule B f ga>_f (ga).</Paragraph> <Paragraph position="3"> (4) X/Y: f Y: a =~A> X: fa Y: a X\ Y: f ==C/'A< X: f a x/Y: f r/z: g :----.8> x/z: Bfg Y z:a x r:.f x z: Bfg x/Y: y rxz:9 x\z: Big v/z: g XkV: y Sx< x/z: Byg 2 Grammatical Functions, Type Shifting,</Paragraph> <Section position="1" start_page="167" end_page="169" type="sub_section"> <SectionTitle> and Composition </SectionTitle> <Paragraph position="0"> In order to derive all permutations of a ditransitive construction in Turkish using (3), the dativemarked indirect object (NP3) must be type shifted in 48 (4!2) different ways so that coordination with the left-adjacent and the right-adjacent constituent is possible. This is due to the fact that the result category T is always a conjoinable type, and the argument category T/NP3 (and T~NP3) must be allowed to compose with the result category of the adjacent functor. However, categories of arguments can be made more informative about grammatical functions and word order. The basic principle is as follows: The category assigned for argument n must contain all and only the term information about NPi for all i < n. An NP2 type must contain in its category word order information about NP1 and NP2 but not NP3. This can be generalized as in (5): Ta = Lexical category of an NPngoverning element (e.g., a verb) in the language whose highest genotype argument is NPn.</Paragraph> <Paragraph position="1"> Tr = The category obtained from Ta by removing NPn.</Paragraph> <Paragraph position="2"> Case markers in Turkish are suffixes attached to noun groups. 3 The types of case markers in the lexicon can be defined as: (6) Lexical type assignment for the case marker (-case) encoding argument n: -case: = C(n): T(C(n) )x\N: x where T(C) denotes the semantic type for category C: (7) a. T(NPn) = I (lower type for NPn) b. T(C) = T (if C is a type shifted category as in (3)) c. T(C) = BBT (if C is a type shifted and composed category) (5) and (6) are schemas that yield three lexical categories per -case: one for lower type, and two for higher types which differ only in the directionality of the main function due to (5). For instance, for the accusative case suffix encoding NP2, we have:</Paragraph> <Paragraph position="4"> Type shifting alone is too constraining if the verbs take their arguments in an order different from the Applicative Hierarchy (SS 1). For instance, the category of Turkish ditransitives is SINPIlNP31NP2.</Paragraph> <Paragraph position="5"> Thus the verb has the wrapping semantics Cv' where C is the permutator with the reduction rule Cfga>fag. Type shifting an NP3 yields (SINP1 INP2)/(SINP1 \]NP~ INP3) in which the argument category is not lexically licensed. (5) is order-preserving in a language-particular way; the result category always corresponds to a lexical category in the language if the argument category does too.</Paragraph> <Paragraph position="6"> For arguments requiring a non-canonical order, we need type shifting and composition (hence the third clause in (7)): 3As suggested in (Bozsahin and Gocmen, 1995), morphological and syntactic composition can be distinguished by associating several attachment calculi with functors and arguments (e.g., affixation, concatenation, clitics, etc,)</Paragraph> <Paragraph position="8"> Once syntactic category of the argument is fixed, its semantics is uniquely determined by (7).</Paragraph> <Paragraph position="9"> The combinatory primitives operating on the PAS are I (7a), T (7b--c), and B (7c). T has the reduction rule Tar>f a, and If>f. The use ofT or B signifies that the term's category is a functor; its correct place in the PAS is yet to be determined. I indicates that the term is in the right place in the partially derived PAS.</Paragraph> <Paragraph position="10"> According to (5), there is a unique resultargument combination for a higher type NP3, compared to 24 using (3). (5) differs from (3) in another significant aspect: Tr and Ta may contain directionally underspecified categories if licensed by the lexicon. Directional underspecification is needed when arguments of a verb can scramble to either side of the verb. It is necessary in Turkish and Warlpiri but not in Japanese or Korean. The neutral slash I is a lexical operator; it is instantiated to either \ or / during parsing. A crucial use of underspecification is shown in (8). SV composition could not follow through if the verbs had backward-looking categories; composition of the type shifted subject and the verb in this case would only yield a backward-looking S\NP2 by the schema (4).</Paragraph> <Paragraph position="11"> The schema in (5) makes the arguments available in higher types, and allows lower (NPn) types only if higher types fail (as in NP2 in (8)). There are two reasons for this: Higher types carry more information about surface order of the language, and they are sufficient to cover bounded phenomena. SS3 shows how higher types correctly derive the PAS in various word orders. Lower types are indispensable for unbounded constructions such as relativization and coordination. The choice is due to a concern for economy. If lower types were allowed freely, they would yield the correct PAS as well: In parsing this is achieved as follows: An NPk can only be the argument in a rule of application, and schema (5) is the only way to obtain NPk from a noun group. Thus it suffices to check in the application rules that if the argument category is NPk, then the functor's result category (e.g., X in X/Y) has none of the terms with genotype indices lower than k. NP2 in (8) is licensed because the adjacent functor is S/NP2. NP2 in (9) is not licensed because the adjacent functor has NP1.</Paragraph> <Paragraph position="12"> For noun-governed grammatical functions such as the genitive (NPs), (5) licenses result categories that are underspecified with respect to the genotype index. This is indeed necessary because the resulting NP can be further inflected on case and assume a genotype index. For Turkish, the type shifted category is C(5) =NPagr/(NPagr\NPs).</Paragraph> <Paragraph position="13"> Hence the genitive suffix bears the category C(5)\N. Agreement features enforce the possessorpossessed agreement on person and number via unification (as in UCG (Calder et al., 1988)): kalem -in uc -u pencil -GEN.3s tip -POSS.3s N.'--ff C(5)\N: T N:t ---~ (Ne=g~\NPs>\N:p.oss A NPag~/(NP~g~\NPs):~p 5 NP~,~\NPs:posst' A> NPo:: :rp' Cposst' )>_(posst' )p' 'The tip of the pencil'</Paragraph> </Section> </Section> <Section position="7" start_page="169" end_page="170" type="metho"> <SectionTitle> 3 Word Order and Scrambling </SectionTitle> <Paragraph position="0"> Due to space limitations, the following abbreviated categories are employed in derivations:</Paragraph> <Paragraph position="2"> The categories licensed by (5) can then be written as IV/TV and IV~TV for NP2, TV/DV and TV~DV for NP3, etc. (10a-b) show the verb-final variations in the word order. The bracketings in the PAS and juxtaposition are left-associative; (fa)b is same as lab.</Paragraph> <Paragraph position="3"> b. kitab-t Mehmet oku-du IV/TV: Tb' S\IV: Tm ~ TV: r I &quot;Bx< S/TV: B(Tm' )(Tb' ) S: B(Tm' )(Tb' )r' >r' b' m A~&quot; (10a) exhibits spurious ambiguity. Forward composition of S/IV and IV/TV is possible, yielding exactly the same PAS. This problem is resolved by grammar rewriting in the sense proposed by Eisner 4 (1996). Grammar rewriting can be done using predictive combinators (Wittenburg, 1987), but they cannot handle crossing compositions that are essential to our method. Other normal form parsers, e.g. that of Hepple and Morrill (1989), have the same problem. All grammar rules in (4) in fact check the labels of the constituent categories, which show how the category is derived. The labels are as in (Eisner, 1996). -FC: Output of forward composition, of which forward crossing composition is a special case. -BC: Output of backward composition, of which backward crossing composition is a special case. -OT: Lexical or type shifted category. The goal is to block e.g., X/Y-FC Y/Z-{FC, BC, OT} =~B> X/Z and X/Y-FC Y-{FC, BC, OT} =~A> X in (10a). S/TV composition would have the label -FC, which cannot be an input to forward application. In (10b), the backward composition follows through since it has the category-label S/TV-BC, which the forward application rule does not block. We use Eisner's method to rewrite all rules in (4).</Paragraph> <Paragraph position="4"> (llab) show the normal form parses for post-verbal scrambling, and (llc-d) for verb-medial cases.</Paragraph> <Paragraph position="5"> 4Eisner (1996, p.81) in fact suggested that the labeling system can be implemented in the grammar by templates, or in the processor by labeling the chart entries.</Paragraph> <Paragraph position="6"> e.g., having both (and only) IV/TV and IV~TV licensed by the lexicon for an NP2, does not lead to alternative derivations in (10-11). Assume that A/B B\ C, where A /B and B\ C are categories produced by (5), gives a successful parse using the output A\C. A\B B\C and A\B B/C are not composable types according to (4). The other possible configuration, A/B B/C, yields an A/C which looks for C in the other direction. Multiple derivations appear to be possible if there is an order-changing composition over C, such as C/C (e.g., a VP modifier IV/IV). (12) shows two possible configurations with a C on the right. (12b) is blocked by label check because A/C-FC C =~A> A is not licensed by the grammar. If C were to the left, only (12a) would succeed. Similar reasoning can be used to show the uniqueness of derivation in other patterns of directions. null Constrained type shifting avoids the problem with freely available categories in Eisner's normal form parsing scheme. However, some surface characteristics of the language, such as lack of case marking in certain constructions, puts the burden of type shifting on the processor (Bozsahin, 1997). Lower type arguments such as NP2 pose a different kind of ambiguity problem. Although they are required in unbounded constructions, they may yield alternative derivations of local scrambling cases in a labelled CCG. For instance, when NP2 is peripheral in a ditransitive construction and the verb can form a constituent with all the other arguments (S\NP2 or S/NP2), the parser allows NP2. This is unavoidable unless the parser is made aware of the local and non-local context. In other words, this method solves the spurious ambiguity problem between higher types, but not among higher and lower types. One can try to remedy this problem by making the availability of types dependent on some measures of prominence, e.g., allowing subjects only in higher types to account for subject-complement asymmetries. But, as pointed out by Eisner (1996, p.85), this is not spurious ambiguity in the technical sense, just multiple derivations due to alternative lexical category assignments. Eliminating ambiguity in such cases remains to be solved.</Paragraph> </Section> <Section position="8" start_page="170" end_page="170" type="metho"> <SectionTitle> 4 Revealing the PAS </SectionTitle> <Paragraph position="0"> The output of the parser is a combinatory form. The combinators in this form may arise from the CCG schema, i.e., the compositor B, and the substitutor S (Steedman, 1987). They may also be projected from the PAS of a lexical item, such as the duplicator W (with the reduction rule Wfa>faa) for re- n+l flexives, and B C for predicate composition with the causative suffix. For instance, the combinatory form for (13a) is the expression (13b).</Paragraph> <Paragraph position="1"> Although B works in a binary manner in CCG to achieve abstraction, it requires 3 arguments for full evaluation (its order is 3). Revealing the PAS amounts to stripping off all combinators from the combinatory form by evaluating the reducible expressions (redexes). Bfg is not a redex but Bfga is. In other words, the derivations by the parser must saturate the combinators in order to reveal the PAS, which should contain no combinators. PAS is the semantic normal form of a derivation.</Paragraph> <Paragraph position="2"> The sequence of evaluation is the normal order, which corresponds to reducing the leftmostoutermost redex first (Peyton Jones, 1987). In treetheoretic terms, this is depth-first reduction of the combinator tree in which the rearrangement is controlled by the reduction rule of the leftmost combinator, e.g., Tin' X>_Xm' where X is the parenthesized subexpression in (13b). Reduction by T</Paragraph> <Paragraph position="4"> By the second Church-Rosser theorem, normal order evaluation will terminate if the combinatory form has a normal form. But Combinatory Logic has the same power as A-calculus, and suffers from the same undecidability results. For instance, WWW has no normal form because the reductions will never terminate. Some terminating reductions, such as Glib>N, has no normal form either. It is an open question as to whether such forms can be projected from a natural language lexicon. In an expression X.Y where X is not a redex, the evaluator recursively evaluates to reduce as much as possible because X may contain other redexes, as in (5) above. Recursion is terminated either by obtaining the normal form, as in (6) above, or by equivalence check. For instance, (G.(I.a).b).Y recurses on the left subexpression to obtain (G,a-b) then gives up on this subexpression since the evaluator returns the same expression without further evaluation.</Paragraph> </Section> class="xml-element"></Paper>