XML Viewer - p89-1034

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/89/p89-1034_metho.xml
Size: 28,674 bytes
Last Modified: 2025-10-06 14:12:23
<?xml version="1.0" standalone="yes"?>
<Paper uid="P89-1034">
  <Title>EFFICIENT PARSING FOR FRENCH*</Title>
  <Section position="4" start_page="0" end_page="280" type="metho">
    <SectionTitle>
1. LINGUISTIC THEORIES AND
WORD ORDER
</SectionTitle>
    <Paragraph position="0"> Word order remains a pervasive issue for most linguistic analyses. Among the theories most closely related to FG, Unification Categorial Grammar (UCG : Zeevat et al. 1987), Combinatory Categorial Grammar (CCG : Steedman 1985, Steedman 1988), Categorial Unification Grammar (CUG : Karttunen 1986) and Head-driven Phrase Structure Grammar (I-IPSG: Pollard &amp; Sag 1988) all present inconveniences in their way of dealing with word order as regards parsing efficiency and/or linguistic data.</Paragraph>
    <Paragraph position="1"> * The workreported here was carried outin the ESPR/T Project 393 ACORD, ,,The Construction and Interrogation of Knowledge Bases using Natural Language Text and Graphics~.</Paragraph>
    <Paragraph position="2">  In UCG and in CCG, the verb typically encodes the notion of a canonical ordering of the verb arguments. Word order variations are then handled by resorting to lexical ambiguity and jump rules ~ (UCG) or to new combinators (CCG). As a result, the number of lexical and/or phrasal edges increases rapidly thus affecting parsing efficiency. Moreover, empirical evidence does not support the notion of a canonical order for French (cf. B~s &amp; Gardent 1989).</Paragraph>
    <Paragraph position="3"> In contrast, CUG, GPSG (Gazdar et al. 1985) and HPSG do not assume any canonical order and subcategorisation information is dissociated from surface word order. Constraints on word order are enforced by features and graph unification (CUG) or by Linear Precedence (LP) statements (HPSG, GPSG). The problems with CUG are that on the computational side, graph-unification is costly and less efficient in a Prolog environment than term unification while from the linguistic point of view (a) NP's must be assumed unambiguous with respect to case which is not true for - at least - French and (b) clitic doubling cannot be accounted for as a result of using graph unification between the argument feature structure and the functor syntax value-set. In HPSG and GPSG (cf. also Uszkoreit 1987), the problem is that somehow, LP statements must be made to interact with the corresponding rule schemas. That is, either rule schemas and LP statements are precompiled before parsing and the number of rules increases rapidly or LP statements are checked on the fly during parsing thus slowing down processing. null</Paragraph>
  </Section>
  <Section position="5" start_page="280" end_page="281" type="metho">
    <SectionTitle>
2. THE GRAMMAR
</SectionTitle>
    <Paragraph position="0"> The formal characteristics of FG underlying the parsing heuristic are presented in SS4. The characteristics of FG necessary to understand the grammar are resumed here (see (B~s &amp; Gardent 89) for a more detailed presentation).</Paragraph>
    <Paragraph position="1"> t Ajumpmle of the form X/Y, YfZ ---~ X/Z where X/Yis atype raised NP and Y/Z is a verb.</Paragraph>
    <Paragraph position="2"> FG accounts for French linearity phenomena, embedded sentences and unbounded dependencies. It is derived from UCG and conserves most of the basic characteristics of the model : monostratality, lexicalism, unification-based formalism and binary combinatory rules restricted to adjacent signs. Furthermore, FG, as UCG, analyses NP's as type-raised categories.</Paragraph>
    <Paragraph position="3"> FG departs from UCG in that (i) linguistic entities such as verbs and nouns, sub-categorize for a set - rather than a list-of valencies ; (ii) a feature system is introduced which embodies the interaction of the different elements conditioning word order ; (iii) FG semantics, though derived directly from InL ~, leave the scope of seeping operators undefined.</Paragraph>
    <Paragraph position="4"> The FG sign presents four types of information relevant to the discussion of this paper : (a) Category, Co) Valency set ; (c) Features ; (d) Semantics. Only two combinatory rules-forward and backward concatenation - are used, together with a deletion rule. A Category can be basic or complex. A basic category is of the form Head, where Head is an atomic symbol (n(oun), np or s(entence)). Complex categories are of the form C/Sign, where C is either atomic or complex, and Sign is a sign called the active sign.</Paragraph>
    <Paragraph position="5"> With regard to the Category information, the FG typology of signs is reduced to the following.</Paragraph>
    <Paragraph position="6">  Thus, the result of the concatenation of a NP (fl) with a verb (f0) is a verbal sign (f0). Wrt the concatenation rules, f0 signs are arguments; fl signs are either functors of f0 signs, or arguments of f2 signs. Signs of type 1&amp;quot;2 are leaves and fanctors.</Paragraph>
    <Paragraph position="7"> Valencies in the Valency Set are signs which express sub-categorisation. The semantics ofa fO sign is a predicate with an argumental list. Variables shared by the semantics of each valency and by the predicate list, relate the semantics of the valency with the semantics of the predicate. Nouns and verbs sub-categorize not only for &amp;quot;normal&amp;quot; valencies such as nom(inative), dat(ive), etc, but also for a mod(ifier) valency, which is consumed and recursively reintroduced by modifiers (adjectives, laP's and adverbs). Thus, in FG the com: In/. (Indexed language) is the semantics incorporated to UCG ; it derives from Kamp's DRT. From hereafter werefer to FG semantics as InL'.</Paragraph>
    <Paragraph position="8">  plete combinatorial potential of a predicate is incorporated into its valency set and a unified treatment of nominal and verbal modifiers is proposed. The active sign of a fl functor indicates the valency - ff any which the functor consumes.</Paragraph>
    <Paragraph position="9"> No order value (or directional slash) is associated with valencies. Instead, Features express adjacent and non-adjacent constraints on constituent ordering, which are enforced by the unification-based combinatory rules. Constraints can be stated not only between the active sign of a functor and its argument, but also between a valency, of a sign., the sign. and the active J J . J sign of the fl functor consuming valency~ while concatenating with sign~ As a result, the valency of a verb or era noun imposes constraints not only on the functor which consumes it, but also on subsequent concatenations. The feature percolation system underlies the partial associativity property of the grammar (cf. SS4). As mentioned above, the Semanticspart of the sign contains an InL' formula. In FG different derivations of a string may yield sentence signs whose InL' formulae are formally different, in that the order of their sub-formulae are different, but the set of their sub-formulae are equal. Furthermore, sub-formulae are so built that formulae differing in the ordering of their sub-formulae can in principle be translated to a semantically equivalent representation in a first order predicate logic. This is because : (i) in InL', the scope of seeping operators is left undefined ; (ii) shared variables express the relation between determiner and restrictor, and between seeping operators and their semantic arguments ; (iii) the grammar places constants (i.e.</Paragraph>
    <Paragraph position="10"> proper names) in the specified place of the argumental list of the predicate. For instance, FG associates to (2) the InL' formulae in (3a) and (3b) :  (2) Un garcon pr~sente Marie ~ une fille (3) (a) \[15\] \[indCX) &amp; garcon(X) &amp; ind(Y) &amp; fiRe(Y) &amp; presenter (E,X,marie,Y)\] Co) \[E\] \[indCO &amp; fille(Y) &amp; ind(X) &amp; gar~on(X) &amp; presenter (E,X,marie,Y)\]  While a seeping operator of a sentence constituent is related to its argument by the index of a noun (as in the above (3)), the relation between the argument of a seeping operator and the verbal unit is expressed by the index of the verb. For instance, the negative version of (2) will incorporate the sub-formula neg (E).</Paragraph>
    <Paragraph position="11"> In InL' formulae, determiners (which are leaves and f2 signs, el. above), immediately precede their restrictors. In formally different InL' formulae, only the ordering of seeping operators sub-formulae can differ, but this can be shown to be irrelevant with regard to the semantics. In French, scope ambiguity is the same for members of each of the following pairs, while the ordering of their corresponding semantic sub-formulae, thanks to concatenation of adjacent signs, is ines- null capably different.</Paragraph>
    <Paragraph position="12"> (4) (a) Jacques avait donn6 un livre (a) ~ tousles dtudiants ( b ).</Paragraph>
    <Paragraph position="13"> (a) Jacques avait donn6 d tousles dtudiants(b) un livre (a).</Paragraph>
    <Paragraph position="14"> (b) Un livre a 6t~ command6 par chaque ~tudiant (a) dune librairie (b).</Paragraph>
    <Paragraph position="15"> Co') Un livre a6t6 command6d une librairie (b)par chaque dtudiant (a).</Paragraph>
    <Paragraph position="16">  At the grammatical level (i.e. leaving aside pragmatic considerations),the translation of an InL' formula to a scoped logical formula can be determined by the specific scoping operator involved (indicated in the sub-formula) and by its relation to its semantic argument (indicated by shared variables). This translation must introduce the adequate quantifiers, determine their scope and interpret the'&amp;' separator as either ^ or --&gt;, as well as introduce .1. in negative forms. For instahoe, the InL' formulae in (Y) translate ~ to : (5) 3E, 3X, 3Y (garqon(X)^ fille(Y) ^ pr6senter (E,X~narie,Y)).</Paragraph>
    <Paragraph position="17"> We assume here the possibility of this translation without saying any more on it. Since this translation procedure cannot be defined on the basis of the order of the sub-formulae corresponding to the scoping operators, InL' formulae which differ only wrt the order of their sub-formulae are said to be semantically equivalent. null</Paragraph>
  </Section>
  <Section position="6" start_page="281" end_page="284" type="metho">
    <SectionTitle>
3. THE PARSER
</SectionTitle>
    <Paragraph position="0"> Because the subcategorisation information is represented as a set rather than as a list, there is no constraint on the order in which each valency is consumed. This raises a problem with respect to parsing which is that for any triplet X,Y,Z where Y is a verb and X and Z are arguments to this verb, there will often be two possible derivations i.e., (XY)Z and xo'z).</Paragraph>
    <Paragraph position="1"> The problem of spurious parses is a well-known one in extensions of pure categorial grammar. It derives either from using other rules or combinators for derivation than just functional application (Pareschi and Steedman 1987, Wittenburg 1987, Moortgat 1987, Morrill 1988) or from having anordered set valencies (Karttunen 1986), the latter case being that of FG.</Paragraph>
    <Paragraph position="2"> Various solutions have been proposed in relation to this problem. Karttunen's solution is to check that for any potential edge, no equivalent analysis is already In (5) 3E can be paraphrased as &amp;quot;There exists an event&amp;quot;.  stored in the chart for the same string of words. However as explained above, two semantically equivalent formulae of InL' need not be syntactically identical.</Paragraph>
    <Paragraph position="3"> Reducing two formulae to a normal form to check their equivalence or alternatively reducing one to the other might require 2* permutations with n the number of predicates occaring in the formulae. Given that the test must occur each time that two edges stretch over the same region and given that itrequires exponential time, this solution was disguarded as computationaUy inefficient. null Pareschi's lazy parsing algorithm (Pareschi, 1987) has been shown (I-Iepple, 1987) to be incomplete.</Paragraph>
    <Paragraph position="4"> Wittenburg's predictive combinators avoid the parsing problem by advocating grammar compilation which is not our concern here. Morilrs proposal of defining equivalence classes on derivations cannot be transposed to FG since the equivalence class that would be of relevance to our problem i.e., ((X,Z)Y, X(ZY)) is not an equivalence class due to our analysis of modifiers.</Paragraph>
    <Paragraph position="5"> Finally, Moortgat's solution is not possible since it relies on the fact that the grammar is structurally complete ~ which FG is not.</Paragraph>
    <Paragraph position="6"> The solution we offer is to augment a shift-reduce parser with a heuristic whose essential content is that no same functor may consume twice the same valency.</Paragraph>
    <Paragraph position="7"> This ensures that for all semantically unambiguous sentences, only one parse is output. To ensure that a parse is always output whenever there is one, that is to ensure that the parser is complete, the heuristic only applies to a restricted set of edge pairs and the chart is organized as aqueue. Coupled with the parlial-associativity of FG, this strategy guarantees that the parser is complete (of. SS4).</Paragraph>
    <Section position="1" start_page="282" end_page="283" type="sub_section">
      <SectionTitle>
3.1 THE HEURISTIC
</SectionTitle>
      <Paragraph position="0"> The heuristic constrains the combination of edges in the following way 2.</Paragraph>
      <Paragraph position="1"> Let el be an edge stretching from $1 to E1 labelled with the typefl~, a predicate identifier pl and a sign Sign1, let e2 be an edge stretching from E1 to $2 labelled with type fl and a sign Sign,?, then e2 will reduce with el by consuming the valency Val of pl if e2 has not already reduced with an edge el'by consuming the valency Valofpl where el 'stretches from $1&amp;quot; to E1 and $1' ~ $1.</Paragraph>
      <Paragraph position="2"> In the rest of this section, examples illustrate how A structurally complete grammar is one such that : If a sequence of categories X I.. Xn reduces to Y, there is a red u~on to Y for any bracketing of Xl .. Ym into constituents (Moortgat, 19S7).</Paragraph>
      <Paragraph position="3"> 2 A mote complete difinition is given in the description of the parsing algorithm below.</Paragraph>
      <Paragraph position="4"> this heuristic eliminates spurious parses, while allowing for real ambiguities.</Paragraph>
      <Paragraph position="5"> Avoiding spurious parses Consider the derivation in (6)  (6) Jean aime Marie</Paragraph>
      <Paragraph position="7"> where Ed4 = Edl(Ed2,pl,subj) indicates that the edge Ed 1 reduces with Ed2 by consuming the subject valency of the edge Ed2 with predicate pl.</Paragraph>
      <Paragraph position="8"> Ed5 and EdlO are ruled out by the grammar since in French no lexical (as opposed to clirics and wh-NP) object NP may appear to the left of the verb. Ed9 is ruled out by the heuristic since Ed3 has already consumed the object valency of the predicate pl thus yielding Ed6. Note also that Edl may consume twice the subject valency ofpl thus yielding Ed4 and Ed8 since the heuristic does not apply to pairs of edges labelled with signs Of type fl and f0 respectively.</Paragraph>
      <Paragraph position="9"> Producing as many parses as there are readings The proviso that a functor edge cannot combine with two different edges by consuming twice the same valency on the same predicate ensures that PP attach- null ment ambiguities are preserved. Consider (7) for instance 1.</Paragraph>
      <Paragraph position="10"> (7) Regarde le chien darts la rue 0 --Edl --- 1 ---Ed2 - 2 - Ed3 .... 3 --- Ed4 ....... 4</Paragraph>
      <Paragraph position="12"> where pl and p2 are the predicate identifiers labelling the edges Edl and Ed3 respectively.</Paragraph>
      <Paragraph position="13"> The above heuristic allows a functor to concatenate twice by consuming two different valencies. This case t For the sake of clarity, all irelevant edges have been omitted. This practice will hold throughout the sequel.</Paragraph>
      <Paragraph position="14">  of real ambiguity is illustrated in (8).</Paragraph>
      <Paragraph position="15"> (8) Quel homme pr6sente Marie ~t Rose ?</Paragraph>
      <Paragraph position="17"> Thus, only edges of the same length correspond to two different readings. This is the reason why the heuristic allows a functor to consume twice the same valency on the same predicate iff it combines with two edges E andE' thatstretch over the same region. A case in point is illustrated in (9)  (9) Quel homme pr6sente Marie ~ Rose ?</Paragraph>
      <Paragraph position="19"> where a Rose concatenates twice by consuming twice the same - dative - valency of the same predicate.</Paragraph>
    </Section>
    <Section position="2" start_page="283" end_page="284" type="sub_section">
      <SectionTitle>
3.2 THE PARSING ALGORITHM
</SectionTitle>
      <Paragraph position="0"> The parser is a shift-reduce parser integrating a chart and augmented with the heuristic.</Paragraph>
      <Paragraph position="1"> An edge in the chart contains the following informarion : edge \[Name, Type, Heur, S,E, Sign\] where Name is the name of the edge, S and E identifies the startingand the ending vertex and Sign is the sign labelling the edge. Type and Heur contain the info'rmarion used by the heuristic. Type is either f0, fl and t2 while the content of Heur depends on the type of the edge and on whether or not the edge has already combined with some other edge(s).</Paragraph>
      <Paragraph position="2"> Heur f0 pX where X is an integer.</Paragraph>
      <Paragraph position="3"> pX identifies the predicate associated with any edge.</Paragraph>
      <Paragraph position="4"> type fO fl before combination : Vat where Var is the anonymous variable. This indicates that there is as yet no information available that could violate the heuristic.</Paragraph>
      <Paragraph position="5"> after combination : Heur-List where Heur-List is a list of triplets of the form \[Edge,pX.Val\] and Edge indicates an argument edge with which the functor edge has combined by consuming valency Val of the predicate pX labelling Edge.</Paragraph>
      <Paragraph position="7"> The basic parsing algorithm is that of a normal shift-reduce parser integrating a chart rather than a  stack i.e., 1. Starting from the beginning of the sentence, for each word W either shift or reduce, 2. Stop when there is no more word to shift and no more reduce to perfomi, 3. Accept or reject.</Paragraph>
      <Paragraph position="8">  Shifting a word W consists in adding to the chart as many lexical edges as there are lexical entries associated with W in the lexicon. Reducing an edge E consists in trying to reduce E with any adjacent edge E' already stored in the chart. The operation applies recursively in that whenever a new edge E&amp;quot; is created it is immediately added to the chart and tried for reduction. The order in which edges tried for reduction are retrieved from the chart corresponds to organising the chart as a queue i.e., f'n'st-in- ftrst-out. Step 3 consists in checking the chart for an edge stretching from the beginning to the end of the chart and labelled with a sign of category s(entence). If there is such an edge, the string is accepted - else it is rejected.</Paragraph>
      <Paragraph position="9"> The heuristic is integrated in the reduce procedure which can be defined as follows.</Paragraph>
      <Paragraph position="10"> Two edges Edge 1 and Edge2 will reduce to a new edge Edge3 iff - null Either (a) 1. Edgel = \[el,Typel,H1,E2,Signl\] and 2. Edge2 = \[e2,Type2,H2,E2,Sign2\] and &lt;Typel,Type2&gt; # &lt;f0,fl&gt; and 3. apply(Sign 1,Sign2,Sign3) and 4. Edge3 = \[e3,Type3,H3,E3,Sign3\] and &lt;$3,E3&gt; = &lt;S I,E2&gt; or (b) 1. Edgel = \[el,f0,pl,S1,E1,Signl\] and 2. Edge2 = \[e2,fl,I-I2,S2,E2,Sign2\] and E1 = $2 and 3. bapply(Signl,Sign2,Sign3) by consuming the valency Val and 4. H2 does not contain a triplet of the form \[el',pl,Val\] where Edge 1' = \[el',f0,pl,S'l,S2\] and S'I&amp;quot;-S1 5. Edge3 = \[e3,f0,pl,S1,E2,Sign3\] 6. The heuristic information H2 in Edge2 is updated to \[e 1,p 1,Val\]+I-I2  where '+ 'indicates list concatenation and under the proviso that the triplet does not already belong to H2. Where apply(Sign1 ,Sign2,Sign3) means that Sign 1 can combine with Sign2 to yield Sign3 by one of the two combinatory rules of FG and bapply indicates the backward combinatory rule.</Paragraph>
      <Paragraph position="11">  This algorithm is best illustrated by a short example. Consider for instance, the parsing of the sentence Pierre aime Marie. Stepl shifts Pierre thus adding Edgel to the chart. Because the grammar is designed to avoid spurious lexical ambiguity, only one edge is</Paragraph>
      <Paragraph position="13"> Since there is no adjacent edge with which Edgel could be reduced, the next word is shifted i.e., aime  the subject valency of pl thus yielding Edge7. However, the heuristic forbids Edge4 to consume the object valency of pl on Edge3 since Edge4 has already consumed the object valency of pl when combining with Edge2. In this way, the spurious parse Edge8 is avoided.</Paragraph>
      <Paragraph position="14"> The final chart is as follows.</Paragraph>
    </Section>
  </Section>
  <Section position="7" start_page="284" end_page="285" type="metho">
    <SectionTitle>
4. UNICITY AND COMPLETNESS
OF THE PARSING
DEFINITIONS
</SectionTitle>
    <Paragraph position="0"> 1. An indexed lexical f0 is a pair &lt;X,i&gt; where X is a lexical sign of f0 type (c.f. 2) and i is an integer.</Paragraph>
    <Paragraph position="1"> 2. PARSE denotes the free algebra recursively defined by the following conditions.</Paragraph>
    <Paragraph position="2"> 2.1 Every lexical sign of type fl or f2, and every indexed lexical f0 is a member of PARSE.</Paragraph>
    <Paragraph position="3"> 2.2 If P and Q are elements of PARSE, i is an integer, and k is a name of a valency then (P+aQ) is a member of PARSE.</Paragraph>
    <Paragraph position="4"> 2.3 If P and Q are elements of PARSE, (P+imQ) is a member of PARSE, where I~ is a new symbol} 3. For each member, P, of PARSE, the string of the leaves of P is defined recursively as usual : 3.1 If P is a lexical functor or a lexical indexed argument, L(P) is the string reduced to P.</Paragraph>
    <Paragraph position="5"> 3.2 L(P+~tQ) is the string obtained by concatenation of L(P) and L(Q).</Paragraph>
    <Paragraph position="6"> 4. A member P of PARSE, is called a well indexed parse (WP) if two indexed leaves which have different ranges in L(P), have different indicies.</Paragraph>
    <Paragraph position="7"> 5. The partial function, SO:'), from the set of WP to the set of signs, is defined recursively by the following conditions : 5.1 IfP is a leave S(P) = P 5.2 S(F+ikA) = Z \[resp. S(A+ikF) = Z\] (km ) If S (F) is a functor of type fl, S(A) is an argument and Z is the result sign by the FC rule \[resp. BC rule\] when S(F) consumes the valency named k in the leave of S(A) indexed by i.</Paragraph>
    <Paragraph position="8"> 5.3 S(P+ilnA ) = Z \[res. S(A+i~-&amp;quot; ) = Z\] if S(F) is a functor of type fl or f2, S(A) is an argument sign and Z is the result sign by the FC rule \[resp. BC rule\]. 6. For each pair of signs X and Y we denote X.=. Y if X  and Y are such that their non semantic parts are formally equal and their semantic part is semantically equivalent. null I In 2.3 the index i is just introduced for notational convenience and will not be used ; k,l.., will denote a valency name or the symbol m.  7. IfP and Q are WP P =Qiff 7.1 S(P) and S(Q) are defined 7.2 S(P) = S(Q) and 7.3 L(P) = L(Q) 8. A WP is called acceptedif it is accepted by the parser augmented with the heuristic described in SS3. THEOREM 1. (Unicity) IfP and Q are accepted WP's and ifP = Q, then P and Q are formally equal.</Paragraph>
    <Paragraph position="9"> 2. (Completeness) IfP is a WP which is accepted by the  grammar, and S(P) is a sign corresponding to a grammatical sentence, then there exists a WP Q such that : a) Q is accepted, and b)P =Q.</Paragraph>
  </Section>
  <Section position="8" start_page="285" end_page="285" type="metho">
    <SectionTitle>
NOTATIONAL CONVENTION
</SectionTitle>
    <Paragraph position="0"> F, F'...(resp. A,A',...) will denote WP's such that S(F), S(F')...are functors of type fl (resp. S(A), S(A') .... are arguments of type f0).</Paragraph>
    <Paragraph position="1"> The proof of the theorem is based on the following properties 1 to 3 of the grammar. Property 1 follows directly from the grammar itself (cf. SS2) ; the other two are strong conjectures which we expect to prove in a near future.</Paragraph>
    <Paragraph position="2"> PROPERTY 1 If S(K) is defined and L(K) is not a lexical leaf, then :</Paragraph>
    <Paragraph position="4"> For every i and k if F+i~A = F+ixA', or A+i~F -- A'+i~t.F then i= i', k = k', A--A' and F = F' PROPERTY 3 (Partial associativity) : For every F,A,F' such that L(F) L(A) L(F') is a sub-string of a string oflexical entries which is accepted by the grammar as a grammatical sentence, a) If S\[F+i~(A+aF)\] and S\[(F+ikA)+u F'\] are defined,</Paragraph>
  </Section>
  <Section position="9" start_page="285" end_page="285" type="metho">
    <SectionTitle>
PROOF OF THE PART 1 OF THE THEOREM
</SectionTitle>
    <Paragraph position="0"> Tile proof is by induction on the lengh, lg(P), of L(P). So we suppose a) and b) : a) (induction hypothesis). For every P' and Q' such that P' and Q' are accepted, if P' =_ Q', and lg(P') &lt; n, then P' =Q' b) P and Q are accepted, P = Q and lg(P) = n and we have to prove that</Paragraph>
    <Paragraph position="2"> By the Lemma 1 P't and Q't must be both functors or both arguments. And ifP'~ and Q'~ are functors (res.</Paragraph>
    <Paragraph position="3"> arguments) then P'2 and Q'2 are arguments (resp. functors). So by Property 2, we have : i = i', k = k', P'l -- Q't, and P'2 =- Q' 2 .</Paragraph>
    <Paragraph position="4"> Then the induction hypothesis implies that P't = Q't and that P'2 = Q'2&amp;quot; Thus we have proved that P = Q.</Paragraph>
  </Section>
  <Section position="10" start_page="285" end_page="286" type="metho">
    <SectionTitle>
PROOF OF THE PART 2 OF THE THEOREM
</SectionTitle>
    <Paragraph position="0"> Let P be a WP such that S(P) is define and cortes- null ponds to a grammatical sentence. We will prove, by induction on the lengh of L(K), that for all the subtrees K of P, there exists K' such that : a) K' is accepted, and b) K_=_K'.</Paragraph>
    <Paragraph position="1"> We consider the following cases (Property 1)  1. IfKis a leaf then K' = K 2. If K = F+tkA, then by the induction hypothesis there exist F' and A' such that : (i) F' and A' are accepted, and (ii) F_=_ F', A = A'.</Paragraph>
    <Paragraph position="2"> Then F'+A' is also accepted. So that K' can be choosed as F'+A'.</Paragraph>
    <Paragraph position="3"> 3. If K = A+ikF, we define F, A' as in (2) and we consider the following subcases : 3.1 If A' is a leaf or if A' = FI+jlA1 where S(AI+~ F') is not def'med, then A'+~F is accepted, and we can take it as K.</Paragraph>
    <Paragraph position="4"> 3.2 If A' = Al+ilF1, then by the Lemma 2 A'+~kF' is accepted. Thus we can define K' as A'+u F'. 3.3 IfA' = FI+nA1 and S(AI+~ F) is defined.  Let A2 = Al+ikF.</Paragraph>
    <Paragraph position="5"> By the Property 3 S(FI+jlA2) is defined and K = A'+tkF = FI+jlA2.</Paragraph>
    <Paragraph position="6"> Thus this case reduces to case 2.</Paragraph>
    <Paragraph position="7"> 4. If K = Fu+~Ar, where Fu is of type f2 and Ar is of type f0 or fl, then by induction hypothesis there exists At' such that Ar ~_ Ar' and At' is accepted. Then K can be defined as Fu+i(r)Ar'.</Paragraph>
  </Section>
  <Section position="11" start_page="286" end_page="286" type="metho">
    <SectionTitle>
5. IMPLEMENTATION AND COVE-
RAGE
</SectionTitle>
    <Paragraph position="0"> FG is implemented in PIMPLE, a PROLOG term unification implementation of PATR II (cf. Calder 1987) developed at Edinburgh University (Centre for Cognitive Studies). Modifications to the parsing algorithm have been introduced at the &amp;quot;Universit6 Blaise Pascal&amp;quot;, Clermont-Ferrand. The system runs on a SUN M 3/50 and is being extensively tested. It covers at present : declarative, interrogative and negative sentences in all moods, with simple and complex verb forms. This includes yes/no questions, constituent questions, negative sentences, linearity phenomena introduced by interrogative inversions, semi free constituent order, clitics (including reflexives), agreement phenomena (including gender and number agreement between obj NP to the left of the verb and participles), passives, embedded sentences and unbounded dependencies. null</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML