File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/89/e89-1034_metho.xml

Size: 17,731 bytes

Last Modified: 2025-10-06 14:12:19

<?xml version="1.0" standalone="yes"?>
<Paper uid="E89-1034">
  <Title>FRENCH ORDER WITHOUT ORDER*</Title>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
1. UCG
</SectionTitle>
    <Paragraph position="0"> In UCG, the phonological, categorial, semantic and order information associated with a word is contained in a single grammar structure called a sign. This can be represented as follows.</Paragraph>
    <Paragraph position="1">  where colons separate the different fields of the sign. We need not concern ourselves here with the Semantics and the Phonology fields of the sign. More interesting for our purpose are the Category and the Order attributes. Following the categorial tradition, categories can be basic or complex. Basic categories are of the form HeadAFeatures where Head is one of the atomic symbols n(oun), np or s(entence) and Features is a list of feature values. Complex categories are of the form C/Sign, where C is either atomic or complex and Sign is a sign, so that departing from traditionalCG's,a functorplaces constraints on the whole sign of the argument rather than on its syntactic category only. The part of a complex category which omits the Head'Weatures information constitutes the activepart of the sign. The f'trst accessible sign in the active part is called the active sign, the idea being that e.g. verb valencies are ordered in a list so that each time a valency is consumed, the next sign in the active part becomes the new active sign. The Order attribute places constraints on the combination rule that may apply to a functor: pre on an argument sign Y indicates that the functor X/Y must precede the argument, while post indicates that the functor must follow the argument.</Paragraph>
    <Paragraph position="2"> - 249-Using terms and term unification, the forward version 1 of functional application can then be stated as follows.  were upper letters indicate Prolog variables. In effect, the rule requires that the active part of the functor sign term unifies with the argument sign. The Result is a sign identical to the functor sign, but where the complex category is stripped from its active part and where variables shared by the active part of the functor and the rest of the functor sign may have become ground as a result of the active part unifying with the argument.</Paragraph>
    <Paragraph position="3"> The resulting phonology consists of the phonology of the functor followed by the phonology of the argument.</Paragraph>
    <Paragraph position="4"> An illustrative combination is given in (3) below for the sentence Jean marche.</Paragraph>
    <Paragraph position="5">  where lines represent the information flow determining ordering : shared variables ensure that pre in a verb valency constrains the fanctor NP that consumes this valency to precede the verb carrying this valency.</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2. LINGUISTIC OBSERVATIONS
</SectionTitle>
    <Paragraph position="0"> Word order in French is characterised by three main facts. First, the positioning - left or right- of a particular argument with respect to the verb is relatively free. As t. Baekward application is just the symmetric of (2) where the argument precedes the funetor endpre becomes post.</Paragraph>
    <Paragraph position="1"> illustrated in (4), the subject can appear to the left (4a) or to the right (4b,c) of the verb, or between the auxiliary and the verb (4d), depending on the morphological class of the NP and on the type of the sentence (declarative, subject-auxiliary inversion, wh-question, etc).</Paragraph>
    <Paragraph position="2">  (4) (a) Jacques aime Marie.</Paragraph>
    <Paragraph position="3"> CO) Alme-t-il Marie ? (c) Quel livre aime Jacques ? (d) A-t-il aim6 Marie ? All other arguments can also appear to the left or to the right of the verb under similar conditions. For example, a lexical non-nominative NP can never be to the left of the verb, but critics and wh-constituents can. (5) (a) *Marie aregard~e Jacques ? (with Marie = Obj) Co) QueUe revue a regard~e Jacques ? (with Quelle revue = Obj) (c) Jacques l'a regardte  Second, there seems to be no clear regularities governing the relative ordering of a sequence of arguments. That is, assuming that only adjacent constituents may combine and taking the combinations leftto-right, the combination pattern varies as indicated below of each example in (6). Here again, the permissible distributions are influenced by factors such as the morphological class of the constituents and the verb mood.</Paragraph>
    <Paragraph position="4">  (6) (a) Pierre donne h Marie un livre.</Paragraph>
    <Paragraph position="5">  Third, coocurrence restrictions hold between constituents. For example, clitics constrain the positioning and the class of other arguments as illustrated in (7) 2  (7) (a) Pierre le lui donne.</Paragraph>
    <Paragraph position="6"> Co) Pierre lui en donne.</Paragraph>
    <Paragraph position="7"> (c) Pierre lui donne un livre.</Paragraph>
    <Paragraph position="8"> (d) *Pierre lui le donne.</Paragraph>
    <Paragraph position="9"> (e) *Pierre lui y donne.</Paragraph>
    <Paragraph position="10">  Since the ordering and the positioning of verb arguments in French are very flexible, the rigid orde-In italics : the word whose coocurrence restriction is violated (starred sentences) or obeyed (non-starred sentences). For instance, (7d) is starred because lu/may not be followed by le.</Paragraph>
    <Paragraph position="11"> - 250ring forced by the UCG active list and the fixed positioning resulting from the Order attribute are rather inadequate. On the other hand, word order in French is not free either. Rather it seems to be governed  by conditional ordering statements such as: (8) IF (a) the verb has an object valency, and Co) the object NP is a wh-constituent, and (c) the verbal constituent is the simple inflected verb, and (d) the clitic t-il/elle has not been incorporated THEN the object can be placed to the left or to the  right of the verb.</Paragraph>
    <Paragraph position="12"> If say, (8d) is not fulfilled, the wh-NP can be placed only to the left, witness: *Jacques a-t-il regard~ quelle revue ?, and mutatis mutandis for the other conditions. More generally, five elements can be isolated whose interaction determine whether or not a given argument  can occupy a given position in the sentence. (9) (a) Position - left or right - with regard to the verb, Co) Verbal form and sentence type, (c) Morphological class (lexical, wh-constituent or clitic) of the previous constituent having concatenated to the left or to the right of the verb, (d) Morphological class of the current constituent (lexical, wh-constituent or clitic), (e) Case.</Paragraph>
    <Paragraph position="13">  We claim that it is possible to extend UCG in order to express the above conditioning variables. The resulling grammar can account for the preceding linguistic facts without resorting either to lexical ambiguity or to jump rules 3.</Paragraph>
  </Section>
  <Section position="6" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3. EXTENSIONS TO UCG
</SectionTitle>
    <Paragraph position="0"> To account for the facts presented in section 2, UCG has been modified in two ways. Firstly, the active part ofaverb category is represented as a set rather than a list. Secondly, a feature system is introduced which embodies the interactions of the different elements conditioning word order as described in (9) above.</Paragraph>
  </Section>
  <Section position="7" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3.1 SIGN STRUCTURE AND COMBINATION
RULE : FROM AN ACTIVE LIST TO AN
ACTIVE SET.
</SectionTitle>
    <Paragraph position="0"> To accomodate our analysis, the sign structure and the combination rule had to be slightly modified. In the  in that (i) there are no Features associated with the Head and (ii) the active part of a verb is viewed as a set rather than as a list.</Paragraph>
    <Paragraph position="1"> The Features attribute is a list of features. In this paper, only those relevant to order constraints are mentioned. They are: case, verb mood, morphological class of NP's (i.e., lexical, clitic or wh-constituent) and last concatenation to the left (Lastlefl) or to the right (Lastright). The latter features indicate the morphological status of the last concatened functorand are updated by the combination rule (cf. (13)). For instance, the sign associated with Jean lui a donnd un livre will have lex as values for Lastlefl and Lastright whereas lui a donn~ un livre has lui and lex respectively. The Features attribute can be represented as in (11 ) below, where the same feature may occupy a different position in the feature list of different linguistic units, e.g., feature list of verb valencies and feature list of NP signs.</Paragraph>
    <Paragraph position="2">  As illustrated in (12), the Order attribute has two parts, one for when the functor combines forward, the other for when it combines backward.</Paragraph>
    <Paragraph position="3">  where Cdts and Resfeat are lists of feature values whose order and content are independent from those of the Features attribute. The intuition behind this is that functors (i.e., type-raised NP's) are two-sided i.e., they can combine to the left and to the right but under different conditions and with different results. The features in Cdts place constraints on the features of the argument while the features in Resfeat are inherited by the resulting sign. These effects are obtained by unifi- null 4. In the rest of this paper, the Semantics and the Optionality attributes will be omitted since they have no role to play in our treatment of word order while Phonology will only be represented when relevant.</Paragraph>
    <Paragraph position="4"> - 251 cation of shared variables in the rules of combination. Omitting Semantics and Optionality attributes, the forward combination rule is as follows.</Paragraph>
    <Paragraph position="5"> (13) Forward Combination s (FC)  The rule requires (i) that the functor category combines with the argument category \[\] to yield the result category Category'. The notion of combination relies on the idea that the active part of a verb is a set rather than a list. More precisely, given a type-raised NP NP1 with category C/(C/NP.) where NP i is a valency sign, and a verb V1 with category slActSet where ActSet is a set of valency signs, NP1 combines with V1 to yield V2 iff NPi unifies with some NP-valency sign in the active set ActSet of the verb. V2 is identical to V1 except that the unifying NP i valency sign has been removed from the active set and that some features in V1 will have been instantiated by the rule. Forward combination further requires (ii) that the two features in the condition list to pre unify with the Lastlefl and Vmood features of the argument (the features conditioning post are ignored since they are relevant only when the functor combines backwards), and (iii) that the features of the resulting sign be as specified. Note in particular that the MClass of the resulting sign is the MClass of the argument, that Lastright which indicates the morphological class ofthelast sign to have combined with the verb from theright, is transmitted from the argument, and thatLastlefl is assigned as value the MClass of the functor. Features of the resuiting sign which are conditional on the combination order ate inherited from the Resfeat field. This percox In this figure, numbers inside square denote the following attribute. For instance, \[\] denotes CategoryA'.</Paragraph>
    <Paragraph position="6"> lation of features by rule is crucial to our treatment of word order. It is illustrated by (14) below where the sign S 1 associated with the clitic le combines with the sign for $2 regarde to yield a new sign $3 le regarde.  :\[vb .... \] :(tiui or i, ind\] =&gt; pre =&gt; \[ind\], \[i, imp\] =&gt;post =&gt; \[imp\]) :tie .... \]</Paragraph>
    <Paragraph position="8"> When le is used as a forward functor, the conditions on pre require that the argument i.e., the verb bears for the feature Lastlefl the value lui or i where i stands for initial state thus requiring that the verb has not combined with anything on the left. When it combines by BC, the conditions onpost ensure that the argument has not combined with anything on its right and that it has mood imperative. In this way, all sentences in (15) are parsed appropriately.</Paragraph>
    <Paragraph position="9">  (15) (a) I1 le hi donne.</Paragraph>
    <Paragraph position="10"> Co) *II lui le donne.</Paragraph>
    <Paragraph position="11"> (c) Donne le lui.</Paragraph>
    <Paragraph position="12"> (d) *Donne lui le.</Paragraph>
    <Paragraph position="13">  The backward combination rule (BC) functions like FC except for two things. First, the argument must be to the left of the functor and second, the condition field considered is that ofpost rather than ofpre. There is also a deletion rule to eliminate optional valencies. No additional rule is needed.</Paragraph>
  </Section>
  <Section position="8" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3.2 EXPRESSING THE VARIABLES UNDER-
LYING WORD ORDER CONSTRAINTS
</SectionTitle>
    <Paragraph position="0"> In our grammar, there are nopost andpre primitive values associated with specific verb valencies. Instead, features interact with combination rules to enforce the - 252 constraints on word order described in (9). (9a) is captured in the two-sided order field. (9b - verb mood) and (9c- morphological class of preceding concatenating functor) are accounted for in that in a functor, the features conditioning order include the verb mood and the last concatenation attribute.</Paragraph>
    <Paragraph position="1"> (9d) is accounted for in that conditions which are invariant for a particular class of constituent (clitic, wh-constituent, lexical NP) are expressed in the Order field of these constituents. For example, wh-constituents reject through their conditions topre a wh-value for the Lastlefl feature of the argument and an inv16 value in their condition to post. As a result, the following sentences are parsed appropriately.</Paragraph>
    <Paragraph position="2">  (16) (a) *A qui qui a ttltphon6 ? Co) *A-t-il t~ltphon6 a qui ? (c) A qui a-t-il ttltphon6 ? (d) I1 a ttltphone a qui ?  Conditions which vary depending on the class of the concatenating constituent are expressed in the Features attribute of the verb valencies. This allows us to express constraints on the position of a given type of NP (lex, wh or clitic) relative to the valency it consumes. For instance,a lexical NPcan be subject or object. If it is subject and it is to the left of the verb, it cannot be immediately followed by a wh-constituent. If it is subject and it is placed to the right of the verb, it must be immediately adjacent to it. These constraints can be stated using unification along to the following lines. A verb valency is of the form (17) (np:\[... X,Y ...\]:Ord) where X and Y are either the anonymous variable or a constant. They state constraints, among others, on possible values ofLastlefl andLastright features of the verb. Recall that a valency is a sign which is a member of a set in the Category attribute of a verbal sign. The active sign of a type raised NP is of the form: (18) C/(np:\[... V1, V2...\]:__) :\[vb I_\] :(\[V1 ...\] =&gt; pre =&gt; Z, \[V2...\] =&gt; post =&gt; W) By rule, V1 and V2 in the Category attribute of (18) must unify with X and Y, respectively, in the verb valency (17). Being shared variables, they transmit the information to the Conditions on concatenation by FC (pre) and BC (post), respectively.</Paragraph>
    <Paragraph position="3"> Furthermore, V1 and V2 in the Ord attribute of the functor must unify, by rule, with some specified features in the verb Features attribute represented in (19). The value/nvl for the Lastlefl feature of a verb resulm from a backward combination of the nominative elitic -t-il with this verb.</Paragraph>
    <Paragraph position="4"> (19) \[vb, Lasleft, Lastright .... \] The flow of information between (17), (18) and (19) is represented graphically in (20), where (20a), (20b) and (20c) correspond to (17), (18) and (19) respectively. (20a) and (20c), which express the Category and theFeatures attibutes of the same verbal sign, have been dissociated for the sake of clarity.</Paragraph>
    <Paragraph position="5">  The concatenation by FC is allowed (-wh is compatible with 0, the requirement extracted from the verbal valency being that the Lastleft concatenated contituent with the verb is not a wh-constituent. But a concatenation by BC will fail(i does not unify with lex). Thus examples in (22) are correctly recognised (see Appendix).</Paragraph>
    <Paragraph position="6">  (22) (a) Jean ttltphone ~t la Idle.</Paragraph>
    <Paragraph position="7"> Co) *Ttltphone ~ la fille Jean.</Paragraph>
    <Paragraph position="8"> (c) *Jean ~ quelle fille ttltphone ? (d) A queUe fille ttltphone Jean ?</Paragraph>
  </Section>
  <Section position="9" start_page="0" end_page="0" type="metho">
    <SectionTitle>
4. IMPLEMENTATION
</SectionTitle>
    <Paragraph position="0"> The UCG formalism and the corresponding computational environment were developped at the Centre for Cognitive Science, University of Edinburgh by (Calderetal. 1986). They include facilities for defining templates and path-equations as in PATR-2 and a shift-reduce parser. The extensions to the original framework have been implemented at the Universit6 Blaise - 253 -Pascal, Formation Doctorale Linguistique et Informatique, Clermont-Ferrand (France). The system runs on a Sun and has been extensively tested.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML