<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-1502">
  <Title/>
  <Section position="4" start_page="0" end_page="10" type="metho">
    <SectionTitle>
2 eXtensible MetaGrammar (XMG)
</SectionTitle>
    <Paragraph position="0"> In contrast to other metagrammatical frameworks, XMG (Duchier et al., 2004) uses an expressive yet simple language that enables a monotonic description of a real-scale grammar. Monotonicity is important because it means that the order in which the different operations are applied does not matter. Order dependence is the major drawback of lexical-rule systems. Moreover, Crabbé (2005b) shows that XMG is sufficiently expressive to conveniently implement a core TAG for French.</Paragraph>
    <Paragraph position="1"> XMG allows the grammar writer to manipulate tree descriptions through a control language. The underlying intuition is that a metagrammatical language needs to provide means to describe syntactic information along two methodological axes (Crabbé, 2005b): structure sharing and alternatives. Structure sharing is the axis dedicated to expressing factorisation in the grammar, whereas alternatives make it possible to express regular alternation relationships, such as the alternation between the representation of a canonical nominal subject and its interrogative representation, or between an active and a passive verb form (footnote 1).</Paragraph>
    <Paragraph position="2"> Building on this intuition, the XMG language allows the user to name partial tree descriptions within classes. The name of the class can be manipulated afterwards. For instance, the following tree descriptions on the right of the arrow are associated with the names stated on the left of the arrow: (1) [tree descriptions not reproduced] Naming is the main device that allows the grammar writer to express and to take advantage of the structure sharing axis mentioned above. Indeed, class names can be reused in other descriptions.</Paragraph>
    <Paragraph position="3"> Names can also be used to describe alternatives. To express, in our simplified example, that a Subject is an abstract way to name a RelativisedSubject or a CanonicalSubject, we use a choice operator (∨) as illustrated below: (2) Subject → CanonicalSubject ∨ RelativisedSubject Disjunction (non-deterministic choice) is the device provided by the language to express the methodological axis of alternatives.</Paragraph>
    <Paragraph position="4"> Finally, names can be given to class combinations. To express the composition of two tree descriptions in the language, we use the ∧ operator. Footnote 1: The passive is a semi-regular alternation: many transitive verbs do not passivise. Our system presupposes a classical architecture for the computational representation of Tree Adjoining Grammars such as XTAG, where means to express such exceptions during the anchoring process are well known. In what follows, we therefore consider only tree templates (or tree schemata) as our working units. Finally, the trees depicted in this paper take their inspiration from the grammar described by Abeillé (2002).</Paragraph>
    <Paragraph position="5"> Footnote 2: To represent the tree descriptions mentioned in this paper, we use a graphical notation. Immediate dominance is depicted with a straight line and precedence follows the graphical order. Note that nodes are decorated with their labels only, ignoring the names of the variables denoting them. Note also that we use only the reflexive transitive closure of precedence between sibling nodes, and that it is explicitly stated with the symbol ≺*.</Paragraph>
    <Paragraph position="6"> Thus we can say that an IntransitiveVerb is made by the composition of a Subject and a VerbalForm as follows: (3) IntransitiveVerb → Subject ∧ VerbalForm Given these 3 primitives, the control language is naturally interpreted as a context-free grammar whose terminals are tree descriptions and where our composition plays the role of concatenation.</Paragraph>
    <Paragraph position="7"> This abstract grammar, or metagrammar, is further restricted to be non-recursive in order to ensure that the generated TAG is finite.</Paragraph>
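    <Paragraph> The reading of the control language as a non-recursive context-free grammar can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the rule encoding (the tags "or", "and" and "desc"), the RULES table and the expand function are hypothetical names that do not come from XMG, and tree descriptions are stood in for by their class names.

```python
from itertools import product

# Hypothetical encoding of the running example. A rule body is either
# ("or", alternatives), ("and", parts) or ("desc", name), where "desc"
# stands for a terminal tree description.
RULES = {
    "Subject": ("or", ["CanonicalSubject", "RelativisedSubject"]),
    "IntransitiveVerb": ("and", ["Subject", "VerbalForm"]),
    "CanonicalSubject": ("desc", "CanonicalSubject"),
    "RelativisedSubject": ("desc", "RelativisedSubject"),
    "VerbalForm": ("desc", "VerbalForm"),
}


def expand(name, rules=RULES, active=()):
    """Return every sequence of terminal descriptions derivable from name."""
    if name in active:
        # The metagrammar must be non-recursive so that the result is finite.
        raise ValueError("recursive class: " + name)
    kind, body = rules[name]
    if kind == "desc":
        return [(body,)]
    seen = active + (name,)
    if kind == "or":
        # Disjunction: collect the expansions of each alternative.
        return [seq for alt in body for seq in expand(alt, rules, seen)]
    # Conjunction: concatenate one expansion of each part.
    results = []
    for combo in product(*(expand(part, rules, seen) for part in body)):
        results.append(tuple(d for seq in combo for d in seq))
    return results


sentences = expand("IntransitiveVerb")
```

With IntransitiveVerb as the axiom, the sketch yields two sequences of descriptions, one per kind of subject. Each sequence would then be handed to the description solver to be conjoined.</Paragraph>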
    <Paragraph position="8"> Provided the axiom IntransitiveVerb, an interpreter for this language generates non-deterministically all the sentences of the grammar underlying a grammatical description. Thus in our current example the two sentences generated are those depicted on the left-hand side of the arrows in Figure 1. The right-hand side of each arrow depicts the result of the composition of the tree descriptions. It remains to make clear what this composition actually is. The grammatical classes may contain information on tree descriptions and/or express composition of descriptions stated in other classes. Tree descriptions take their inspiration from the logic described in (Rogers and Vijay-Shanker, 1994). Its syntax is the following:</Paragraph>
    <Paragraph position="10"> where x, y are node variables, → the dominance relation, ≺ the precedence relation, and * denotes the reflexive transitive closure of a relation. The last line associates x with a feature f whose value is the result of evaluating expression E.</Paragraph>
    <Paragraph position="11"> Tree descriptions are interpreted as the finite linear ordered trees that are the minimal models of the description. Using tree descriptions, the above-mentioned operation of tree "composition" breaks down to a conjunction of formulas where the variables of each conjunct are, in first approximation, renamed to avoid name collisions. Renaming is a crucial difference with previous approaches to metagrammar (Candito, 1999; Xia, 2001), where the user had to explicitly manage a "global namespace". Here specific attention is given to namespace management, because this was a bottleneck for real-scale grammar design. More precisely, each class has its own namespace of identifiers, and namespace merging can be triggered when a class combination occurs. This merging relies on a fine-grained import/export mechanism.</Paragraph>
    <Paragraph position="12"> In addition to conjunction and disjunction, XMG is augmented with syntactic sugar to offer some of the features that other metagrammatical formalisms propose. For instance, inheritance of classes is not built into the core language but is realised through conjunction and namespace import. Of course, this restricts users to monotonic inheritance (specialisation), but it seems to be sufficient for most linguists.</Paragraph>
  </Section>
  <Section position="5" start_page="10" end_page="12" type="metho">
    <SectionTitle>
3 Constraining admissible structures
</SectionTitle>
    <Paragraph position="0"> XMG has been tested against the development of a large-scale French grammar (Crabbé, 2005a). To ease practical grammatical development, we have added several augmentations to the common tree description language presented so far in order to further restrict the class of admissible structures generated by the metagrammar.</Paragraph>
    <Paragraph position="1"> Further constraining the structures generated by a grammar is a common practice in computational linguistics. For instance, a Lexical Functional Grammar (Bresnan and Kaplan, 1982) further restricts the structures generated by the grammar by means of functional uniqueness and functional completeness principles. These constraints further restrict the class of admissible structures generated by an LFG grammar to those that verify valency conditions. For TAG, and in a theoretical context, (Frank, 2002) states a set of such well-formedness principles that contribute to formulating a TAG theory within a minimalist framework. In what remains we describe operational constraints of this kind that further restrict the admissibility of the structures generated by the metagrammar. By contrast with the principles stated by (Frank, 2002), we do not make any theoretical claim; instead we state operational constraints that have been found useful in practical grammar development.</Paragraph>
    <Paragraph position="2"> However, as already noted by (Frank, 2002), and in contrast to an LFG framework where constraints apply to the syntactic structure of a sentence as a whole, we formulate here constraints on the well-formedness of TAG elementary trees. In other words, these constraints apply to units that themselves define their own global domain of locality. In this case, it means that we can safely ignore locality issues while formulating our constraints. This is theoretically weaker than formulating constraints on the whole sentential structure, but this framework allows us to generate common TAG units, preserving the formal and computational properties of TAG.</Paragraph>
    <Paragraph position="3"> We formulate this constraint driven framework by specifying conditions on model admissibility.</Paragraph>
    <Paragraph position="4"> Methodologically the constraints used in the development of the French TAG can be classified in four categories: formal constraints, operational constraints, language dependent constraints and theoretical principles.</Paragraph>
    <Paragraph position="5"> First, the formal constraints are those constraining the trees generated by the model builder to be regular TAG trees. These constraints require the trees to be linear ordered trees with appropriate decorations: each node has a category label; leaf nodes are either terminal, foot or substitution nodes; there is at most one foot node; the category of the foot node is identical to that of the root node; and each tree has at least one leaf node that is an anchor.</Paragraph>
    <Paragraph position="6"> It is worth noting here that using a different set of formal constraints may change the target formalism. Indeed, XMG provides a different set of formal constraints (not detailed here) that make it possible to generate elementary units for another formalism, namely Interaction Grammars.</Paragraph>
    <Paragraph position="7"> The second kind of constraint is a single operational constraint dubbed the colouration constraint. We found it convenient in the course of grammar development. It consists of associating colour-based polarities with the nodes to ensure a proper combination of the fragmentary tree descriptions stated within classes. Since in our framework descriptions stated in two different classes are renamed before being conjoined, given a formula being the conjunction of the two following tree descriptions:</Paragraph>
    <Paragraph position="9"> both of the following trees are valid models of that formula: (5) (a), (b) [trees not reproduced]</Paragraph>
    <Paragraph position="11"> In the context of grammar development, however, only (a) is regarded as a desired model. To rule out (b), (Candito, 1999; Xia, 2001) use a naming convention that can be viewed as follows (footnote 4): they assign a name to every node of the tree description. Both further constrain model admissibility by enforcing the identity of the interpretation of two variables associated with the same name. Thus the description stated in their systems can be exemplified as follows: [example not reproduced] This design choice creates two additional complications: (1) it forces the grammar writer to manually manage a global naming scheme, entailing obvious problems as the size of the grammatical description grows, and (2) it prevents the user from reusing the same class several times in a composition. This case is a real issue in the context of grammatical development, since a grammar writer willing to describe a ditransitive context with two prepositional phrases cannot reuse a fragment describing such a PP twice, since the naming constraint will identify the two occurrences.</Paragraph>
    <Paragraph position="12"> Footnote 4: They actually use a different formal representation that does not affect the present discussion.</Paragraph>
    <Paragraph position="13"> To solve these problems we use a colouration constraint. This constraint associates unary properties, colours, with every node of the descriptions. A colour is taken from the set {red (•R), black (•B), white (◦W)}. A valid model is a model in which every node is coloured either red or black. Two variables in the description that are interpreted by the same node have their colours merged following the table given in Figure 2.</Paragraph>
    <Paragraph position="14"> The table indicates the resulting colour after a merge. The ⊥ symbol indicates that two colours cannot be merged, and hence that two nodes labelled with these colours cannot be merged. Note that the table is designed to ensure that merging is not a procedural operation.</Paragraph>
    <Paragraph position="15"> The idea behind colouration is that of saturating the tree description. The colour white represents non-saturation, that is, the need of a node to be combined with a resource, represented by the colour black. Black nodes need not necessarily be combined with other nodes. Red is the colour used to label nodes that cannot be merged with any other node. A sample tree description with coloured nodes is as follows:</Paragraph>
    <Paragraph position="17"> Colours contribute to ruling out the (b) case and relieve the grammar writer of the burden of manually managing a "global namespace".</Paragraph>
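    <Paragraph> Since the merge table of Figure 2 is not reproduced in this extract, the following Python sketch rebuilds it from the prose alone. It is a sketch under assumptions rather than the XMG implementation: the function names are hypothetical, and the entry white + white = white is an assumption, as the text only states that white needs a black resource, that black nodes do not merge with each other, and that red merges with nothing.

```python
from itertools import product

RED, BLACK, WHITE, FAIL = "red", "black", "white", "fail"


def merge(a, b):
    """Colour obtained when two nodes coloured a and b are identified."""
    if FAIL in (a, b) or RED in (a, b):
        return FAIL   # red never merges; a failed merge stays failed
    if a == WHITE:
        return b      # white + black = black; white + white = white (assumed)
    if b == WHITE:
        return a      # black + white = black
    return FAIL       # black + black is forbidden


def saturated(colours):
    """A model is valid when every node ends up red or black."""
    return all(c in (RED, BLACK) for c in colours)


# "Not a procedural operation": the result must not depend on the order
# in which nodes are merged, i.e. merge is commutative and associative.
ALL = (RED, BLACK, WHITE, FAIL)
commutative = all(merge(a, b) == merge(b, a) for a, b in product(ALL, repeat=2))
associative = all(
    merge(merge(a, b), c) == merge(a, merge(b, c))
    for a, b, c in product(ALL, repeat=3)
)
```

Under these assumptions both checks hold, which is one way to read the remark that merging is not a procedural operation.</Paragraph>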
    <Paragraph position="18"> The third category of constraints consists of language-dependent constraints. In the case of French, such constraints are clitic ordering, island constraints, etc. We illustrate these constraints with clitic ordering in French. In French, clitics are non-tonic particles with two specific properties already identified by (Perlmutter, 1970): first, they appear in front of the verb in a fixed order according to their rank (8a-8b), and second, two different clitics in front of the verb cannot have the same rank (8c).</Paragraph>
    <Paragraph position="19"> For instance the clitics le, la have the rank 3 and lui the rank 4.</Paragraph>
    <Paragraph position="20"> In the French grammar of (Crabbé, 2005a), trees with clitics are generated with the fragments illustrated on the left of the arrow in Figure 3. As illustrated on the right of the arrow, the composition may generate ill-formed trees. To rule them out we formulate a clitic ordering constraint. Each variable labelled with a clitic category is also labelled with a property, an integer representing its rank. The constraint stipulates that sibling nodes labelled with a rank have to be linearly ordered according to the order defined over integers.</Paragraph>
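    <Paragraph> The clitic ordering constraint can be illustrated by a small check over the ranks of sibling clitic nodes, read from left to right. The sketch below is illustrative only: the function names and the RANK table are hypothetical, the table contains just the three ranks quoted in the text, and requiring a strictly increasing sequence is an assumption that folds the two properties (fixed order, no two clitics of the same rank) into a single test.

```python
# Ranks quoted in the text: le and la have rank 3, lui has rank 4.
RANK = {"le": 3, "la": 3, "lui": 4}


def ranks_well_ordered(ranks):
    """True when the ranks strictly increase from left to right."""
    return all(right > left for left, right in zip(ranks, ranks[1:]))


def clitics_admissible(clitics):
    """Check a left-to-right sequence of sibling clitics."""
    return ranks_well_ordered([RANK[c] for c in clitics])
```

For example, the sequence le, lui is accepted, whereas lui, le (wrong order) and le, la (same rank) are both rejected.</Paragraph>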
    <Paragraph position="21"> Overall, language-dependent constraints handle cases where the information independently specified in different fragments may interact. These interactions are the metagrammar counterpart of the interactions between independently described lexical rules in a lexical-rule-based system. Assuming independent lexical rules moving canonical arguments (NP or PP) to their clitic position, lexical rules fall short of capturing the relative ordering among clitics (footnote 6).</Paragraph>
    <Paragraph position="22"> A fourth category of constraints, not implemented in our system so far, consists of the language-independent principles defining the theory underlying the grammar. Such constraints could involve, for instance, a Principle of Predicate Argument Cooccurrency (PPAC) or even the set of minimalist principles described by (Frank, 2002).</Paragraph>
  </Section>
  <Section position="6" start_page="12" end_page="14" type="metho">
    <SectionTitle>
4 Efficient implementation
</SectionTitle>
    <Paragraph position="0"> We now describe the implementation of our metagrammatical framework. In particular, we will focus on the implementation of the constraints discussed above within XMG.</Paragraph>
    <Paragraph position="1"> Footnote 6 (beginning missing): [...] in a generative grammar framework where clitics were assumed to be moved by transformations.</Paragraph>
    <Paragraph position="2"> As mentioned above, a metagrammar corresponds to a reduced description of the grammar. In our case, this description consists of tree fragments combined either conjunctively or disjunctively. These combinations are expressed using a language close to the Definite Clause Grammar formalism (Pereira and Warren, 1980), except that partial tree descriptions are used as terminal symbols. In this context, a metagrammar can be reduced to a logic program whose execution will lead to the computation of the trees of the grammar. To perform this execution, a compiler for our metagrammatical language has been implemented.</Paragraph>
    <Paragraph position="3"> This compilation is a 3-step process, as shown in the accompanying figure (not reproduced here). First, the metagrammar is compiled into instructions for a specific virtual machine inspired by the Warren Abstract Machine (Ait-Kaci, 1991). These instructions correspond to the unfolding of the relations (footnote 7) contained in the tree descriptions of the metagrammar.</Paragraph>
    <Paragraph position="4"> Then, the virtual machine performs unifications of structures meant to refer to corresponding information within fragments (e.g. two nodes, two feature structures, etc.). Note that XMG's virtual machine uses the structure sharing technique for memory management, i.e. data are represented by a pair consisting of a pattern and the environment in which to interpret it. The consequences are that (a) we save memory when compiling the metagrammar, and (b) we have to perform pointer dereferencing during unification. Even if the latter is time-consuming, it remains more efficient than structure copying, as we may have to deal with a large number of tree descriptions.</Paragraph>
    <Paragraph position="5"> Finally, as a result of this instruction processing by the virtual machine, we obtain potentially total tree descriptions, which have to be solved in order to produce the expected TAG. Footnote 7: These relations are either dominance or precedence between node variables, or their reflexive transitive closure, or the labelling of a node variable with feature structures.</Paragraph>
    <Paragraph position="6"> Now, we will introduce XMG's tree description solver and show that it is naturally designed to process efficiently the higher level constraints mentioned above. In particular, we will see that the description solver has been designed to be easily extended with additional parametric admissibility constraints.</Paragraph>
    <Section position="1" start_page="13" end_page="14" type="sub_section">
      <SectionTitle>
4.1 Tree descriptions solving
</SectionTitle>
      <Paragraph position="0"> To find the minimal models corresponding to the total tree descriptions obtained by accumulating fragmentary tree descriptions, we use a tree description solver. This solver has been developed in the Constraint Programming paradigm using the constraint satisfaction approach of (Duchier and Niehren, 2000). The idea is to translate relations between node variables into constraints over sets of integers.</Paragraph>
      <Paragraph position="1"> Basically, we refer to a node of the input description in terms of the nodes that are equal to it, above it, below it, or beside it (see Figure 5). More precisely, we associate each node of the description with an integer; our reference to a node then corresponds to a tuple containing sets of nodes (i.e. sets of integers).</Paragraph>
      <Paragraph position="2"> As a first approximation, let us imagine that we refer to a node x in a model by means of a 5-tuple N^i_x = (Eq, Up, Down, Left, Right), where i is an integer associated with x and Eq (respectively Up, Down, Left, Right) denotes the set of nodes (footnote 8) in the description which are equal to (respectively above, below, to the left of, and to the right of) x.</Paragraph>
      <Paragraph position="3"> Then we can convert the relations between nodes of our description language into constraints on sets of integers.</Paragraph>
      <Paragraph position="4"> Footnote 8: I.e. integers.</Paragraph>
      <Paragraph position="5"> For instance, consider two nodes x and y of the description. Assuming we associate x with the integer i and y with j, we can translate the dominance relation x → y in the following way: This means that if the node x strictly dominates y in the input description, then (i) the set of nodes that are above or equal to x in a valid model is included in the set of those that are strictly above y, (ii) the dual holds for the nodes that are below, (iii) the set of nodes that are on the left of x is included in the set of those that are on the left of y, and (iv) similarly for the right part.</Paragraph>
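      <Paragraph> To make the set-based node representation concrete, the following Python sketch computes the five sets for every node of a small, fully specified tree and checks the four inclusions that strict dominance implies. It is an illustration only, not the constraint solver itself: the TREE example and all function names are hypothetical, and the sets are computed from a known model, whereas the solver works in the other direction and constrains the sets until a model is found. For the left and right sets the sketch checks that whatever is to the left (right) of x is also to the left (right) of y.

```python
# Hypothetical model: each node (an integer) maps to its ordered children.
TREE = {0: [1, 2, 3], 2: [4, 5, 6], 1: [], 3: [], 4: [], 5: [], 6: []}


def descendants(tree, n):
    """All nodes strictly below n."""
    out = set()
    for child in tree[n]:
        out.add(child)
        out.update(descendants(tree, child))
    return out


def node_sets(tree):
    """Map each node to its (Eq, Up, Down, Left, Right) sets of integers."""
    down = {n: descendants(tree, n) for n in tree}
    up = {n: {m for m in tree if n in down[m]} for n in tree}
    left = {n: set() for n in tree}
    right = {n: set() for n in tree}
    for kids in tree.values():
        for i, a in enumerate(kids):
            for b in kids[i + 1:]:
                # Every node at or under a precedes every node at or under b.
                for u in down[a].union({a}):
                    for v in down[b].union({b}):
                        right[u].add(v)
                        left[v].add(u)
    return {n: ({n}, up[n], down[n], left[n], right[n]) for n in tree}


def dominance_holds(sets, x, y):
    """Check the four inclusions implied by x strictly dominating y."""
    eq_x, up_x, down_x, left_x, right_x = sets[x]
    eq_y, up_y, down_y, left_y, right_y = sets[y]
    return (
        eq_x.union(up_x).issubset(up_y)
        and eq_y.union(down_y).issubset(down_x)
        and left_x.issubset(left_y)
        and right_x.issubset(right_y)
    )


SETS = node_sets(TREE)
```

In this example node 2 dominates node 5, so dominance_holds(SETS, 2, 5) is true, while swapping the arguments, or picking the unrelated leaf 1 as x, makes the check fail.</Paragraph>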
      <Paragraph position="6"> Once the constraint framework is set up, we can search for the solutions to our problem, i.e. the variable assignments for each of the sets of integers used to refer to the nodes of the input description. This search is performed by associating with each pair of nodes (x, y) of the input description a choice variable denoting the mutually exclusive relations between these two nodes. Then we use a search strategy to explore the consistent assignments to these choice variables (and the associated assignments for sets of integers referring to nodes) (footnote 12). Note that the strategy used in XMG is a first-fail strategy, which leads to very good results (see section 5 below). The implementation of this solver has been done using the constraint programming support of the Mozart Programming System (The Oz-Mozart Board, 2005).</Paragraph>
    </Section>
    <Section position="2" start_page="14" end_page="14" type="sub_section">
      <SectionTitle>
4.2 Extension to higher-level constraints solving
</SectionTitle>
      <Paragraph position="0"> An important feature of our approach is that this system of constraints over integer sets can be extended so that we ensure not only the well-formedness of the output trees, but also compliance with linguistic properties such as the uniqueness of clitics in French, etc.</Paragraph>
      <Paragraph position="1"> The idea is that if we adequately extend our node representation, we can find additional constraints that reflect the syntactic constraints we want to express.</Paragraph>
      <Paragraph position="2"> Clitic uniqueness For instance, let us consider the clitic uniqueness constraint introduced above. We want to express the fact that in a valid model φ, there is only one node having a given property p (i.e. a parameter of the constraint, here the category clitic (footnote 13)). This can be done by introducing, for each node x of the description, a boolean variable p_x indicating whether the node denoting x in the model has this property or not. Then, if we call V^φ_p the set of integers referring to nodes having the property p in a model, we have: p_x ≡ (N^i_x.Eq ∩ V^φ_p) ≠ ∅ Finally, if we represent the true value with the integer 1 and false with 0, we can sum the p_x for each x in the model. When this sum gets greater than 1, we can consider that we are not building a valid model.</Paragraph>
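      <Paragraph> The counting argument can be sketched as follows. This is a simplified illustration under assumptions: each model node is represented directly by its Eq set (the integers of the description nodes identified with it), the function name and argument names are hypothetical, and the check is run on a finished model rather than propagated during search as a real constraint would be.

```python
def property_is_unique(model_nodes, with_property):
    """Return True when at most one model node carries the property.

    model_nodes: list of Eq sets, one per node of the model.
    with_property: integers of the description nodes having property p.
    """
    # p_x is 1 when the Eq set shares an element with the property set.
    flags = [1 if eq.intersection(with_property) else 0 for eq in model_nodes]
    # A sum greater than 1 means the model is not valid.
    return sum(flags) in (0, 1)
```

Note that two description nodes carrying the property count only once when they are identified with the same model node, since they then share a single Eq set.</Paragraph>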
      <Paragraph position="3"> Colouration constraint Another example of the constraints introduced in section 3 is colouration. Colouration represents operational constraints whose effect is to control tree fragment combination. The idea is to label nodes with a colour among red, black and white. Then, during description solving, nodes are identified according to the rules given previously (see Figure 2).</Paragraph>
      <Paragraph position="4"> Footnote 12: More information about the use of such choice variables is given in (Duchier, 1999). Footnote 13: In fact, the uniqueness concerns the rank of the clitics, see (Crabbé, 2005b), §9.6.3.</Paragraph>
      <Paragraph position="5"> That is, red nodes are not identified with any other node, and white nodes can be identified with a black one. Black nodes are not identified with each other. A valid model in this context is a saturated tree, i.e. one where nodes are either black (possibly resulting from identifications) or red. In other words, for every node in the model, there is at most one red or black node with which it has been identified. Such a constraint is implemented in the following way. First, the tuples representing nodes are extended by adding an integer field RB referring to the red or black node with which the node has been identified. Then, considering the following sets of integers: V_R, V_B, V_W, respectively containing the integers referring to red, black and white nodes in the input description, the following constraints hold:</Paragraph>
      <Paragraph position="7"> where V^φ_B represents the black nodes in a model, i.e. V^φ_B = V^φ ∩ V_B. (a) expresses the fact that for red nodes, N^i_x.RB is the integer i associated with x itself, and N^i_x.Eq is a set containing only i. (b) means that for black nodes, N^i_x.RB is also the integer i denoting x itself, but we cannot say anything about N^i_x.Eq. Finally, (c) means that white nodes have to be identified with a black one.</Paragraph>
      <Paragraph position="8"> Thus, we have seen that Constraint Programming offers an efficient and relatively natural way of representing syntactic constraints, as "all" that has to be done is to find an adequate node representation in terms of sets of nodes, then declare the constraints associated with these sets, and finally use a search strategy to compute the solutions.</Paragraph>
    </Section>
  </Section>
  <Section position="7" start_page="14" end_page="15" type="metho">
    <SectionTitle>
5 Some features
</SectionTitle>
    <Paragraph position="0"> There are two points worth considering here: (i) the usability of the formalism to describe a real-scale grammar with a high degree of factorisation, and (ii) the efficiency of the implementation in terms of time and memory use.</Paragraph>
    <Paragraph position="1"> Concerning the first point, XMG has been used successfully to compute a TAG having more than 6,000 trees from a description containing 293 classes. Moreover, this description has been designed relatively quickly, as the description language is intuitive, as advocated in (Crabbé, 2005a). Concerning the efficiency of the system, the compilation of this TAG with more than 6,000 trees takes about 15 min on a 2.6 GHz P4 processor with 1 GB of RAM. Compared with the compilation time of previous approaches (Candito, 1999; Gaiffe et al., 2002) (with the latter, a TAG of 3,000 trees was compiled in about an hour), these results are quite encouraging.</Paragraph>
    <Paragraph position="2"> Finally, XMG is released under the terms of the GPL-like CeCILL license and can be freely downloaded at http://sourcesup.cru.fr/xmg.</Paragraph>
  </Section>
</Paper>