File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/92/p92-1022_metho.xml

Size: 29,406 bytes

Last Modified: 2025-10-06 14:13:11

<?xml version="1.0" standalone="yes"?>
<Paper uid="P92-1022">
  <Title>An Alternative Conception of Tree-Adjoining Derivation*</Title>
  <Section position="3" start_page="167" end_page="168" type="metho">
    <SectionTitle>
2 The Standard Definition of
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="167" end_page="168" type="sub_section">
      <SectionTitle>
Derivat ion
</SectionTitle>
      <Paragraph position="0"> To exemplify the distinction between standard and extended derivations, we exhibit the TAG of Figure 1. This grammar derives some simple noun phrases such as &amp;quot;roasted red pepper&amp;quot; and &amp;quot;baked red potato&amp;quot;. The former, for instance, is associated with the derived tree in Figure 2(a). The tree can be viewed as being derived in two ways 1 Dependent: The auxiliary tree fifo is adjoined at the root node (address e) of fire. The resultant tree is adjoined at the root node (address e) of initial tree ap~. This derivation is depicted as the derivation tree in Figure 3(a).</Paragraph>
      <Paragraph position="1"> Independent: The auxiliary trees firdeg and fire are adjoined at the root node of the initial tree ape. This derivation is depicted as the derivation tree in Figure 3(b).</Paragraph>
      <Paragraph position="2"> In the independent derivation, two trees are separately adjoined at one and the same node in the initial tree. In the dependent derivation, on the other hand, one auxiliary tree is adjoined to the</Paragraph>
      <Paragraph position="4"> other, the latter only being adjoined to the initial tree. We will use this informal terminology uniformly in the sequel to distinguish the two general topologies of derivation trees.</Paragraph>
      <Paragraph position="5"> The standard definition of derivation, as codified by Vijay-Shanker, restricts derivations so that two adjunctions cannot occur at the same node in the same elementary tree. The dependent notion of derivation is therefore the only sanctioned derivation for the desired tree in Figure 2(a); the independent derivation is disallowed. Vijay-Shanker's definition is appropriate because for any independent derivation, there is a dependent derivation of the same derived tree. This can be easily seen in that any adjunetion of/32 at a node at which an adjunction of/31 occurs could instead be replaced by an adjunction of/32 at the root of/31.</Paragraph>
      <Paragraph position="6"> The advantage of this standard definition of derivation is that a derivation tree in this normal form unambiguously specifies a derived tree. The independent derivation tree on the other hand is ambiguous as to the derived tree it specifies in that a notion of precedence of the adjunctions at the same node is unspecified, but crucial to the derived tree specified. This follows from the fact that the independent derivation tree is symmetric with respect to the roles of the two auxiliary trees (by inspection), whereas the derived tree is not.</Paragraph>
      <Paragraph position="7"> By symmetry, therefore, it must be the case that the same independent derivation tree specifies the alternative derived tree in Figure 2(b).</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="168" end_page="171" type="metho">
    <SectionTitle>
3 Motivation for Extended
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="168" end_page="168" type="sub_section">
      <SectionTitle>
Derivations
</SectionTitle>
      <Paragraph position="0"> In the absence of some further interpretation of the derivation tree nothing hinges on the choice of derivation definition, so that the standard definition is as reasonable as any other. However, tree-adjoining grammars are almost universally extended with augmentations that make the issue apposite. We discuss three such variations here, all of which argue for the use of independent derivations under certain circumstances.</Paragraph>
    </Section>
    <Section position="2" start_page="168" end_page="170" type="sub_section">
      <SectionTitle>
3.1 Adding Adjoining Constraints
</SectionTitle>
      <Paragraph position="0"> Already in very early work on tree-adjoining grammars (Joshi et al., 1975) constraints were allowed to be specified as to whether a particular auxiliary tree may or may not be adjoined at a particular node in a particular tree. The idea is formulated in its modern variant as selective-adjoining constraints (Vijay-Shanker and Joshi, 1985). As an application of this capability, we consider the remark by Quirk et al. (1985, page 517) that &amp;quot;direction adjuncts of both goal and source can normally be used only with verbs of motion&amp;quot;, which accounts for the distinction between the following sentences: (1)a. Brockway escorted his sister to the annual cotillion.</Paragraph>
      <Paragraph position="1"> b. #Brockway resembled his sister to the annual cotillion.</Paragraph>
      <Paragraph position="2"> This could be modeled by disallowing through selective adjoining constraints the adjunction of the elementary tree corresponding to a to adverbial at the VP node of the elementary tree corresponding to the verb resembles. 2 However, the restriction applies even with intervening (and otherwise acceptable) adverbials.</Paragraph>
      <Paragraph position="3"> (2)a. Brockway escorted his sister last year.</Paragraph>
      <Paragraph position="4"> b. Brockway escorted his sister last year to the annual cotillion.</Paragraph>
      <Paragraph position="5"> (3)a. Brockway resembled his sister last year.</Paragraph>
      <Paragraph position="6"> b. #Brockway resembled his sister last year to the annual cotillion.</Paragraph>
      <Paragraph position="7"> Under the standard definition of derivation, there is no direct adjunction in the latter sentence of the to tree into the resembles tree. Rather, it is dependently adjoined at the root of the elementary tree that heads the adverbial last year, the latter directly adjoining into the main verb tree. To restrict both of the ill-formed sentences, then, a restriction must be placed not only on adjoining</Paragraph>
      <Paragraph position="9"> the goal adverbial in a resembles context, but also in the last year adverbial context. But this constraint is too strong, as it disallows sentence (2b) above as well.</Paragraph>
      <Paragraph position="10"> The problem is that the standard derivation does not correctly reflect the syntactic relation between adverbial modifier and the phrase it modifies when there are multiple modifications in a single clause. In such a case, each of the adverbials independently modifies the verb, and this should be reflected in their independent adjunction at the same point. But this is specifically disallowed in a standard derivation.</Paragraph>
      <Paragraph position="11"> It is important to note that the argument applies specifically to auxiliary trees that correspond to a modification relationship. Auxiliary trees are used in TAG typically for predication relations as well, 3 as in the case of raising and sentential complement constructions. 4 Consider the following sentences. (The brackets mark the leaves of the pertinent trees to be combined by adjunction in the assumed analysis.) Brockway conjectured that Harrison wanted to escort his sister.</Paragraph>
      <Paragraph position="12"> \[Brockway conjectured that\] \[Harrison wanted\] \[to escort his sister\] Brockway wanted to try to escort his sister. null \[Srockway wanted\] \[to try\] \[to escort his sister\] Harrison wanted Brockway tried to escort his sister.</Paragraph>
      <Paragraph position="13"> \[Harrison wanted\] \[Brockway tried\] \[to escort his sister\] Assume (following, for instance, the analysis of Kroch and Joshi (1985)) that the trees associated with the various forms of the verbs &amp;quot;try&amp;quot;, &amp;quot;want&amp;quot;, and &amp;quot;conjecture&amp;quot; all take sentential complements, certain of which are tensed with overt subjects and others untensed with empty subjects. The auxiliary trees for these verbs specify by ad3We use the term 'predication' in its logical sense, that is, for auxiliary trees that serve as logical predicates over the trees into which they adjoin, in contrast to the term's linguistic sub-sense in which the argument of the predicate is a linguistic subject.</Paragraph>
      <Paragraph position="14"> 4 The distinction between predicative and modifier trees has been proposed previously for purely linguistic reasons by Kroch (1989), who refers to them as thematic and athematic trees, respectively. The arguments presented here can be seen as providing further evidence for differentiating the two kinds of auxiliary trees.</Paragraph>
      <Paragraph position="15">  junction constraints which type of sentential complement they take: &amp;quot;conjecture&amp;quot; requires tensed complements, &amp;quot;want&amp;quot; and &amp;quot;try&amp;quot; untensed. Under this analysis the auxiliary trees must not be allowed to independently adjoin at the same node. For instance, if trees corresponding to &amp;quot;Harrison wanted&amp;quot; and &amp;quot;Brockway tried&amp;quot; (which both require untensed complements) were both adjoined at the root of the tree for &amp;quot;to escort his sister&amp;quot;, the selective adjunction constraints would be satisfied, yet the generated sentence (6a) is ungrammatical.</Paragraph>
      <Paragraph position="16"> Thus, the case of predicative trees is entirely unlike that of modifier trees. Here, the standard notion of derivation is exactly what is needed as far as interpretation of adjoining constraints is concerned. In summary, the interpretation of adjoining constraints in TAG is sensitive to the particular notion of derivation that is used. Therefore, it can be used as a litmus test for an appropriate definition of derivation. As such, it argues for a nonstandard, independent, notion of derivation for modifier auxiliary trees and a standard, dependent, notion for predicative trees.</Paragraph>
    </Section>
    <Section position="3" start_page="170" end_page="170" type="sub_section">
      <SectionTitle>
3.2 Adding Statistical Parameters
</SectionTitle>
      <Paragraph position="0"> In a similar vein, the statistical parameters of a stochastic lexicalized TAG (SLTAG) (Resnik, 1992; Schabes, 1992) specify the probability of adjunction of a given auxiliary tree at a specific node in another tree. This specification may again be interpreted with regard to differing derivations, obviously with differing impact on the resulting probabilities assigned to derivation trees. (In the extreme case, a constraint prohibiting adjoining corresponds to a zero probability in an SLTAG.</Paragraph>
      <Paragraph position="1"> The relation to the argument in the previous section follows thereby.) Consider a case in which linguistic modification of noun phrases by adjectives is modeled by adjunction of a modifying tree. Under the standard definition of derivation, multiple modifications of a single NP would lead to dependent adjunctions in which a first modifier adjoins at the root of a second. As an example, we consider again the grammar given in Figure 1, that admits of derivations for the strings &amp;quot;baked red potato&amp;quot; and &amp;quot;baked red pepper&amp;quot;. Specifying adjunction probabilities on standard derivations, the distinction between the overall probabilities for these two strings depends solely on the adjunction probabilities of fire (the tree for red) into apo and ape (those for potato and pepper, respectively), as the tree fib for the word baked is adjoined in both cases at the root of fl~ in both standard derivations. In the extended derivations, on the other hand, both modifying trees are adjoined independently into the noun trees. Thus, the overall probabilities are determined as well by the probabilities of adjunction of the trees for baked into the nominal trees. It seems intuitively plausible that the most important relationships to characterize statistically are those between modifier and modified, rather than between two modifiers. 5 In the case at hand, the fact that potatoes are more frequently baked, whereas peppers are roasted, would be more determining of the expected overall probabilities.</Paragraph>
      <Paragraph position="2"> Note again that the distinction between modifier and predicative trees is important. The standard definition of derivation is entirely appropriate for adjunction probabilities for predicative trees, but not for modifier trees.</Paragraph>
    </Section>
    <Section position="4" start_page="170" end_page="171" type="sub_section">
      <SectionTitle>
3.3 Adding Semantics
</SectionTitle>
      <Paragraph position="0"> Finally, the formation of synchronous TAGs has been proposed to allow use of TAGs in semantic interpretation, natural language generation, and machine translation. In previous work (Shieber and Schabes, 1990), the definition of synchronous TAG derivation is given in a manner that requires multiple adjunctions at a single node. The need for such derivations follows from the fact that synchronous derivations are intended to model semantic relationships. In cases of multiple adjunction of modifier trees at a single node, the appropriate semantic relationships comprise separate modifications rather than cascaded ones, and this is reflected in the definition of synchronous TAG derivation. 6 Because of this, a parser for synchronous TAGs must recover, at least implicitly, the extended derivations of TAG derived trees.</Paragraph>
      <Paragraph position="1"> 5Intuition is an appropriate guide in the design of the SLTAG framework, as the idea is to set up a linguistically plausible infrastructure on top of which a lexically-based statistical model can be built. In addition, suggestive (though certainly not conclusive) evidence along these lines can be gleaned from corpora analyses. For instance, in a simple experiment in which medium frequency triples of exactly the discussed form &amp;quot;(adjective) (adjective) (noun)&amp;quot; were examined, the mean mutual information between the first adjective and the noun was found to be larger than that between the two adjectives. The statistical assumptions behind the experiment do not allow very robust conclusions to be drawn, and more work is needed along these lines.</Paragraph>
      <Paragraph position="2"> 6The importance of the distinction between predicative and modifier trees with respect to how derivations are defined was not appreciated in the earlier work; derivations were taken to be of the independent variety in all cases. In future work, we plan to remedy this flaw.</Paragraph>
      <Paragraph position="3">  Note that the independence of the adjunction of modifiers in the syntax does not imply that semantically there is no precedence or scoping relation between them. As exemplified in Figure 4, the derived tree generated by multiple independent adjunctions at a single node still manifests nesting relationships among the adjoined trees. This fact may be used to advantage in the semantic half of a synchronous tree-adjoining grammar to specify the semantic distinction between, for example, the following two sentences: 7 (7)a. Brockway paid for the tickets twice intentionally. null b. Brockway paid for the tickets intentionally twice.</Paragraph>
      <Paragraph position="4"> We hope to address this issue in greater detail in future work on synchronous tree-adjoining grammars. null</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="171" end_page="172" type="metho">
    <SectionTitle>
4 Informal Specification of
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="171" end_page="172" type="sub_section">
      <SectionTitle>
Extended Derivations
</SectionTitle>
      <Paragraph position="0"> We have presented several arguments that the standard notion of derivation does not allow for an appropriate specification of dependencies to be captured. An extended notion of derivation is  needed that . Differentiates predicative and modifier auxiliary trees; 2. Requires dependent derivations for predicative trees; 3. Requires independent derivations for modifier trees; and 4. Unambiguously specifies a derived tree.  Recall that a derivation tree is a tree with unordered arcs where each node is labeled by an elementary tree of a TAG and each arc is labeled by a tree address specifying a node in the parent tree. In a standard derivation tree no two sibling arcs can be labeled with the same address. In an extended derivation tree, however, the condition is relaxed: No two sibling arcs to predicative trees can be labeled with the same address. Thus, for any given address there can be at most one predicative tree and several modifier trees adjoined at rWe are indebted to an anonymous reviewer for raising this issue crisply through examples similar to those given here.</Paragraph>
      <Paragraph position="1">  associated derived tree that node. So as to fully specify the output derived tree, we specify a partial ordering on sibling arcs by mandating that arcs corresponding to modifier trees adjoined at the same address are treated as ordered left-to-right. However, all other arcs, including those for predicative adjunctions are left unordered.</Paragraph>
      <Paragraph position="2"> A derivation tree specifies a derived tree through a bottom-up traversal (as is standard since the work of Vijay-Shanker (1987)). The choice of a particular traversal order plays the same role as choosing a particular rewriting derivation order in a context-free grammar -- leftmost or rightmost, say -- in eliminating spurious ambiguity due to inconsequential reordering of operations. An extended derivation tree specifies a derived tree in exactly the same manner, except that there must be a specification of the derived tree specified when several trees are adjoined at the same node.</Paragraph>
      <Paragraph position="3"> Assume that in a given tree T at a particular address t, the predicative tree P and the k modifier trees M1,..., Mk (in that order) are directly adjoined. Schematically, the extended derivation tree would appear as in Figure 4(a). Associated with the subtrees rooted at the k + 1 elementary auxiliary trees in this derivation are k + 1 derived auxiIiary trees (Ap and A1,..., Ak, respectively). (The derived auxiliary trees are specified inductively; it is this sense in which the definition corresponds to a bottom-up traversal.) There are many possible trees that might be entertained as the derived tree associated with the derivation rooted at T, one for each permutation  of the k + 1 auxiliary trees. Since the ordering of the modifiers in the derivation tree is essentially arbitrary, we can fix on a single ordering of these in the output tree. We will choose the ordering in which the top to bottom order in the derived tree follows the partial order on the nodes in the derivation tree. Thus A1 appears higher in the tree than A2, A2 higher than A3 and so forth. This much is arbitrary.</Paragraph>
      <Paragraph position="4"> The choice of where the predicative tree goes, however, is consequential. There are k + 1 possible positions, of which only two can be seriously maintained: outermost, at the top of the tree; or innermost, at the bottom. We complete the (informal) definition of extended derivation by specifying the derived tree corresponding to such a derivation to manifest outermost predication as depicted in Figure 4(b).</Paragraph>
      <Paragraph position="5"> Both linguistic and technical consequences argue for outermost, rather than innermost, predication. Linguistically, the outermost method specifies that if both a predicative tree and a modifier tree are adjoined at a single node, then the predicative tree attaches &amp;quot;higher&amp;quot; than the modifier tree; in terms of the derived tree, it is as if the predicative tree were adjoined at the root of the modifier tree. This accords with the semantic intuition that in such a case, the modifier is modifying the original tree, not the predicative one. (The alternate &amp;quot;reading&amp;quot;, in which the modifier modifies the predicative tree, is still obtainable under an outermost-predication standard by having the modifier auxiliary tree adjoin at the root node of the predicative tree.) In contrast, the innermost-predication method specifies that the modifier tree attaches higher, as if the modifier tree adjoined at the root of the predicative tree and was therefore modifying the predicative tree, contra semantic intuitions. null From a technical standpoint, the outermost-predication method requires no changes to the parsing rules to be presented later, but only a single addition. The innermost-predication method induces some subtle interactions between the original parsing rules and the additional one, necessitating a much more complicated set of modifications to the original algorithm. (In fact, the complexities in generating such an algorithm constituted the precipitating factor that led us to revise our original, innermost-predication, attempt at redefining tree-adjoining derivation.)</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="172" end_page="178" type="metho">
    <SectionTitle>
5 Formal Specification of Ex-
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="172" end_page="178" type="sub_section">
      <SectionTitle>
tended Derivations
</SectionTitle>
      <Paragraph position="0"> In all three application areas of TAGs, the need is evidenced for a modified notion of derivation that retains the dependent notion of derivation for predicative trees but mandates independent adjunction for modifier trees. A formal definition of extended derivation can be given by means of a compilation of tree-adjoining grammars into linear indexed grammars. We discuss such a compilation in this section. This compilation is especially useful as it can be used as the basis for a parsing algorithm that recovers the extended derivations for strings. The design of the algorithm is the topic of Section 6.</Paragraph>
      <Paragraph position="1"> Linear indexed grammars (LIG) constitute a grammatical framework based, like context-free, context-sensitive, and unrestricted rewriting systems, on rewriting strings of nonterminal and terminal symbols. Unlike these systems, linear indexed grammars, like the indexed grammars from which they are restricted, allow stacks of marker symbols, called indices, to be associated with the nonterminal symbols being rewritten. The linear version of the formalism allows the full index information from the parent to be used to specify the index information for only one of the child constituents. Thus, a linear indexed production can be given schematically as: curred. For these reasons, we use the technique in this work.</Paragraph>
      <Paragraph position="2"> The compilation process that manifests the standard definition of derivation can be most easily understood by viewing nodes in a TAG elementary tree as having both a top and bottom component, identically marked for nonterminal category, that dominate (but may not immediately dominate) each other. (See Figure 5.) The rewrite rules of the corresponding linear indexed grammar capture the immediate domination between a bottom node and its child top nodes directly, and capture the domination between top and bottom parts of the same node by optionally allowing rewriting from the top of a node to an appropriate auxiliary tree, and from the foot of the auxiliary tree back to the bottom of the node. The index stack keeps track of the nodes that adjunction has occurred on so that the recognition to the left and the right of the foot node will occur under identical assumption of derivation structure. In summary, the following LIG rules are generated: . Immediate domination dominating foot: For each auxiliary tree node r/ that dominates the foot node, with children 01, * .., rl, .... , r/,, where r/a is the child that also dominates the foot node, include a production</Paragraph>
      <Paragraph position="4"> The Ni are nonterminals, the/3/strings of indices.</Paragraph>
      <Paragraph position="5"> The &amp;quot;..&amp;quot; notation stands for the remainder of the stack below the given string of indices. Note that only one element on the right-hand side, Ns, inherits the remainder of the stack from the parent.</Paragraph>
      <Paragraph position="6"> (This schematic rule is intended to be indicative, not definitive. We ignore issues such as the optionality of the inherited stack, how terminal symbols fit in, and so forth. Vijay-Shanker and Weir (1990) present a complete discussion.) Vijay-Shanker and Weir (1990) present a way of specifying any TAG as a linear indexed grammar.</Paragraph>
      <Paragraph position="7"> The LIG version makes explicit the standard notion of derivation being presumed. Also, the LIG version of a TAG grammar can be used for recognition and parsing. Because the LIG formalism is based on augmented rewriting, the parsing algorithms can be much simpler to understand and easier to modify, and no loss of generality is in. Immediate domination not including foot: For each elementary tree node r/ that does not dominate a foot node, with children r/i,..., r/,~, include a production b\[,\] --, t\[r/d...t\[,,\] . No adjunction: For each elementary tree node r/that is not marked for substitution or obligatory adjunction, include a production . Start root of adjunction: For each elementary tree node r/on which the auxiliary tree/3 with root node r k can be adjoined, include the following production: t\[..,or\]  foot node r/! can be adjoined, include the following production: ---. b\[..,fl 6. Start substitution: For each elementary tree node ~/marked for substitution on which the initial tree a with root node qr can be substituted, include the production We will refer to productions generated by Rule i above as Type i productions. For example, Type 3 productions are of the form t\[..~/\] -* b\[..T/\]. For further information concerning the compilation see the work of Vijay-Shanker and Weir (1990) and Schabes (1991). For present purposes, it is sufficient to note that the method directly embeds the standard notion of derivation in the rewriting process. To perform an adjunction, we move (by Rule 4) from the node adjoined at to the top of the root of the auxiliary tree. At the root, additional adjunctions might be performed. When returning from the foot of the auxiliary tree back to the node where adjunction occurred, rewriting continues at the bottom of the node (see Rule 5), not the top, so that no more adjunctions can be started at that node. Thus, the dependent nature of predicative adjunction is enforced because only a single adjunction can occur at any given node.</Paragraph>
      <Paragraph position="8"> In order to permit extended derivations, we must allow for multiple modifier tree adjunctions at a single node. There are two natural ways this might be accomplished, as depicted in Figure 6.</Paragraph>
      <Paragraph position="9">  tive and modifier adjunctions with top and bottom of each node separated 1. Modified start foot of adjunction rule: Allow moving from the bottom of the foot of a modifier auxiliary tree to the top (rather than the bottom) of the node at which it adjoined (Figure 6b).</Paragraph>
      <Paragraph position="10"> 2. Modified start root of adjunction rule: Allow moving from the bottom (rather than the top) of a node to the top of the root of a modifier auxiliary tree (Figure 6c).</Paragraph>
      <Paragraph position="11"> As can be seen from the figures, both of these methods allow recursion at a node, unlike the original method depicted in Figure 6a. Thus multiple modifier trees are allowed to adjoin at a single node. Note that since predicative trees fall under the original rules, at most a single predicative tree can be adjoined at a node. The two methods correspond exactly to the innermost- and outermost-predication methods discussed in Section 4. For the reasons described there, the latter is preferred. In summary, independent derivation structures can be allowed for modifier auxiliary trees by starting the adjunction process from the bottom, rather than the top of a node for those trees. Thus, we split Type 4 LIG productions into two subtypes for predicative and modifier trees, respectively. 4a. Start root of predicative adjunction: For each elementary tree node r/on which the predicative auxiliary tree fl with root node fir can be adjoined, include the following production: -+ 4b. Start root of modifier adjunction: For each elementary tree node y on which the modifier auxiliary tree/~ with root node r/~ can be adjoined, include the following production: --, Once this augmentation has been made, we no longer need to allow for adjunctions at the root nodes of modifier auxiliary trees, as repeated adjunction is now allowed for by the new rule 4b. Consequently, P~ules 4a and 4b must treat all modifier auxiliary tree root nodes as if they have adjoining constraints that forbid modifier tree adjunctions that do not correspond to modification of the tree itself.</Paragraph>
      <Paragraph position="12"> This simple modification to the compilation process from TAG to LIG fully specifies the modified notion of derivation. The recognition algorithms for TAG based on this compilation, however, must be adjusted to allow for the new rule types.</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>
Download Original XML