XML Viewer - p96-1033

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/96/p96-1033_metho.xml
Size: 22,259 bytes
Last Modified: 2025-10-06 14:14:19
<?xml version="1.0" standalone="yes"?>
<Paper uid="P96-1033">
  <Title>Magic for Filter Optimization in Dynamic Bottom-up Processing</Title>
  <Section position="3" start_page="247" end_page="251" type="metho">
    <SectionTitle>
2 Definite Clause Characterization
</SectionTitle>
    <Paragraph position="0"> of Filtering Many approaches focus on exploiting specific knowledge about grammars and/or the computational task(s) that one is using them for by making filtering explicit and extending the processing strategy such that this information can be made effective. In generation, examples of such extended processing strategies are head corner generation with its semantic linking (Shieber et al., 1990) or bottom-up (Earley) generation with a semantic filter (Shieber, 1988). Even though these approaches often accomplish considerable improvements with respect to efficiency or termination behavior, it remains unclear how these optimizations relate to each other and what comprises the logic behind these specialized forms of filtering. By bringing filtering into the logic underlying the grammar it is possible to show in a perspicuous and logically clean way how and why filtering can be optimized in a particular fashion and how various approaches relate to each other.</Paragraph>
    <Section position="1" start_page="247" end_page="247" type="sub_section">
      <SectionTitle>
2.1 Magic Compilation
</SectionTitle>
      <Paragraph position="0"> Magic makes filtering explicit through characterizing it as definite clauses. Intuitively understood, filtering is reversed as binding information that normally becomes available as a result of top-down evaluation is derived by bottom-up evaluation of the definite clause characterization of filtering. The following is the basic Magic algorithm taken from Ramakrishnan et al. (1992).</Paragraph>
      <Paragraph position="1"> Let P be a program and q(E) a query on the program. We construct a new program ping. Initially ping is empty.</Paragraph>
      <Paragraph position="2">  1. Create a new predicate magic_p for each predicate p in P. The arity is that of p.</Paragraph>
      <Paragraph position="3"> 2. For each rule in P, add the modified  version of the rule to p-~9. If rule r has head, say, p({), the modified version is obtained by adding the literal magic_p(t) to the body.</Paragraph>
      <Paragraph position="4"> 3. For each rule r in P with head, say, p({), and for each literal q~(~) in its body, add a magic rule to ping. The head is magic_qi(~). The body contains the literal magic_p(t), and all the literals that precede qi in the rule.</Paragraph>
      <Paragraph position="5"> 4. Create a seed fact magic_q(5) from the query.</Paragraph>
      <Paragraph position="6"> To illustrate the algorithm I zoom in on the application of the above algorithm to one particular grammar rule. Suppose the original grammar rule looks as follows: s (P0, P, VForm, SSem) : vp(Pl ,P,VForm, \[CSem\] ,SSem), np (P0,PI, CSem).</Paragraph>
      <Paragraph position="7"> Step 2 of the algorithm results in the following modified version of the original grammar rule: s (P0, P,VForm, SSem) : magic_s (P0,P,VForm, SSem) , vp(Pl ,P ,VForm, \[CSem\] , SSem) , np (P0, PI, CSem).</Paragraph>
      <Paragraph position="8"> A magic literal is added to the right-hand side of the rule which 'guards' the application of the rule. This does not change the semantics of the original grammar as it merely serves as a way to incorporate the relevant bindings derived with the magic predicates to avoid redundant applications of a rule. Corresponding to the first right-hand side literal in the original rule step 3 derives the following magic rule: magic_vp (Pl, P, VForm, \[CSem\] , SSem) : magic_s (P0, P, VForm, SSem) .</Paragraph>
      <Paragraph position="9"> It is used to derive from the guard for the original rule a guard for the rules defining the first right-hand side literal. The second right-hand side literal in the original rule leads to the following magic rule: magic_up (P0, P1, CSem) : magi c_s (P0, P, VForm, SSem) , vp(Pl,P,VForm, \[CSem\] ,SSem) .</Paragraph>
      <Paragraph position="10"> Finally, step 4 of the algorithm ensures that a seed is created. Assuming that the original rule is defining the start category, the query corresponding to the generation of the s &amp;quot;John buys Mary a book&amp;quot; leads to the following seed: magic_s (P0 ,P, finite ,buys (john, a (book) ,mary) ). The seed constitutes a representation of the initial bindings provided by the query that is used by the magic predicates to derive guards. Note that the creation of the seed can be delayed until run-time, i.e., the grammar does not need to be recompiled for every possible query.</Paragraph>
    </Section>
    <Section position="2" start_page="247" end_page="250" type="sub_section">
      <SectionTitle>
2.2 Example
</SectionTitle>
      <Paragraph position="0"> Magic compilation is illustrated on the basis of the simple logic grammar extract in figure 1. This grammar has been optimized automatically for generation (Minnen et al., 1996): The right-hand sides of the rules are reordered such that a simple left-to-right evaluation order constitutes the optimal evaluation order. With this grammar a simple top-down generation strategy does not terminate as a result of the head recursion in rule 3. It is necessary to use  memoization extended with an abstraction function and a subsumption check. Strict bottom-up generation is not attractive either as it is extremely inefficient: One is forced to generate all possible natural language expressions licensed by the grammar and subsequently check them against the start category. It is possible to make the process more efficient through excluding specific lexical entries with a semantic filter. The use of such a semantic filter in bottom-up evaluation requires the grammar to obey the semantic monotonicity constraint in order to ensure completeness(Shieber, 1988) (see below).</Paragraph>
      <Paragraph position="1"> The 'magic-compiled grammar' in figure 2 is the result of applying the algorithm in the previous section to the head-recursive example grammar and subsequently performing two optimizations (Beeri and Ramakrishnan, 1991): All (calls to) magic predicates corresponding to lexical entries are removed. Furthermore, data-flow analysis is used to fine-tune the magic predicates for the specific processing task at hand, i.e., generation3 Given a user-specified abstract query, i.e., a specification of the intended input (Beeri and Ramakrishnan, 1991) those arguments which are not bound and which therefore serve no filtering purpose are removed. The modified versions of the original rules in the grammar are adapted accordingly. The effect of taking data-flow into account can be observed by comparing the rules for mag+-c_vp and mag+-c_np in the previous section with rule 12 and 14 in figure 2, respectively.</Paragraph>
      <Paragraph position="2"> Figure 3 shows the results from generation of the sentence &amp;quot;John buys Mary a book&amp;quot;. In the case of this example the seed looks as follows: magic_sentence (decl (buys (john, a (book) ,mary) ) ).</Paragraph>
      <Paragraph position="3"> The \]acts, i.e., passive edges/items, in figure 3 resulted from semi-naive bottom-up evaluation (Ra-IFor expository reasons some data-flow information that does restrict processing is not taken into account. E.g., the fact that the vp literal in rule 2 is always called with a one-element list is ignored here, but see section 3.1.</Paragraph>
      <Paragraph position="4"> makrishnan et al., 1992) which constitutes a dynamic bottom-up evaluation, where repeated derivation of facts from the same earlier derived facts (as in naive evaluation; Bancilhon, 1985) is blocked. (Active edges are not memoized.) The figure 2 consist of two tree structures (connected through dotted lines) of which the left one corresponds to the filtering part of the derivation. The filtering tree is reversed and derives magic facts starting from the seed in a bottom-up fashion. The tree on the right is the proof tree for the example sentence which is built up as a result of unifying in the derived magic facts when applying a particular rule. E.g., in order to derive fact 13, magic fact 2 is unified with the magic literal in the modified version of rule 2 (in addition to the facts 12 and 10). This, however, is not represented in order to keep the figure clear. Dotted lines are used to represent when 'normal' facts are combined with magic facts to derive new magic facts.</Paragraph>
      <Paragraph position="5"> As can be reconstructed from the numbering of the facts in figure 3 the resulting processing behavior is identical to the behavior that would result from Earley generation as in Gerdemann (1991) except that the different filtering steps are performed in a bottom-up fashion. In order to obtain a generator similar to the bottom-up generator as described in Shieber (1988) the compilation process can be modified such that only lexical entries are extended with magic literals. Just like in case of Shieber's bottom-up generator, bottom-up evaluation of magic-compiled grammars produced with this Magic variant is only guaranteed to be complete in case the original grammar obeys the semantic monotonicity constraint.</Paragraph>
      <Paragraph position="6"> ~The numbering of the facts corresponds to the order in which they are derived. A number of lexical entries have been added to the example grammar. The facts corresponding to lexical entries are ignored. For expository reasons the phonology and semantics of lexical entries (except for vs) are abbreviated by the first letter. Furthermore the fact corresponding to the vp &amp;quot;buys Mary a book John&amp;quot; is not included.</Paragraph>
      <Paragraph position="8"/>
      <Paragraph position="10"> As a result of characterizing filtering by a definite clause representation Magic brings filtering inside of the logic underlying the grammar. This allows it to be optimized in a processor independent and logically clean fashion. I discuss two possible filter optimizations based on a program transformation technique called unfolding (Tamaki and Sato, 1984) also referred to as partial execution, e.g., in Pereira and Shieber (1987).</Paragraph>
    </Section>
    <Section position="3" start_page="250" end_page="251" type="sub_section">
      <SectionTitle>
3.1 Subsumption Checking
</SectionTitle>
      <Paragraph position="0"> Just like top-down evaluation of the original grammar bottom-up evaluation of its magic compiled version falls prey to non-termination in the face of head recursion. It is however possible to eliminate the subsumption check through fine-tuning the magic predicates derived for a particular grammar in an off-line fashion. In order to illustrate how the magic predicates can be adapted such that the subsumption check can be eliminated it is necessary to take a closer look at the relation between the magic predicates and the facts they derive. In figure 4 the relation between the magic predicates for the example grammar is represented by an unfolding tree (Pettorossi and Proietti, 1994). This, however, is not an ordinary unfolding tree as it is constructed on the basis of an abstract seed, i.e., a seed adorned with a specification of which arguments are to be considered bound. Note that an abstract seed can be derived from the user-specified abstract query. Only the magic part of the abstract unfolding tree is represented. null  relation between the magic predicates in the compiled grammar.</Paragraph>
      <Paragraph position="1"> The abstract unfolding tree in figure 4 clearly shows why there exists the need for subsumption checking: Rule 13 in figure 2 produces infinitely many magic_vp facts. This 'cyclic' magic rule is derived from the head-recursive vp rule in the example grammar. There is however no reason to keep this rule in the magic-compiled grammar. It influences neither the efficiency of processing with the grammar nor the completeness of the evaluation process.  Finding these types of cycles in the magic part of the compiled grammar is in general undecidable. It is possible though to 'trim' the magic predicates by applying an abstraction function. As a result of the explicit representation of filtering we do not need to postpone abstraction until run-time, but can trim the magic predicates off-line. One can consider this as bringing abstraction into the logic as the definite clause representation of filtering is weakened such that only a mild form of connectedness results which does not affect completeness (Shieber, 1985). Consider the following magic rule: magic_vp(VForm, \[CgemlArgs\] , SSem) :magic_vp (VForm, Args, SSem) .</Paragraph>
      <Paragraph position="2"> This is the rule that is derived from the head-recursive vp rule when the partially specified sub-categorization list is considered as filtering information (cf., fn. 1). The rule builds up infinitely large subcategorization lists of which eventually only one is to be matched against the subcategorization list of, e.g., the lexical entry for &amp;quot;buys&amp;quot;. Though this rule is not cyclic, it becomes cyclic upon off-line abstraction: null magic_vp (VForm, \[CSem I_3 , SSem) : magic_vp (VForm, \[CSem2l_\] , SSem) .</Paragraph>
      <Paragraph position="3"> Through trimming this magic rule, e.g., given a bounded term depth (Sato and Tamaki, 1984) or a restrictor (Shieber, 1985), constructing an abstract unfolding tree reveals the fact that a cycle results from the magic rule. This information can then be used to discard the culprit.</Paragraph>
      <Paragraph position="4">  Removing the direct or indirect cycles from the magic part of the compiled grammar does eliminate the necessity of subsumption checking in many cases. However, consider the magic rules 14 and 15 in figure 2. Rule 15 is more general than rule 14. Without subsumption checking this leads to spurious ambiguity: Both rules produce a magic fact with which a subject np can be built. A possible solution to this problem is to couple magic rules with the modified version of the original grammar rule that instigated it. To accomplish this I propose a technique that can be considered the off-line variant of an index- null ing technique described in Gerdemann (1991). 3 The indexing technique is illustrated on the basis of the running example: Rule 14 in figure 1 is coupled to the modified version of the original s rule that instigated it, i.e., rule 2. Both rules receive an index: s (PO, P, VForm, SSem) : magic _s (P0, P, VForm, SSem), vp(P1 ,P,VForm, \[CSem\], SSem), np (P0,P1 ,CSem, index_l).</Paragraph>
      <Paragraph position="5"> magic_rip (CSem, index_l) : magi c_s (P0, P, VForm, SSem), vp (P1, P, VForm, \[CSem\], SSem).</Paragraph>
      <Paragraph position="6"> The modified versions of the rules defining nps are adapted such that they percolate up the index of the guarding magic fact that licensed its application. This is illustrated on the basis of the adapted version of rule 14: np (P0, P, NPSem, INDEX) : magic_rip (NPSem, INDEX), pn (P0, P, NPSem).</Paragraph>
      <Paragraph position="7"> As is illustrated in section 3.3 this allows the avoidance of spurious ambiguities in the absence of subsumption check in case of the example grammar.</Paragraph>
    </Section>
    <Section position="4" start_page="251" end_page="251" type="sub_section">
      <SectionTitle>
3.2 Redundant Filtering Steps
</SectionTitle>
      <Paragraph position="0"> Unfolding can also be used to collapse filtering steps.</Paragraph>
      <Paragraph position="1"> As becomes apparent upon closer investigation of the abstract unfolding tree in figure 4 the magic predicates magic_sentence, magic_s and magic_vp provide virtually identical variable bindings to guard bottom-up application of the modified versions of the original grammar rules. Unfolding can be used to reduce the number of magic facts that are produced during processing. E.g., in figure 2 the magic_s rule: magic_s (finite, SSem) : magic_sentence (decl (SSem)) .</Paragraph>
      <Paragraph position="2"> can be eliminated by unfolding the magic_s literal in the modified s rule:  s(PO,P,VFOP~,SSem):magic_s(VFORM,SSem), null vp(P1,P,VF01~,,\[CSem\],SSem), np(P0,P1,CSem).</Paragraph>
      <Paragraph position="3"> This results in the following new rule which uses the seed for filtering directly without the need for an intermediate filtering step:  Note that the unfolding of the magic_s literal leads to the instantiation of the argument VFORM to finite. As a result of the fact that there are no other magic_s literals in the remainder of the magic-compiled grammar the magic_s rule can be discarded.</Paragraph>
      <Paragraph position="4"> This filter optimization is reminiscent of computing the deterministic closure over the magic part of a compiled grammar (DSrre, 1993) at compile time. Performing this optimization throughout the magic part of the grammar in figure 2 not only leads to a more succinct grammar, but brings about a different processing behavior. Generation with the resulting grammar can be compared best with head corner generation (Shieber et al., 1990) (see next section).</Paragraph>
    </Section>
    <Section position="5" start_page="251" end_page="251" type="sub_section">
      <SectionTitle>
3.3 Example
</SectionTitle>
      <Paragraph position="0"> After cycle removal, incorporating relevant indexing and the collapsing of redundant magic predicates the magic-compiled grammar from figure 2 looks as displayed in figure 5. Figure 6 shows the chart resulting from generation of the sentence &amp;quot;John buys Mary a book&amp;quot; .4 The seed is identical to the one used for the example in the previous section. The facts in the chart resulted from not-so-naive bottom-up evaluation: semi-naive evaluation without subsumption checking (Ramakrishnan et al., 1992). The resulting processing behavior is similar to the behavior that would result from head corner generation except that the different filtering steps are performed in a bottom-up fashion. The head corner approach jumps top-down from pivot to pivot in order to satisfy its assumptions concerning the flow of semantic information, i.e., semantic chaining, and subsequently generates starting from the semantic head in a bottom-up fashion. In the example, the seed is used without any delay to apply the base case of the vp-procedure, thereby jumping over all intermediate chain and non-chain rules. In this respect the initial reordering of rule 2 which led to rule 2 in the final grammar in figure 5 is crucial (see section 4).</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="251" end_page="253" type="metho">
    <SectionTitle>
4 Dependency Constraint on
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="251" end_page="253" type="sub_section">
      <SectionTitle>
Grammar
</SectionTitle>
      <Paragraph position="0"> To which extent it is useful to collapse magic predicates using unfolding depends on whether the grammar has been optimized through reordering the 4In addition to the conventions already described regarding figure 3, indices are abbreviated.</Paragraph>
      <Paragraph position="1">  book&amp;quot; with the magic-compiled grammar from figure 5. right-hand sides of the rules in the grammar as discussed in section 3.3. If the s rule in the running example is not optimized, the resulting processing behavior would not have fallen out so nicely: In this case it leads either to an intermediate filtering step for the non-chaining sentence rule or to the addition of the literal corresponding to the subject np to all chain and non-chain rules along the path to the semantic head.</Paragraph>
      <Paragraph position="2"> Even when cycles are removed from the magic part of a compiled grammar and indexing is used to avoid spurious ambiguities as discussed in the previous section, subsumption checking can not always be eliminated. The grammar must be finitely ambiguous, i.e., fulfill the off-line parsability constraint (Shieber, 1989). Furthermore, the grammar is required to obey what I refer to as the dependency constraint: When a particular right-hand side literal can not be evaluated deterministically, the results of its evaluation must uniquely determine the remainder of the right-hand side of the rule in which it appears. Figure 7 gives a schematic example of a grammar that does not obey the dependency constraint. Given  a derived fact or seed magic_cat_l(property_l) bottom-up evaluation of the abstract grammar in figure 7 leads to spurious ambiguity. There are two possible solutions for cat_2 as a result of the fact that the filtering resulting from the magic literal in rule 1 is too unspecific. This is not problematic as long as this nondeterminism will eventually disappear, e.g., by combining these solutions with the solutions to cat_3. The problem arises as a result of the fact that these solutions lead to identical filters for the evaluation of the cat_~ literal, i.e., the solutions to cat_2 do not uniquely determine cat_3.</Paragraph>
      <Paragraph position="3"> Also with respect to the dependency constraint an optimization of the rules in the grammar is important. Through reordering the right-hand sides of the rules in the grammar the amount of nondeterminism can be drastically reduced as shown in Minnen et al.</Paragraph>
      <Paragraph position="4"> (1996). This way of following the intended semantic dependencies the dependency constraint is satisfied automatically for a large class of grammars.</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="253" end_page="253" type="metho">
    <SectionTitle>
5 Concluding Remarks
</SectionTitle>
    <Paragraph position="0"> Magic evaluation constitutes an interesting combination of the advantages of top-down and bottom-up evaluation. It allows bottom-up filtering that achieves a goai-directedness which corresponds to dynamic top-down evaluation with abstraction and subsumption checking. For a large class of grammars in effect identical operations can be performed off-line thereby allowing for more efficient processing.</Paragraph>
    <Paragraph position="1"> Furthermore, it enables a reduction of the number of edges that need to be stored through unfolding magic predicates.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML