File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/88/c88-2120_metho.xml

Size: 40,041 bytes

Last Modified: 2025-10-06 14:12:05

<?xml version="1.0" standalone="yes"?>
<Paper uid="C88-2120">
  <Title>An Augmented Context Free Grammar for Discourse</Title>
  <Section position="2" start_page="0" end_page="0" type="metho">
    <SectionTitle>
Remko SCHA and Livia POLANYI
BBN Laboratories
</SectionTitle>
    <Paragraph position="0"> latter case, a correct constituent analysis of the discourse is necessary to establish the arguments of the rhetorical relations. On the other hand, the rhetorical relations themselves constitute an important structure-building component of discourse.</Paragraph>
  </Section>
  <Section position="3" start_page="0" end_page="575" type="metho">
    <SectionTitle>
1. Introduction
</SectionTitle>
    <Paragraph position="0"> Though a wealth of insights on the structure and meaning of discourse ha,,; been gathered by researchers in linguistics, psychology, ethnomethodelogy and artificial intelligence, these insights have not been integrated into formal grammars which display the breadth, depth and precision ef formal treatments of sentential syntax and semantics. In the present paper we make a step towards a formal, integrated description of the surface structure and semantic interpretation of discourse. We introduce a formalism which uses augmented context free rules for specifying discourse grammars, and demonstrate its viability by developing a set of syntactic/semantic rules which covers a number of important discourse phenomena. We discuss the issue of parsing discourse in a semi-deterministic left-to-right fashion, and relate the grammar presented here to the strategies outlined in \[24\] \[25\] for building up a structural description of an unfolding discourse. Finally, we compare the formalism used here to some possible alternatives.</Paragraph>
    <Paragraph position="1"> 2. Discourse Structure and Discourse Semantics The semantic interpretation of the utterances in a discourse has been shown to depend on the structural relations obtaining among the segments of that discourse. \[8\] \[17\] In developing a grammar for the surface structure of discourse, it is our aim to account for the semantically relevant aspects of its structure. Two phenomena of fundamental importance for semantic interpretation depend crucially on discourse structure: context dependence and rhetorical structure.</Paragraph>
    <Paragraph position="2"> Context Dependence of Utterance Meanings Context dependence of utterance meanings is a pervasive phenomenon in language. When an utterance is analysed in isolation, its meaning is underdetermined in many ways as an effect of the following phenomena: * Indexicallty. One rarely makes a statement or asks a question about the universe at large: every utterance presupposes an implicit temporal, spatial and topical framework that constrains the scopes of the meanings of the constituents of the utterance. Every sentence must therefore be evaluated with respect to a frame of reference which i,,; usually left implicit in the sentence itself.</Paragraph>
    <Paragraph position="3"> * AnaphC/}ra. Many utterances contain pronouns and definite descriptions which constitute overt references to the previous discourse. An analysis of the structure of the previou:~ discourse is necessary to resolve such anaphoric references correctly.</Paragraph>
    <Paragraph position="4"> ,,Implicit arguments. Many natural language words are semantically unsaturated, needing externally provided arguments in order to be interpreted felicitously.</Paragraph>
    <Paragraph position="5"> Nevertheless, such constructions can be used in sentences which do not mention their arguments explicitly, if these arguments can be inherited from previous parts of the discourse. (&amp;quot;John is ta//er \[than Peter\].&amp;quot; &amp;quot;What is the speed \[of John's car\]?&amp;quot;) (Cf. \[41) The Rhetorical Structure of Discourse The insiflht of work focused on the rhetorical structure of discourse is that a speaker engaged in. a discourse may perform speech acts whose illocutionary force has scope over complex propositions which are built up out of individual sentence meanings and the rhetorical relations between them. (See, for example, \[13\], \[19\], and \[20\].) These rhetorical relations may be overtly expressed, or they may have to be abducted on the basis of the sentence meanings. Paradigmatic examples of such complex propositions are &amp;quot;,4 caused B&amp;quot;, &amp;quot;A was caused by B&amp;quot;, &amp;quot;A provides evklence for B&amp;quot;, etc. -- where A and B may stand for propositions expressed by individual sentences, or may themselves be complex prepositions expressed by discourse segments. Because of the In the approach to discourse semantics outlined in this paper, every sentence is initially interpreted in a local, context-independent fashion. This results in a meaning representation which will usually contain free variables, standing for the discourse-dependent elements in the utterance meaning. When a sentence is integrated into the ongoing discourse, these variables are bound to values picked up from the context into which the sentence is inserted. The next section describes this in more detail.</Paragraph>
    <Paragraph position="6"> 3. An Augmented Context Free Grammar for Discourse.</Paragraph>
    <Paragraph position="7"> Discourses have a hierarchical structure. They are built up recursively out of units of various kinds which can occur as constituents of each other. To account for this, a discourse grammar must be able to assign a tree structure to a discourse. We call this tree structure the discourse parse tree. To describe in a formal way how a discourse parse tree is constructed out of constituent sentences, we use a context free grammar whose non-terminal symbols are augmented with attribute/value pairs. (Distinct non-terminal categories have distinct sets of attributes.) Context free rules describe how the constituent segments of a discourse (which we call discourse constituent units or dcu's) are built up out of their subconstituents. The values of the attributes on a non-terminal represent the relevant structural and semantic properties of the dcu generated by that non-terminal. Every attribute has a fixed set of possible value-expressions. The value-expressions may be of different kinds: they may be atomic, they may themselves be sets of attribute/value pairs, or they may be logical expressions.</Paragraph>
    <Paragraph position="8"> Value-expressions often store parfia/ information; therefore, they may contain free variables. A value-expression stands for the set of its ground instances. (A value-expression without variables thus stands for a singleton set.) When an attribute has a value-expression which stands for the empty set, the complex category symbol that contains this value-expression fails to label a possible dcu.</Paragraph>
    <Paragraph position="9"> The context-free rules enforce agreement and upwards-inheritance of the relevant properties of different constituents through tile fact that different occurrences of the same variable take on identical values. To put this in more precise terms, we define the meaning of an augmented context free rule as follows. If A, B, C, Y, Z stand for complex category symbols (including attribute/value-expression pairs) and Idcul\]Y= and \[dcu2\]z are legitimate dcu's, the rule &amp;quot;A =&gt; B C&amp;quot; legitimizes the dcu ~(\[\[dcul\]y \[dcu2\]z\]A ) iff the substitution ~ is the most genera/unifier I of the terms &lt;B,C&gt; and &lt;Y,Z&gt;, and G(A) is a legitimate complex category symbol, not containing empty attribute-value expressions.</Paragraph>
    <Paragraph position="10"> 3.1. Attribute Values and Semantic Interpretation.</Paragraph>
    <Paragraph position="11"> The propagation of attribute values between dcu's plays an important part in establishing the semantic interpretation of the utterances in a discourse. We now list some of these attributes, and discuss their role in semantic interpretation and discourse processing. The Semantics attribute records a logical formula representing the meaning of the discourse constituent unit that it is associated with.</Paragraph>
    <Paragraph position="12"> Anaphorlc elements which have not been resolved inside the dcu are represented by free variables. There are two mechanisms for anaphor resolution: (1) unification with value-expressions of attributes of other dcu's, and (2) explicit search processes involving the Discourse Referents Set of accessible dcu's.</Paragraph>
    <Paragraph position="13"> The Discourse Referents Set records the entities introduced in the discourse unit that it is associated with. These, plus the entities in the Discourse Referents,Sets of the embedding units (dominating nodes in the tree) are the entities which are available for anaphodc reference in an utterance which extends or expands that discourse unit. Every Discourse Referent is a pair consisting of (1) the linguistic expression that introduced it, and (2) the semantic representation that the system attached to that expression. Discourse Referent Sets are accessed in different ways by a number of algorithms, which resolve the meanings of ~Tho concepts we use here ere essentially the ones that were developed for term unilication in first-order logic theorem proving ~30\]. We assume some straighfloPaLU~ generalization of these concepts which deals with the fact that our &amp;quot;terms&amp;quot; have a structtji# which, though not interestingly different, is somewhat richer.</Paragraph>
    <Paragraph position="14">  different types of context-dependent expressions such as definite descriptions, anaphoric pronouns~ demonstrative pronouns, the words &amp;quot;one&amp;quot; and &amp;quot;ones&amp;quot;, and implicit arguments of function nouns, comparatives, etc.</Paragraph>
    <Paragraph position="15"> The Reference-Time records the time-interval which is to serve as the temporal index for extensions and expansions of the discourse unit under consideration..The Reference-Time can be' reset by the occurrence of explicit tempera( adverbs. Narratives constitute a specific type of discourse units in which time plays a special role; they are marked by the occurrence of sentences with non-durative aspect (event-sentences). In a narrative, the reference-time is reset whenever an event occurs. \[15l \[12\] The Spatial Index and the Modal Index play a role in the semantic interpretation of the utterances which is similar to the role of the Reference-Time: they specify a value for components of the index of evaluation of the semantic formula. In English, they can only be updated by the occurrence of spatial and modal adverbials or other similarly explicit information.</Paragraph>
    <Paragraph position="16"> In the next section we describe the rules of a discourse grammar.</Paragraph>
    <Paragraph position="17"> They use some other attributes besides the ones just mentioned. These can be most easily explained as we introduce the rules that use them.</Paragraph>
    <Section position="1" start_page="573" end_page="575" type="sub_section">
      <SectionTitle>
3.2. Grammar Rules
</SectionTitle>
      <Paragraph position="0"> The grammar we present here, though necessarily limited and schematic in many ways, covers a wide range of phenomena that are usually discussed separately. An adequate account of discourse structure, however, depends on their integration into a consistent mechanism. The semantic phenomena that we pay attention to include anaphor resolution, scope of modal indices, movement of reference time, and rhetorical relations. The grammar consists of rules which descdbe how to build up various kinds of structurally different discourse constituent units. We distinguish the following kinds of dcu's: * Subordinations. These are binary structures in which the first element remains accessible; we view them as units in which all or most of the structurally relevant features are inherited from the left constituent. (In discourse, unlike in the sentence, the subordinating element is always to the left of the subordinated one.) In semantic subordinations there is a semantic relation between the two constituents; this is the case in rhetorical subordinations and in topic-dominant chains. Interruptions, on the other hand; though structurally analogous, are semantically very different: in this case, there is no semantic connection whatsoever between the two constituents.</Paragraph>
      <Paragraph position="1"> * Binary Coordinations. These are binary structures in which the second element has equal status to the first, thus making the first one inaccessible. Under this category we include rhetorical coordinations (the counterparts of the rhetorical subordinations), and adjacency pairs which are concerned with the interactiona/dimension of the discourse (we include question~answer pairs and request~response pairs).</Paragraph>
      <Paragraph position="2"> * N-ary Coordinations. These are flat structures which can contain arbitrarily many elements, of which, at any time; only the .most recent one is accessible. We include fists, monotonic lists, and narratives. To generate n-ary coordinations by means of a context free grammar, we must assign them a recursive structure: we build them up by means of binary rules which extend them to the right.</Paragraph>
      <Paragraph position="3"> This classification is not necessarily complete, and should be the topic of more extensive discussion. But it does cover the most important structures, and brings some order into the grammar rules that we present below.</Paragraph>
      <Paragraph position="4"> Notation: Category symbols have the form &amp;quot;cat \[~t:o~t ...... ~n:(~n\]&amp;quot;, where &amp;quot;cat&amp;quot; is the basic non-terminal symbol of the context-tree grammar, ~t ..... ~n are the attributes, and ~l ..... ~n are expressions which stand for the value-sets of these attributes. Variables are indicated by italicized strings. If in a phrase structure rule an attribute has as its value-expression a variable that does not occur anywhere else in the rule, we will, for the sake of readability, leave the attribute out of our specification of the rule.</Paragraph>
      <Paragraph position="5"> The grammar does not have a particular start symbol; dcu's of all kinds are recognized as well-formed &amp;quot;discourses&amp;quot;.Z Sentences are assumed as elementary dcu's. The following rules specify the internal structure of some important kinds of complex discourse constituent units. We discuss further details about our approach and about the grammar formalism as we introduce these rules.</Paragraph>
      <Paragraph position="6"> Lists: list \[drs: dIud2, schema:s, sere: s(x) &amp; s(y)\] =&gt; dcu1\[drs:dl, sere: s(x)\] dcu2 \[drs:d2, sere: s(y)\] The intuitive idea behind this rule is that if two adjacent dcu's can be analyzed as having semantically parallel structures, they can be conjoined in a fist structure. Note that there is no constraint on the category of the dcu's: the main category symbols in the right hand side or the rule are variables.</Paragraph>
      <Paragraph position="7"> The formal criterion for the application of the rule is the existence el a Z-formula s such that the semantic content of each of the two dcu's has the form s(u). s We require that the semantic representation of the dcu already has the structure s(u), or can be put in that form by means Of a very limited repertoire of logical equivalence transformations. (This repertoire needs to be specified In detail. It will probably at least include Z-abstraction.) This restriction excludes undesirable trivial values of s such as (;L u: If u = SEM1 then SEM1 else SEM2). The types of the ;L-variables of s are required to be elements in a predefined hierarchy of &amp;quot;natural kinds&amp;quot; which classifies the'meanings of lexical entries. The value of s, representing the common denominator between the meanings of the two constituents, is stored as the value of the &amp;quot;schema&amp;quot; attribute on the list-dcu.</Paragraph>
      <Paragraph position="8"> Note that this rule often creates ambiguity. To take a simple example: if &amp;quot;John likes Mary&amp;quot; is followed by &amp;quot;Peter likes Mary&amp;quot;, one can abstract out several different schema's, even without considering different choices for the types of ~.-variables. ~. P: ,deg(MARY), or ;~ R,x: R(x, MARY), or ;~ x,y: LIKE(x,y ), or Z x: LIKE(x, MARY). The last of these is the preferred one, because it is most specific. Rather than constraining the grammar rule to apply only in the most specific way to every case, we assume that a preference for the most specific rule-application is applied as a heuristic principle during the parsing process. The &amp;quot;sam&amp;quot; attribute stores the semantics of a dcu, The operation of the &amp;quot;&amp;&amp;quot;-operator which is used here to build up the semantics is more general than logical conjunction: its arguments can have different speakers and have different ilfocutionary force operators. The semantic representation built up by &amp;quot;&amp;&amp;quot; stores the propositional content of individual utterances in a richly indexed data structure. We shall not attempt to specify this data structure in detail in the. present paper. We intend that for the case of purely assertional monologue, the operation of &amp;quot;&amp;&amp;quot; reduces to logical conjunction. (The movement of narrative time, for instance, will therefore not be hidden inside the meaning of &amp;quot;&amp;&amp;quot;.) Note that the notion of &amp;quot;list&amp;quot; that we use here is more general than the one we have used in previous discussions \[23\]. For instance, this notion of &amp;quot;list&amp;quot; subsumes the notion of a &amp;quot;topic chain&amp;quot;. In the formula stored as the value of the &amp;quot;sem&amp;quot;-attribute of a dcu, typed free variables are used to represent unresolved anaphors, such as pronouns and destressed definite descriptions. The unification process which matches the &amp;quot;sem&amp;quot;-attrtbutes of two dcu's when they are joined to constitute a list may substitute expressions for these vadables and thus resolve the anaphoric reference. Thus, some strong reference resolution preferences are explained as following directly from the acknowledgement of parallel structure; in this case, anaphoric elements get resolved without any search through the space of available discourse referents. This accounts for anaphoric reference to topic or attentional focus \[32\], as well as anaphoric reference to the corresponding element in a semantically parallel structure.</Paragraph>
      <Paragraph position="9"> To accumulate the appropriate candidates for further anaphor resolution processes, the discourse referents (values of the &amp;quot;drs&amp;quot; attribute) of both constituent dcu's are synthesized into the discourse referent set of the list-dcu.</Paragraph>
      <Paragraph position="10"> To extend a list-dcu to become a list with one more element, there is the following additional rule: list \[schema:s, drs: drsl u drs2, sem: p &amp; s(y)\] =&gt; list \[schema:s, drs:drsl, sere:p\] dcu \[drs:drs2, sere: s(y) \] A list can be extended by another dcu if this dcu instantlates the structure described by the schema-attribute of the list.</Paragraph>
      <Paragraph position="11"> Monotonic Lists: m-list \[schema;s, drs: drs l u drs2, direction: (If x&lt;y then incr else It x&gt;y then decr else fail), last:y, sem: s(x) &amp; s(y)\] =&gt; dcu \[drs: drst, sere: s(x)\] dcu \[drs:drs2, sere: s(y)\]  values of x and y are n-tupIos in this case.</Paragraph>
      <Paragraph position="12"> If in a list the various arguments of the &amp;quot;sChema&amp;quot;-function are elements of a linearly ordered domain, the list may be a monotonic list. To make it possible to ascertain whether a next dcu can be added to a monotonic list, such lists carry the value of the most recent argument (&amp;quot;last&amp;quot;-attribute), and their &amp;quot;direction&amp;quot; (&amp;quot;increasing&amp;quot; or &amp;quot;decreasing&amp;quot;). (&amp;quot;fail&amp;quot; is a special constant which is not allowed as an attribute valueexpression.) null m-list \[schema:s, drs: drsl ~J drs2, direction: (It x&lt;y ^ p=incr then incr else If x&gt;y ^ p=decr then decr else fail), last:y, sere: q &amp; s(y) \] =&gt; m-list \[schema:s, drs:drsl, incr: p, last:x, sere:q\] dcu \[drs:drs2, sere: s(y) ^ y ~ d\] A monotonic list can be extended by another dcu which instantiates the structure described by its schema:attribute, provided that the increasing or decreasing ordering is maintained. (Monotonic lists are discussed in \[24\].) Narratives: &amp;quot;Narratives* are used to express that a series of states of affairs obtain at successive points along a timeline (reference times \[29\]). How a next main clause of a narrative will interact with the previously established reference time of the narrative, depends on its aspect, durattve clauses behave differently from non-durative ones \[35\].&amp;quot;We therefore need hvo different rules for extending a narrative. We first give the rule for duratives, narrative \[drs: drsl ~J drs2, reference-time:rt, tense: (if tt ~ xtthen tl else fail), sere: p &amp; \[s\],f\] =&gt; narrative \[drs:drsl, reference-time:rt, tense: tt, sere:p\] dcu \[drs: drs2, reference-time:rt, x-tenses: xt, aspect: durative, sere: s\] The reference-time attribute indicates the time-interval where the progression of narrative time has arrived, Le., an interval after the last event in the narrative so tar. A durative dcu which extends a narrative is evaluated at the reference-time of that narrative.</Paragraph>
      <Paragraph position="13"> The value of the &amp;quot;tense&amp;quot; attribute on a narrative dcu marks the temporally distinct modes of narrative that a particular language allows. For English this includes the distinction between past, present, pluperfect, and future narratives. On a sentence (or more complex dcu) which is to be integrated into a narrative, the relevant attribute is not &amp;quot;tense&amp;quot; (which stores the tense of the dcu), but &amp;quot;x-tenses&amp;quot;, which stores the tenses that may be externally imposed on the dcu. A sentence with PRESENT as its tense can be used in a context where the tense is set to either PRESENT, PAST, or FUTURE, Similarly, a PAST tense sentence is compatible with either a PAST or a PLUPERFECT framework. A sentence with the PLUPERFECT tense, however, is only compatible with a PLUPERFECT timeframe. When a dcu extends a/narrative, the &amp;quot;tense&amp;quot; of the narrative must be an element of the &amp;quot;x-t~'nse~&amp;quot; of/the dcu. narrative \[drs: drs l u drs2, reference-time:v, tense: (it tl ~ xtthee tl else fail), sem: p &amp; \[S\]u &amp; t&lt; i u &lt;i v\] =&gt; narrative \[drs:drsl, reference-time:t, tense: tl, sem:p\] dcu \[drs: drs2, reference-time:u, aspect:event, x-tenses: xt, sam: s\] An event-dcu which extends a narrative is evaluated at an interval u following the then current reference-time t of the narrative. The new reference-time of the extended narrative is established to be another interval v after u. (Notation: a &lt;i b means &amp;quot;a immediately precedes b&amp;quot;.) The effect of this is that a durative dcu without explicit time-adverbials is interpreted at an interval immediately alter the last event, which will be closed off by the next event. (Validity beyond this Interval, in both directions, can often be inferred from this by invoking plausible assumptions about the universe of discourse.) Another effect is, that successive events are always separated by a time-gap (though nothing is said about the size of this time-gap).</Paragraph>
      <Paragraph position="14"> The above rules only define how to extend a narrative that is already underway. The rules for beginning a narrative are similar have been omitted for reasons of space.</Paragraph>
      <Paragraph position="15">  accomplishments or achievements, while non-durative oees describe stares or activities. STo enhance th*~ icglbllity of the rules, we will from now on leave out the description of the upwards propagation of discourse referents. It occurs uniformly in the same way as in the rules glvenbet~)re.</Paragraph>
      <Paragraph position="16"> This rule parses semantic subordinations which involve an explicitly indicated subordinating rhetorical relation R (&amp;quot;for instance,&amp;quot; &amp;quot;because&amp;quot;). The meaning of this relation is assumed to be incorporated in the semantics of the subordinated dcu; this dcu therefore has as a value of its &amp;quot;sem&amp;quot;-attribute a ;~-function which expects a propositional argument, s The attributes and Values of a subordination are inherited from the subordinating constituent.</Paragraph>
      <Paragraph position="17"> The subordinated constituent is optionally followed by a pop-marker (e.g. &amp;quot;so&amp;quot;, &amp;quot;anyway&amp;quot;). All clue-words (push-markers, pop-markers, interruption-markers) are treated as independent units, separate from the sentences that they precede or follow.</Paragraph>
      <Paragraph position="18"> In the formulation of the semantic subordination rule we have assumed an attribute called &amp;quot;index&amp;quot;, containing reference time as well as spatial and modal index.The rule shows how the subordinated discourse constituent unit is semantically contextualized by the subordinating one. dcul \[~1: rxl ..... ~,: c~,,, index:i, sere: a &amp; R(a,\[b\]i)\]</Paragraph>
      <Paragraph position="20"> This rule parses semantic subordinations for which the rhetorical relation involved is not overtly marked, The variable R ranges over all subordinating rhetorical relations.Since its value is not stated explicitly, it must be abducted on the basis of plausibility considerations regarding the resulting semantics, The subordinated constituent is optionally preceded by a push-marker (e.g. &amp;quot;like&amp;quot;), and optionally followed by a pop-marker.</Paragraph>
      <Paragraph position="21"> For subordinations we need a more elaborate treatment of semantics than the one assumed in this paper. We need to distinguish between the total accumulated meaning of a discourse constituent unit and its &amp;quot;core meaning&amp;quot;, which is considered in computations regarding semantic relations with other dcu's. It is a characteristic property of subordination dcu's that they allow for interpretations in which the core meaning is identical to the core meaning of the subordinating constituent, without any contribution from the subordinated constituent. To represent this, we would need to assume at least two different &amp;quot;sam&amp;quot; attributes, or a more complicated structure for the value of the &amp;quot;sem&amp;quot; attribute. Rhetorical coordinations: dcu \[~l: mscg((Xr 131) ..... ~.: mscg(o~,,. 13.), sem: a &amp; b &amp; R(a,b)\] =&gt; dcul \[~1: ~l ..... ~,,: %,, sere: a\] dcu2\[~l: 131 ..... ~.: 13., sem: ~. x: R(x,b)\] A This rule parses semantic coordinations which involve an explicitly indicated binary coordinating rhetorical relation R (&amp;quot;therefore,&amp;quot; &amp;quot;thus,&amp;quot; &amp;quot;accordingly'), &lt;Ref. Mann, Talmy&gt; As in the subordination case described before, the meaning of the relation is incorporated in the semantics of the clause in which it occurs, which therefore denotes a predicate on propositions.</Paragraph>
      <Paragraph position="22"> The function &amp;quot;mscg&amp;quot; computes the &amp;quot;most specific common generalization&amp;quot; of its arguments in the hierarchy of value-expressions of the relevant attribute. (When there is no proper hierarchy defined on the value-expressions of an attribute, mscg degenerates into a function which yields the value of its arguments when the two arguments are equal and which yields a new free variable when they are not.) dcu \[~1: mscg(~xl, 131) ..... ~.: mscg(o~., 13,,), sam: a &amp; b &amp; R(a,b)\] =&gt; dcu\[~l: o~! ..... ~.: ~., sem: a\] dcu \[~j: ~1 ..... ~.: ~., sere: b\] This rule parses binary semantic coordinations which are not overtly marked as such, Therefore, the semantics of the second dcu is a proposition rather than a predicate on propositions, The variable R ranges over all binary coordinating rhetorical relations. As in the corresponding subordination case, the value of R must be computed by abduction, magic, or a similar A.I. technique.</Paragraph>
      <Paragraph position="24"> Topic-dominant ,chatnlngs are subordination structures. In these structures, the subordinated dcu gives information about a constituent of the predicate in the semantics of the subordinating dcu. The aboverule requires that there exists an element y such that the predicate of the left dcu and the semantics of the right dcu can both be formulated as expressions with the structure f(y) -- where, as before, only a limited eNota that most other rules implicitly assume that they operate on dcu's with a &amp;quot;sem ''~ value which is a proposition, In the current formulation, they therefore do not operate correctly on explicitly subordinated clauses. The appropriate refinements are not difficult to imagine, but go beyond the limited scope of the present paper.  repertoire of logical transformations can be used to achieve this formulation, starting from formulas which correspond directly to the Surface structure of the dcu's. These limitations are to be defined in such a way that the possible values of y correspond to the constituents eligible for dominance in \[6}, or the forward-/ooking centers of \[9\].</Paragraph>
      <Paragraph position="25"> The heuristics of the parsing process prefers applying rules for constructing list-structures to the rule for topic-dominant chaining* (Cf.</Paragraph>
      <Paragraph position="26">  This rule parses question/answer pairs. The semantics of a yes/no question is assumed to be a proposition; the semantics of a wh-question is assumed to be a set-denoting expression (cf. \[31\]). The semantics of an answer is a predication on the question-semantics.</Paragraph>
      <Paragraph position="28"> This rule parses request/response pairs. Semantically, these are very similar to question/answer pairs. We have chosen to exclude &amp;quot;rhetorical requests&amp;quot; by requiring the speaker/addressee relation to flip between request and response.</Paragraph>
      <Paragraph position="29"> Interruptions: tic. 1\[~,\]</Paragraph>
      <Paragraph position="31"> This rule allows for semantically unrelated interruptions of an ongoing discourse (cf. \[26\] \[10\] )interruptions may be introduced by specific markers such as &amp;quot;Oh!&amp;quot;.</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="575" end_page="575" type="metho">
    <SectionTitle>
4. Discourse Parsing
</SectionTitle>
    <Paragraph position="0"> We consider the development of a formal grammar of discourse structure, such as the one sketched in the previous section, to be the first step towards a formal account of the process of discourse paining. We now briefly review some of the issues that would be involved in such an account.</Paragraph>
    <Paragraph position="1"> The most important issue in discourse parsing is the necessity of semi-determinism. Spontaneous dialogue involves unpremeditated turn-taking and interruption. In order for this to be possible, there must regularly be points in an interaction at which the interpretation of utterances so far is mutually established as independent of the discourse which is to follow. Moreover, an unanticipated next utterance can operate in a mutually understood way on the structure of the discourse so far (for instance, by abandoning a digression to pop to a previously interrupted dcu). Therefore, at points where such a move is allowed, there must also be mutual agreement on the structure of the discourse so far.</Paragraph>
    <Paragraph position="2"> The granularity of this &amp;quot;unpredictability without misunderstanding&amp;quot; seems to be the clause or sentence level* We therefore postulate an incremental left-to-right parsing process at this level of granularity, which operates in essentially deterministic mode. In \[24\] we gave an informal description of such a parsing process, which processes every incoming sentence incrementally, extending an existing discourse tree to the right by node insertion. An important assumption of the parsing process as described there, is that at any point it only uses information on the right edge of the existing discourse tree. This means that interlocutors just need to be. aware of the stack of information which corresponds to the labels on the right edge of the tree, rather than the complete details of the discourse that went befOre.</Paragraph>
    <Paragraph position="3"> Inspection of the grammar rules in the previous section suggests that this grammar Is compatible with the parsing strategy outlined in \[24\]: relevant information is always propagated up to the right edge of the tree, and dcu interpretations get propagated up without being influenced by the nature of the intervening nodes.</Paragraph>
  </Section>
  <Section position="5" start_page="575" end_page="575" type="metho">
    <SectionTitle>
5. Formalisms for Discourse Grammar.
</SectionTitle>
    <Paragraph position="0"> The augmented context free grammar developed above should be taken as a demonstration of the possibility and utility of formal grammars for decribing structural and semantic phenomena in natural language discourse. We expect,that work of this kind, especially if carried out on a larger scale, will constitute a more fruitful path to new insights than approaches which are .oriented towards essayistic description or unprincipled implementation.</Paragraph>
    <Paragraph position="1">  We should not make exaggerated claims concerning the formalism we have used here. Much more work is needed before it will be clear what kind of formal framework has the best fit with the phenomena. But it is probably useful to articulate our thoughts on how the augmented context free grammar formalism compares to other formalisms that could have been used in this work.</Paragraph>
    <Paragraph position="2"> The formalism we have used is a context free grammar augmented with attributes, which propagates feature values through term unification on value-expressions containing variables. Similar formalisms have been used for sentence-level syntax. The closest is the ACFG formalism used for the grammar of BBN's Spoken Language System\[11\] \[2\], which mainly differs in assuming a more limited syntax of attribute-value expressions. Definite Clause Grammars \[21\], if decoupled from their commitment to Prolog programming, are also very similar.</Paragraph>
    <Paragraph position="3"> Generalized Phrase Structure Grammar\[7\], as well as related theories such as HPSG and LFG, share many aspects of our approach: ar~ emphasis on context free surface structure, and the use of a unification process to enforce the desired agreement and inheritance behaviour on the values of attributes. However, these frameworks use unification on graphs rather than logical term-expressions. This probably creates additional expressive power, but it goes at the cost of ease of implementation and of conceptual clarity.</Paragraph>
    <Paragraph position="4"> It is a major advantage of logical term unification that there is an obvious and simple semantics for it: any object which may contain variables, be it an attribute value, a syntactic tree, or a context free rule, can be viewed as an abbreviation for the set of its ground instances; applying the most general unifier to a set of terms yields a term standing for the intersection of the sets of their ground instances. Compared to the conceptual and computational simplicity of logical term unification, the graph unification formalisms used in modern linguistic frameworks are rather cumbersome. There do not seem to be good linguistic reasoris for preferring graph unification.</Paragraph>
    <Paragraph position="5"> An interesting perspective on the kind of grammar we have used follows from the realization that it can be viewed as a particular instance of an attribute grammar as defined by Knuth\[16\]: one in which the values of the attributes on a node:are always synthesized from the values on the nodes of its immediate constituents. This raises the questi0n whether one might want to formulate this directional dependency explicitly in an attribute grammar notation. More interestingly, it raises the question whether there are phenomena that could be more elegantly described as inheritance from a top node to its constituents, rather than the other way around. We expect such phenomena to occur if we would want to Integrate more global constraintsl related to people's tasks, goals and plans, into our description of discourse.</Paragraph>
    <Paragraph position="6"> Finally, we want to reflect on the Augmented Transition Network formulation of discourse structure that we used in previous, more informal papers \[23}. ATNs do have some properties which are attractive for discourse. A looping arc which updates a register constitutes a powerful device that doesn't have a direct equivalent in other formalisms. In the current grammar We emulate the effect of such an arc by a recursive binary structure: lists and narratives are built up by repeated extension to the right. Intuitively, one sees lists and narratives as flat structures, and we have described them in those terms in previous papers \[24\]. The power of ATNs thus makes it possible to account more directly for the structures that seem plausible.</Paragraph>
    <Paragraph position="7"> The framework used in the present paper is conceptually simple, and more limited than any of its alternatives. Nevertheless, it seems powerful enough to describe the phenomena we encountered in developing the grammar presented here. Subsequent research will have to answer the question whether It is ultimately powerful enough to describe the full range of discourse phenomena in a felicitous way.</Paragraph>
    <Paragraph position="8"> In the present discussion we have ignored what in previous work we have called the level of speech event structure, which is concerned with the structure of discourse as a social activity. \[22\] \[23\] By the same token, we have left unaddressed the fact that discourse structures may reflect the tasks, goals and plans of the discourse participants, as they would be construed in A.I. based approaches to discourse analysis.</Paragraph>
    <Paragraph position="9"> Observations reported in \[36\] show (1) that speech event structure and linguistic discourse structure may be at odds with each other, and (2) that the linguistic discourse structure has the most direct semantic relevance in such cases. Though speech event structure and task structure have in fact considerable semantic relevance and must ultimately be factored In, we want to hold off on dealing with the complexities involved in this issue.</Paragraph>
    <Paragraph position="10"> The grammar presented in this paper provides a formal characterization of Discourse Parse Trees by means of the bottom-up rules used in its construction. The problem of mating these bottom-up rules with necessary high-level, top-down rules involving phenomena occurring in the task domain or interaction is essentially the problem of plan-recognition. In an operational system, the functionality of a linguistically based discourse parser would thus be very much enhanced by an efficient Plan Recognizer of the type envisioned in &amp;quot;plan based&amp;quot; and &amp;quot;intention based&amp;quot; pragmatic discourse models \[1\], \[14\], \[18\], \[33\]. On the other hand, incorporating a discourse grammar would improve the functionality of existing plan based models, which lack explicit mechanisms for relating sentential syntax and semantics to pragmatic plan structures.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML