File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/86/c86-1127_metho.xml

Size: 37,613 bytes

Last Modified: 2025-10-06 14:11:53

<?xml version="1.0" standalone="yes"?>
<Paper uid="C86-1127">
  <Title>BUILDRS: AN IMPLEMENTATION OF DR THEORY AND LFG</Title>
  <Section position="1" start_page="0" end_page="543" type="metho">
    <SectionTitle>
BUILDRS: AN IMPLEMENTATION OF
DR THEORY AND LFG
</SectionTitle>
    <Paragraph position="0"> This paper examines a particular PROLOG implementation of Discourse Representation theory (DR theory) constructed at the University of Texas. The implementation also contains a Lexical Functional Grammar parser that provides f-structures; these f-structures are then translated into the semantic representations posited by DR theory, structures which are known as Discourse Representation Structures (DRSs). Our program handles some linguistically interesting phenomena in English such as (i) scope ambiguities of singular quantifiers, (if) functional control phenomena, and (iii) long distance dependencies. Finally, we have implemented an algorithm tor anaphora resolution. Our goal is to use purely linguistically available information in constructing a semantic representation of discourse as far as is feasible and to forego appeals to world knowledge.</Paragraph>
    <Paragraph position="1"> ._\[Dtroduction * DR Theory DR theory, versions of which have been developed independently by Hans Kamp (1981) and Irene Helm (1982), has several motivations, but certainly one ot the principal ones was to examine the anaphodc properties of detinite and indefinite noun phrases. Out of this study emerged a novel theory o1 indefinites and definites that has provided, among other things, a semantic theory of intersentential anaphora.</Paragraph>
    <Paragraph position="2"> One of our goals in implementing DR theory for a fragment ol English was to investigate how a purely linguistic theory ot anaphora might help in the real time processing ol anaphors by automated natural language understanding systems.</Paragraph>
    <Paragraph position="3"> DR theory has two basic components: the DRS construction algorithm and the correctness definition. The DRS construction algorithm is a mapping from natural language discourses to DRSs, the semantic representations that the theory posits. The correctness definition maps DRSs into models that (i) provide the truth conditions of one sentence in isolation and (if) show how the content of one sentence contributes to the content of the antecedent discourse it follows. We have implemented only a fragment of the presently developed DRS construction algorithm.</Paragraph>
    <Paragraph position="4"> We want to argue that for philosophical reasons it makes little sense to try to implement on a computer not only the DRS construction algorithm but also the correctness definition. To understand why we hold this view, however, we have to provide at least a very sketchy overview of these two components ot DR theory.</Paragraph>
    <Paragraph position="5"> The language in which DRSs are described is to begin with quite simple. Its vocabulary consists of a set of individual reference markers (x, y, z, x 1, etc.), a set of mental state markers (p, Pl ,P2, etc.), and a set of n-ary predicates, for which English nouns, verbs and intersective adjectives will serve fine. We also have certain logical symbols in the DRS language: --~, v, ==~.</Paragraph>
    <Paragraph position="6"> Next, we define conditions and DRSs by a simultaneous recursion. (I will use boldfaced letters of the appropriate type as metalinguistic variables for relerence markers.)  Definition 1: 1. Suppose that q) is an n-ary predicate and x 1 ..... x n are reference markers. Then fi)(Xl, .... Xn) is an atomic condition.</Paragraph>
    <Paragraph position="7"> 2. Suppose x 1 and x 2 are reference markers. Then x I = x 2 is an atomic condition.</Paragraph>
    <Paragraph position="8"> 3. A DRS K is a pair of sets &lt;U, Con&gt;, where U is a set of reference markers and Con a set of conditions.</Paragraph>
    <Paragraph position="9"> 4. Let K 1 and K 2 be DRSs and let p be a mental state  reference marker. Then .~K1, K 1 vK2, K 1 :=~ K2, and p:K 1 are conditions.</Paragraph>
    <Paragraph position="10"> This framework yields a treatment of indefinite and definite noun phrases that has made an important contribution to understanding the anaphodc and &amp;quot;pseudo referential&amp;quot; behavior of indefinites. Let us first briefly sketch the theory's treatment of indefinites. When processed by the DRS construction algorithm, a singular indefinite introduces into a DRS a reference marker that functions essentially as a free variable. This reference marker can be identified with other reference markers that are introduced by anaphoric pronouns at potentially unlimited distances from the original indefinite NP. Indefiniles get their existential force by the way a DRS is assigned truth conditions.</Paragraph>
    <Paragraph position="11"> To take a simple example thai uses only indefinites and anaphoric pronouns, the DRS construction algorithm yields the DRS in (2) for  (1) A man loves a woman. She is beautiful.</Paragraph>
    <Paragraph position="12"> (2) &lt;1&gt; x,&lt;2&gt; y, &lt;5&gt; z &lt;1&gt; \[loves a woman(x)\] &lt;2&gt; woman(y) &lt;3&gt; loves(x, y) &lt;4&gt; \[She is beautiful\] &lt;5&gt; z = y &lt;5&gt; \[is beautiful(z)\] &lt;6&gt; beautiful(z)  The informal interpretation of (2) is that there are three objects (corresponding to x, y and z) that have the respective properties ascribed to x, y and z in the conditions &lt;1&gt; - &lt;6&gt;. The numbers to the left of the reference markers and conditions indicate how the construction algorithm might proceed from step to step in a top down algorithm on some sort of standard parse tree like an LFG c-structure. Initially, we begin with the unanalyzed sentence within a DRS K. The subject N(oun) P(hrase) node in the parse tree introduces a condition into the DRS, the condition man(x)', into Con K and a reference marker x into U K, \]-hat exhausts the content of the indefinite noun phrase. The  V(erb) P(hrase) node introduces an intermediate step in the DRS that is further broken down by the algorithm as it goes down the parse tree; we shall write this intermediate step as an as yet not fully analyzed condition -- 'loves a woman(x)'. The reference marker introduced by the NP already processed (the subject NP) is an argument of that condition. The unanalyzed condition is then broken down by the algorithm as it goes down the parse tree; the NP that is a consitluent of the VP introduces another reference marker and a condition 'woman(y)' into K; and finally, the verb itself inlroduces a condition into Con K, After step &lt;3&gt;, the algorithm has finished with the first sentence in (1). It now processes the second sentence, using the background DRS constructed from the first as a context for the interpretation of the new input. The subject noun phrase of the second sentence is simply an anaphoric pronoun. Because the pronoun is an NP, the algorithm requires that it introduce a new reference marker into the DRS. Because the pronoun is anaphoric, the reference marker it introduces must be linked to some already introduced reference marker in U K. Thus, U K provides a set of contextually &amp;quot;discourse individuals&amp;quot; (objects that have been talked about in the discourse) that can be referred to by NPs in subsequent discourse. Once, the algorithm is finished with 'she', the rest of the second sentence is processed in the same sequence of steps used to process the first.</Paragraph>
    <Paragraph position="13"> Our implementation in general follows the left to right process described above. Using f-structures as inputs to the construction algorithm allows us to bypass the steps that are enclosed within the square brackets, since we have already available in the f-structure the basio predicate argument structure of this simple sentence. Tile f-structures play a more important role in more complex sentences like those involving relative clauses; there they make transparent the way in which the predicate argument structure of subordinate clauses fits together with the predicate argument structure of the main clause.</Paragraph>
    <Paragraph position="14"> ~s~.tibJHLv_.</Paragraph>
    <Paragraph position="15"> The simple example in (1) already introduces the problem of anaphoric coreference. DR theory, like all linguistic theories, provides constraints on what are the possible, antecedent, coreferential NPs for an anaphoric, pronominal NP. We shall say that a reference marker u introduced by an antecedent NP can be linked anaphorically to a reference marker v introduced by a pronominal NPjust in case u is accessible to v. u is accessible to v just in case &lt;u,v&gt; belongs to the transitive closure of the relation that is defined as follows: a) u has already been introduced into U K prior to the introduction of v, b) u e UK,, and there is a K&amp;quot; such that K' ~ K is a condition in K&amp;quot;, c) K occurs as a component of some condition in K' and u has already been introduced into U K, prior to the introduclion of lhe condition containing K. 1 Our implementation uses this notion of accessibility to constrain tile process of anaphora resolution. While singular indefinile NPs simply introduce, when processed by the construction algorithm, reference markers and atomic conditions into the DRS, &amp;quot;quantificational&amp;quot; NPs (those involving determiners like 'every', 'each', and also 'many') will introduce logically complex conditions (see Kamp, (1981), Frey &amp;Kamp (1985)for details).</Paragraph>
    <Paragraph position="16"> (3) Eveiy man loves a woman yields on the &amp;quot;default&amp;quot; left-right scope reading the following DRS:</Paragraph>
    <Paragraph position="18"> As we shall see shorlly, the truth conditions of such a DRS are essentially this: pick any man; then there's a woman that loves him. Ttle DRS in (4) differs from the one that is the result of processing the first sentence in (1), in that the detern'liner 'every' makes a decidedly different contribution from a pure indefinite. 'Every' introduces a logically complex structure between two DRSs; it corresponds in first order logic to a universal conditional.</Paragraph>
    <Paragraph position="19"> Our implementation also countenances relative clauses and the truth functional seniential conjunctions that correspond to the DRS connectives already mentioned. Given tile original design of the algorithm reproduced in our implementation and tile notion of accessibilty, the following two sentences turn out  to have almost exactly the same DRS: (5) Every farmer who owns a donkey beats it.</Paragraph>
    <Paragraph position="20"> (6) If a farmer owns a donkey he beats it.</Paragraph>
    <Paragraph position="21">  The DRS in (7) is what (5) yields. The DRS for (6) is almost identical, except that the DRS in the consequent of the conditional :=~ contains two new reference markers in its universe z and w and the conditions z = x 1 and w = x 2. (7) ........................</Paragraph>
    <Paragraph position="22"> ..... 1 \[i; 1 n n/xl) I</Paragraph>
    <Paragraph position="24"> The construction algorithm thus shows how to arrive by a completely effective means at the same DRS and the same truth conditions for (5) and (6) and in so doing solves a longstanding puzzle concerning the &amp;quot;donkey sentences.&amp;quot; See Kamp (1981), Ideim (1982).</Paragraph>
    <Paragraph position="25"> So far we have seen only left-rlght scope assignments to sentences with indefinites and true quantificational NPs, But of course, there are olher possible scope assignments, which our implementation also generates. We generate tile left-right scope assignment as a default, but our implementation ca.</Paragraph>
    <Paragraph position="26"> generate the other scope assignments as weN. Sometimes these are needed in resolving anaphoric links, For instance, consider the discourse in (8):  (8) Every man loves a woman, She is beautiful.</Paragraph>
    <Paragraph position="27"> On the default left-right scope assignment, (8) yields: (9-a) ..... Z. ..................</Paragraph>
    <Paragraph position="29"> which is a DRS that is incoherent; z cannot be identified with a reference marker, since neither x 1 nor x 2 are accessible to z. If 'a woman' takes wide scope, however, an anaphoric link is possible, and it is one that our implementation finds:</Paragraph>
    <Paragraph position="31"> Definite noun phrases, which include definite descriptions and directly referential expressions like proper names, demonstratives, and indexicals, have a quite complex role in DR theory. All definiles generate adequacy or felicity conditions for discourses containing them When a definite NP  is used in a discourse, the discourse will be adequate if and only if the recipient has sufficient information given by the context and background assumptions to link the reference marker introduced with some contextually available individual and/or some reference marker already introduced in thediscourse.</Paragraph>
    <Paragraph position="32"> This thesis is developed and defended at length in Helm (1982). If such a link is not possible, then the discourse as a whole may lack a determinate truth value. The intuilion behind this requirement is that whenever the speaker correctly employs a definite NP, he has a &amp;quot;definite individual in mind&amp;quot; that the recipient must in at least some minimal way be able to isolate. These claims lead to a special role for definites in the construction algorithm. They always have in effect wide scope over those logical operators that serve as barriers to anaphoric links, since their denotation must, if the felicity conditions are to be accomodated, already have been introduced in the discourse. To take an example, consider the case of a proper name. Insofar as a proper name is a definite, the recipient of an utterance containing one must have some information enabling him to link the reference marker introduced with the appropriate, contextually available individual. The sort of information at issue may be often quite minimal and the set of contextually available individuals quite large (it includes at least potentially everything that can be named). But once this link is made, the logical barriers to further anaphoric links between the reference markers introduced by name and those introduced by anaphoric pronouns that underlie the constraint of accessibility have no effect. That is, the reference marker introduced by a proper name is always accessible to a subsequently introduced reference marker. The reason for explaining the way definites work is to make a case for distinguishing reference markers introduced by definites from those introduced by indefinites as we have done in our implementation.</Paragraph>
    <Paragraph position="33"> _Correc(ness D~ The model theory or correctness definition for the extensional fragment surveyed is very simple. We define a model for the DRS &amp;quot;language&amp;quot; to be an ordered pair &lt;D, \[\]&gt;, where D is a set of entities (the &amp;quot;domain&amp;quot; of tile model), and \[\] an interpretation function thai takes n-ary predicates into sets el n-tuples in ~ (w(Dn)). Further we define an embedding function f for a DRS K in a model M to be a map from reference markers in U K into the domain ot M. We will also define an extension of an embedding tunction t to an embedding function g for a DRS K 1 to be the function: g: (Dora(f) u UK1 ) ---) D. We shall abbreviate 'g extends t to an embedding of K as 'f ~K g&amp;quot; We now define, again by simultaneous recursion, the notions of a proper embedding of a DRS K in the model M and the satisfaction of a condition C in a DRS K in the model M by an embedding function. I will abbreviate 'f is a proper embedding of K in M' by 'If,K\] M = 1' and 'the model M  satisfies C under an embeddin 9 function f for K' as M I= f,K C'. Definition 2: Let x I ..... x n be reference markers, an n-ary DRS predicate, K, K 1 and K 2 DRSs, and let IV\] M be the extension of ~ in M. Then 1. If q) is an atomic condition of the form ~(Xl, .... Xn) , M I=f,K (p iff &lt;f(x 1 ..... f(xn)&gt; ~ \[~\]M.</Paragraph>
    <Paragraph position="34"> 2. Ifq) is an atomic condition of the !orm x I = x 2 , M I=f,K q) ift</Paragraph>
    <Paragraph position="36"> 3. If (p ~ Con K is a complex condition of the form .-, K1, then M I=f,K ~ iff -,3g ;2K1 fig, K1\]f M = 1 4. If~p ~ Con K is a complex condition of the form K 1 v K2,</Paragraph>
    <Paragraph position="38"> 6. \[f,K\]gM = 1 iff f is an embedding function such that: (i) g ~ f; (it) for every condition ~ in Con K M I=f ~; (iii) if K has an external anchor A, then A ..~..f.</Paragraph>
    <Paragraph position="39"> 7. If,K\] M -- 1 iff \[f,K\]A M = 1 where A is the empty function. 2  A ORS K is true in a model M just in case there is a proper embedding of K in M From the standpoint of the theory of information processing, a DRS represents the resuff of a recipient's processing of a verbal input. The DRS then as it stands captures the information thal the recipient has gleaned from the sentence. The correctness definition evatuates that information content. The two components of DR theory have thus distinctly different tasks: anaphora resolution and other phenomena like scope disambiguation that are necessary tor discourse understanding must take place at the level of DRS construction; the assignment of truth conditions is provided by the correctness definition. We want to emphasize that the mapping characterizing the correctness definition is not something that the recipient in general has access to or can construct. For a knowledge of such a mapping involves at least on occasion the knowledge of what are the denotations of directly referential expressions (since these provide the constraints that we have called external anchors on the embedding functions); and as almost two decades of philosophical argument have made plain, the recipient of a discourse containing directly referential expressions need net and generally does net know in any interesting sense what the denotations of those expressions are in order for the discourse to be comprehensible to him. In general, we know that we must distinguish between the truth conditional content of an expression and the content of the expression that is available to the interpreter. This is a thesis that follows directly from accepting the basic principles of the theory el direct reference, which DR theory at least to some extent incorporates. So insofar as we wish to model or to mimick the way humans process verbal inputs, then we must limit ourselves in implementations of natural language understanding to the (nformation provided by the DRS construction algorithm.</Paragraph>
    <Paragraph position="40"> In effect this is a useful constraint, for it forces the DR theoretician to make a principled division of labor between these two components of the theory. It also allows us to provide a much more clean cut approach to the implementation of semantic theories than is possible with theories that do net invite this sort of division. For we are able to separate tasks of discourse understanding, which, one feels, ought to be tasks that can be accomplished by means of effective algorithms, from the evaluation of a discourse at a set of indices-- which in general has no effective solution. This may be only a small and rather obvious, philosophical point, but it is one which is worth keeping in mind when one is trying to come upn with tractable analyses of various aspects of natural language understanding.</Paragraph>
    <Paragraph position="41"> .The Actual Implementation : BUILDRS The implementation we have constructed has three distinct modules-- an LFG parser, a DRS constructor, and finally an anaphoric resolution component. The inputs to BUILDRS are multisentential discourses, and the odl~uts are scope disambiguated semantic structures, i.e., DRSs, with reference markers introduced by anaphoric pronouns identitifed with the appropriate reference marker introduced by some previously processed NP.</Paragraph>
    <Paragraph position="42"> The LFG parser of the kind used in BUILDRS has already been discussed in detail by Frey and Reyfe (1983), Frey (1985) and Reyle (1985). We have agumented their LFG parser to handle compound sentences (those containing conjunctions like 'and', 'or' or 'if .... then'). The parser provides a separate f-structure for each of the sentential components of a compound sentence. These then serve as arguments to connectives. We also allow for the possibility of several scope assignments, when more than one operator is involved. We have also augmented the parser by attaching to each value of the specifier in an f-structure a unique identifier; in view of the fact that the LFG parser is a front end to our DRS maker, the identifiers we have chosen are new reference markers (i.e., reference markers that have not already appeared in previously processed discourse).</Paragraph>
    <Paragraph position="43"> Once the parser has linlshed its task, the DRS constructor begins its work. The overall structure of the constructor is similar to that of the informal algorithm discussed earlier and also is based on the work of Frey (1985). We first translate the semantically relevant entries in SUBJ into DRS conditions and partial DRS structures, then those in OBJ and finally PRED of the f-structure as a whole (the main verb) is translated into a DRS condition. The translations forlhe lexical items in an f-structure are stored in a separate database and can be easily augmented as the grammar and the lexicon grow. The translations of the determiners or specifiers in an f-structure yield structures that we shall call partial DRSs. In the fragment treated by BUILDRS 3, we deal only with singular determiners; 'a' (or its equivalents 'some' and the like) and 'every' introduce different partial DRSs. They have the form of a triplet. In the case of 'every' it is &lt;x, ~LP;LQ, (\[\] =~((\[x\],\[P\]), (\[\],\[Q\]))&gt;, where x is the reference marker introduced by the specifier in the fostructure, P and Q are variables for sets of properties (i.e. sets of conditions), and the structure (\[\] :=~(\[x\],\[P\]), (\[\],\[Q\])) is the schematic form of a DRS. In the case of 'a' it is &lt;x, ~,P;LQ, (\[x\],\[P, Q\])&gt;. The translations of a common noun phrase are also triplets, but they do not contain abstraction over properties or property sets. Instead, they supply the properties that are to be filled in for P and Q. So, for instance, the translation for the predicate 'man' is the triple &lt; ;Lx, 0, \[man(x)\]&gt;. Again following Frey (1985), we shall call such triples predicative DRSs. We have a special translation for the main predicate of the sentence's f-structure; it is an ordered triple of the form &lt;0.0. #fX. Y~&gt;. X and Y are to be filled in eventually by the reference markers introduced by the specifiers in SUBJ and OBJ respectively. The translations of the entries in the f-structures are Ihen combined together by means of a process called conversion. Conversion is like the application of a ;L-abstract to an argument. Following the general path of the construction algorithm, we begin by introducing a partial DRS with the specifier of SUBJ. Suppose for example that SUBJ contains 'every' as a specifier. Its translation is the partial DRS (i) &lt;x, ~LP;LQ, (\[\] =~((\[x\],\[P\]), (\[\], \[Q\])))&gt;. The common noun phrase in SUBJ (the head noun + any modifiers) yields a predicative DRS, which will contain a complex property if the common noun phrase is itself complex-i.e. contains modifiers like possessives or relative clauses. So in general it will be of the form (ii)&lt;Xx, 0, (UCN, CONcN\[x\])&gt; , where UCN is the set of reference markers and CONcN\[x \] is the set of conditions derived from the common noun phrase and where 'CONcN\[x \]' denotes the fact that at least one condition in CONcN contains 'x' as an argument. Converting (i) with (ii) yields the partial DRS &lt;x, XQ, (\[\] =~((\[x u UCN\],\[CONcN\[x/x\]\]), (\[\], \[Q\])))&gt;. 'ix/x\]' denotes the replacement of every occurrence of 'x' in the conditions in CONcN with the reference marker x.</Paragraph>
    <Paragraph position="44"> The process of conversion turns to OBJ and processes it in the same way. This yields another partial DRS that, like the one above, contains a properly abstract,e.g., &lt;y, ~.Q, q)&gt;. The first strategy is to construct a DRS exhibiting the left-right scope of quantifiers as they occur in the discourse. So in this case, we first combine the partial DRS from OBJ with the translation of the main predicate, which is of the form &lt;0, 0, ~(X, Y)&gt;. The structure &lt;y, XQ, fl)&gt; combines with &lt;0, 0, ~(X, Y)&gt; to yield a complete DRS of the form &lt;0, 0, ~'(X)&gt;. PSo'(X) is the result of the conversion of the partial DHS PSo with the predicative DRS &lt;0, 0, ~(X,y)&gt;. Finally, conversion is applied again using the partial DRS derived from SUB&amp; yielding another complete DRS and the desired result. To get alternative scope assignments for quantitiers, the program backtracks and tries to do conversion in a different way.</Paragraph>
    <Paragraph position="45"> To take an example, let us see how the DRS constructor would handle the sentence, 'every man loves a woman.' The output of the parser yields the structure:</Paragraph>
    <Paragraph position="47"> The DRS constructor yields the partial DRS (i) &lt;u 1, ~,O, (\[\] ((\[Ul\],\[man(ul)\]), (\[\],\[Q\])))&gt; for SUBJ and the partial DRS (if) &lt;u2, ;LQ, (\[u2\], \[woman(u2), Q\])&gt; for OBJ. The translation of the main PRED of the whole f-structure is: (iii) Ioves(X,Y).</Paragraph>
    <Paragraph position="48"> Conversion of (if) wilh (iii) yields the complete DRS (iv): &lt;0, 0, \[u2\] \[woman(u2), loves(X, u2)\]&gt;, and the conversion of (i) with (iv) yields &lt; O, 0, (\[\] =~.((\[Ul\], \[man(ul)\]), (\[u2\], \[woman(u2), loves(u1, u2)\])))&gt;, which is the desired result. To get the alternative scope reading, the successive conversion of (lit) with (i) and then with (if) yields the DRS: &lt;0, 0, (\[u2\],\[woman(u2), =~ ((lull,\[man(u1)\]), (if,\[loves(u1, u2)\])\])&gt;.</Paragraph>
    <Paragraph position="49"> This is the basic part of the DRS constructor. The two embellishments we have made to this basic part concern the treatment of relatives and possessives. The treatments of possessives and relatives are quite similar, so we shall describe here just one-- the treatment of relative clauses. The syntactic treatment of an NP containing a relative clause of the form 'DET N that S' yields following kind of l-structure: 4  Note that pro carries an identifier identical to that of the specifier of the head.</Paragraph>
    <Paragraph position="50"> Suppose that the translation of the,specifier introduces a new reference marker x. Then the translation of 'pro' yields the following partial DRS: (i) &lt;x, XP, (\[\],\[P\])&gt;. Since that is all that is in the SUBJ position of RADJ, we get (i) as the partial DRS associated with SUBJ. Now we use conversion to construct the partial DRS for PS0 (call this (if)), To get the default scope  assignment we convert (ii) with the translation for J3 and then convert that result again with (i). The result is a quasi-complete DRS o1 the form (iv) &lt;0, 0, (URADJ, COnRAD j)&gt; , where URADJ is the set of reference markers and COnRADJ is the set of conditions derived from the processing of RADJ. Quasicomplete DRSs are distinguished from complete DRSs, in that they are derived from f-structures containing 'pro'. To use (iv) in building up the partial DRS for the main NP, the program converts the structure for RADJ (iv) into a ~.-abstract term-- viz. (v) &lt;0, ;LP, (URADJ, \[COnRADJ, P\])&gt;. We are now abIe to combine (v) with the translation of PRED 1o get the translation of a complex common noun phrase-- viz. (vi) &lt;~.x, 0, (URADJ, \[COnRAD j, N(x)\])&gt;. Now this predicative DRS may be combined with the partial DRS produced by the specifier in the usual way to yield the partial DRS for the complete NP.</Paragraph>
    <Paragraph position="51"> Processing relative clauses in this way ailows us to handle  long distance dependencies. Consider the sentence, (12) A dog that Mary persuaded Adrienne to expect Fred 1o buy sleeps.</Paragraph>
    <Paragraph position="52"> ~12') _ -.: ISUBJ = |SPEC = a</Paragraph>
  </Section>
  <Section position="2" start_page="543" end_page="545" type="metho">
    <SectionTitle>
!PRED =dog
RADJ= ~'-SUBJ =PRED -Mary
</SectionTitle>
    <Paragraph position="0"/>
    <Paragraph position="2"> The DRS constructor begins with the specifier of the subj structure in the usual fashion and then begins to construct the common noun phrase from the inside out. So then we begin with the innermost XCOMP. Suppose that the specifier introduces a reference marker x. How, we have to ask, is the translation of an identification between two f-structures to be done here? We suppose that such an identification is very similar to pro and that it should be translaled in exactly the same way. So the translation and conversion of the innermost XCOMP yields the quasi-complete DRS &lt;0, 0 (\[f\], \[buy(f,x)\])&gt;.</Paragraph>
    <Paragraph position="3"> Now, we put this together with the translations of the next innermost SUBJ and OBJ and the translation of the predicate 'expect' by conversion to get the following quasi-complete DRS &lt;0, 0, (\[a, p\], \[expect(a,p), p: (\[f\], \[buy(f,x)\])\])&gt;. In a similar fashion we handle the construction of the entire RADJ; translation and conversion yield once again the quasi-DRS &lt;0, 0, (\[m, a, q\], \[persuade(m, a, q) q: (\[p\], \[expect(a,p), p: (\[f\], \[buy(f,x)\])\])\])&gt;. Now we convert the result into a ),.-abstract since we are done with RADJ and we can now form the complex noun phrase translation in the familiar fashion.</Paragraph>
    <Section position="1" start_page="543" end_page="545" type="sub_section">
      <SectionTitle>
Anaphora Resolution
</SectionTitle>
      <Paragraph position="0"> The pronoun resolver is the last module in the program and perhaps the most complex. It operates on a complete DRS as an input and searches for conditions of the form 'pronoun(x) = * '. The reason we need to operate on complete DRSs is that the constraints on anaphoric relations imposed by the accessibilily relation require that the logical structure of the sentence be determined. The scope of various quantifiers and truth functional operators, however, is only fully determined at the level of complete DRSs. Once a DRS for a discourse D has been constructed, we go back and examine it-- looking for pronouns. We also construct a database in which all the reference markers introduced in the DRS for D are stored in a tree structure, the transitive closure of which delines the accessibility relation on the universe of the DRS for D.</Paragraph>
      <Paragraph position="1"> Associated with each reference marker in this database are the gender and number of the NP that introduced it. When we  come to a reference marker introduced by a pronoun, it will occur on a certain node n m in the tree. The reference markers accessible to it are all those on nodes nj such that there is a path from the root to n m passing through each nj. At this point the program searches back to find the first available reference marker whose associated gender and number agrees with the gender and number of the pronoun, and which satisfies certain other constraints that the pronoun might have. One constraint is that a reference marker x in a DRS K introduced by a non-reflexive pronoun cannot be identified with a reference marker y, if K already contains a condition of the form ~(x, y, z 1 ..... Zn). The opposite is true for a reference marker introduced by a reflexive pronoun. If these constraints are met, the program then replaces the condition 'pronoun(x) = *' with a condition of the form x = y, where y is the reference marker that was found to match x. Let us take a look at a typical example.</Paragraph>
      <Paragraph position="2">  (13) A man loves a dog. He feeds it.</Paragraph>
      <Paragraph position="4"> The pronoun reserver now takes over and produces a tree structure of available dicourse referents as it goes through the CON list of the DRS. In this case we have a simple tree of the form u 1 , &lt;sing, masc&gt;--- u 2 , &lt;sing, neut&gt;--- x 1 , &lt;sing, masc&gt; --x 2, &lt;sing, neut&gt;. As it is constructing such a tree, it also looks for reference markers introduced by pronouns. The condition 'pronoun(x1) = &amp;quot;' tells it that x 1 is such a reference marker. At this point, it now searches back for the reference marker in the database with the appropriate number and gender. After rewriting the conditions, the pronoun resolver prints out the DRS:</Paragraph>
      <Paragraph position="6"> There are, however, several problems with the resolver as described so far. One has to do with the anaphoric behavior of definites. Many English speakers find (14) acceptable but (15) bad.</Paragraph>
      <Paragraph position="7">  (14) If Mary likes every one who likes John i, then she likes him i. (15) *11 Mary likes every one who likes someonei, then she likes him i.</Paragraph>
      <Paragraph position="8">  Definites like proper names seem 1o be available for anaphoric linkage despite the presence of logical barriers to anaphora. But given our description of the pronoun resolver, we have not said anything that would distinguish the anaphoric behavior of definites as opposed to indefinites. We will follow the suggestion of Kamp (1983} and treat definites as having wide scope over the logical barriers to anaphora available in the present fragment. 5 Thus, while the program constructs the accessibility tree of reference markers, it places each reference marker introduced by a proper name or other definite at the root of the tree as well as in its normal position. This creates a certain amount of duplication abut allows us to get the right reading for (14) while still getting the preferred &amp;quot;most recent NP&amp;quot; readings first for the majority of anaphoric discourses. Thus, our program succeeds in finding the right DRS for (14) but predicts (15) to be bad. It also predicts that a sentence like (16) will fail, which seems marglnal to at least some English speakers.</Paragraph>
      <Paragraph position="9"> (16) ? If John likes heri, Fred likes Mary i.</Paragraph>
      <Paragraph position="10"> There are still more complexities, however, to the pronoun resolver. Though presently not implemented, we see a need to distinguish the resolution strategy for pronouns that occur in subordinate clauses from the standard one. The need to do this is made evident by the apparent acceptability of Bach-Peters type sentences with indefiniles like the one in (17): (17) A rnaniwho hardly knows herj loves a womanj who scorns him i.</Paragraph>
      <Paragraph position="11"> The processing story that must be told to make (17) acceptable is quite complex. For instance, it cannot be that relative and other subordinate clauses are simply not processed until the main DRS is already completed. Though this processing strategy would in conjunction with our pronoun resolver predict (17) to be good, it would also predict the marginal (18) as equally acceptable: {18) ? A man who hardly knows herj loves Maryj.</Paragraph>
      <Paragraph position="12"> Pronouns in subordinate clauses seem to pose an additional complication for the pronoun resolver. We believe pronouns in subordinate clauses should be handled slightly differently than those occurring in main clauses. Like other pronouns, when the resolver finds a pronoun in a subordinate clause ~ it attempts to find the appropriate antecedent. But if it fails, it leaves the pronoun unresolved until it has processed the rest of the DI:IS. If it finds a condition later on that is linked to the NP in which o~ occurs, then it will try again to find antecedents for all the unresolved pronouns in cc This sort of strategy would make (17) acce~otable but not-118L which is what is desired. We hope to incorporate this into the pronoun resolver soon.</Paragraph>
      <Paragraph position="13"> The pronoun resolver is an attempt to take the resolution of pronouns as far as is possible on a purely grammatical level. We realize that there are some cases where resolution depends on world knowledge and guesses about the speaker's intentions and the like. But using world knowledge is often computationaily expensive. Our goal was to develop a program that would minimize appeals to world knowledge in building a semantic representation of the content of a discourse and would instead use to maximum advanlage the information that can be gleaned from the words themselves.</Paragraph>
      <Paragraph position="14"> N 0 r E5 1We should note that there are other ways of defining accessibility. One might be tempted to do it wholly in terms of the possibility of assigning the DRS containing the Identification coherent truth conditions. Chierchia and Rooth (1984) investigate this possibility. We do not find their approach computationally useful, however, insofar as checking for lruth conditional consistency is any more expensive than following lhe accessibility constraints on DRSs.</Paragraph>
      <Paragraph position="15">  propositional attitude verbs. \]here it seems not all definites function in the same way. Some permit anaphoric linkage across propositional attitude contexts and some do not.</Paragraph>
    </Section>
  </Section>
class="xml-element"></Paper>
Download Original XML