File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/98/w98-0607_metho.xml

Size: 28,611 bytes

Last Modified: 2025-10-06 14:15:06

<?xml version="1.0" standalone="yes"?>
<Paper uid="W98-0607">
  <Title>Integrating Referring and Informing in NP Planning</Title>
  <Section position="3" start_page="46" end_page="47" type="metho">
    <SectionTitle>
2 Information Representation
</SectionTitle>
    <Paragraph position="0"> To properly describe our NP-planning process, we need to describe how information is represented in ILEX. Domain knowledge is represented in terms of an information graph, which represents the entities of the domain, the interrelation between these entities (called facts); and the relations between these facts (e.g., a causal relation between two facts). Figure 1 shows an abstract representation of an information graph. At present, relations between facts are not used in tim NP-planner, so will not be discussed further here.</Paragraph>
    <Paragraph position="1"> Initially, the information graph representation was developed for text planning. However, following a suggestion from Alistair Knott, we have found it useful to use it for NP-planning as well.</Paragraph>
    <Section position="1" start_page="46" end_page="46" type="sub_section">
      <SectionTitle>
2.1 Entities
</SectionTitle>
      <Paragraph position="0"> Entities represent the objects of the domain. In the Museum domain, this includes not only the museum artifacts, but their designers, the materials they are made from, the styles they are made in, the periods they belong to, the locations of their manufacture, etc.</Paragraph>
      <Paragraph position="1"> Entities are typically specific entities: real-world individuals. However, some of the entities will be generic entities, those representing classes of entities, such as Art-Deco jewellery.</Paragraph>
      <Paragraph position="2"> We also cater to sets of entities, which can be realised through either plural anaphora, or conjunctive NPs.</Paragraph>
    </Section>
    <Section position="2" start_page="46" end_page="47" type="sub_section">
      <SectionTitle>
2.2 Facts
</SectionTitle>
      <Paragraph position="0"> A fact in ILEX represent a relation between two entities. These relations may be processual, e.g., that X made Y: maker(J-999, King01); or stative (e.g., that X is a type of Y: isa(j-999, generic-brooch).</Paragraph>
      <Paragraph position="1"> Each fact is represented as an attribute-value structure as below:  Note that apart from the predicate and argument information, several other fields qualify the informational status of the fact, including the polarity (whether or not the relation holds), and de feasibility (distinguishing between hard facts about the entity, and those which are only tendencies, e.g., Art-Deco jewellery tends to be made of enamel (see Knott et al. 1997 for discussion of defensibility in ILEX). The remaining fields, having a stronger affect on NP-planning, include: * Assimilation: the degree to which the system considers the user to have understood the information. This is of particular importance to reference, since adequate reference usually requires the user to know the information used for reference (see later for exceptions}.</Paragraph>
      <Paragraph position="2">  Importance/Interest: the degree to which the fact is considered important for the system to deliver to the user, and the system's estimate of the degree of interest to the user. These values are represented for each predicate type as a whole, and vary for different user models. These values are used when selecting the facts to use to produce a unique reference.</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="47" end_page="49" type="metho">
    <SectionTitle>
3 NP Specification
</SectionTitle>
    <Paragraph position="0"> One of our goals in the design of the ILEX NP-planner was to provide a clean interface between text-planning and NP-planning, such that the text planner can specify what it wants from the NP without needing to know about syntax at all. To this end, we have developed a two-level specification of the NP, one at the semantic level, and one at the syntactic level. The text-planner specifies the NP only at the semantic level, leaving details of syntax totally to the NPplanner. null</Paragraph>
    <Section position="1" start_page="47" end_page="48" type="sub_section">
      <SectionTitle>
3.1 The NP Specification Interface
</SectionTitle>
      <Paragraph position="0"> The interface between the text-planner and the NP-planner is in the form of an attribute-value matrix, the attributes of which are:  1. Cat: the function of the NP being pro- null duced. The NP-planner allows a wide range of NP functions, not only referring, shown in figure 2 and discussed below: (a) Referring: an NP which uniquely or non-uniquely refers to the referent.</Paragraph>
      <Paragraph position="1"> More delicate options can be specified, such as refer-by-name, refer-by-type, or refer-by-pronoun; and also whether the reference should be unique or not.  (b) Describing: an indefinite NP giving an arbitrary number of the entity's attributes without attempting to be definitive. null (c) Classifying: an indefinite NP which  provides only the superclass of the item, e.g., this is a brooch (d) Defining: for generic entities, an NP which provides the entities defining characteristics, e.g., a necklace is an item ofjewellery worn around the</Paragraph>
      <Paragraph position="3"> (e) Eliciting: a wh- NP for the referent.</Paragraph>
      <Paragraph position="4"> Eliciting can be selective, e.g., which designer or non-selective, e.g., Who.</Paragraph>
      <Paragraph position="5"> If referring-np is selected, various sub-types of reference can also be preselected through this slot (the specification of the Cat can be logically complex).</Paragraph>
      <Paragraph position="6"> If no preselection is made by the textplanner, the system will decide NP function on the basis of constraints and defaults.</Paragraph>
      <Paragraph position="7"> For instance, in the usual case, the text-planner will specify only referring-np, and leaves it up to the NP planner to choose the exact type of reference.</Paragraph>
      <Paragraph position="8"> In some cases, the system will override the preselection if it is incompatible with the referring environment. For instance, if the text-planner specifies refer-by-name, but no name information is provided for the entity, then a choice between refer-by-type and refer-by-pronoun will be made. The NP-specification can thus be seen to offer a flexible interface, allowing the text-planner to determine the amount of control it desires to assert.</Paragraph>
      <Paragraph position="9"> Sem: the referent of the NP, an entity in the information graph (or a set of such entities if desired, realised as either a coordinated NP, or using plural anaphora).</Paragraph>
      <Paragraph position="10"> Syn: the slot to be constructed by the NPplanner, a syntactic structure.</Paragraph>
      <Paragraph position="11"> Orth: the slot to hold the eventual surface string for the NP. If the sentence planner provides a filler for this slot, then NP-planning is avoided and the string is used. ILEX thus allows canning of NPs when needed.</Paragraph>
      <Paragraph position="12"> Agenda: a list of fact-ids which are to be incorporated into the NP if possible. The aggregation module uses this slot to state its requirements from the NP, which facts the NP is to express.</Paragraph>
      <Paragraph position="13"> Restrictions: a list of fact-ids which should not be used in the NP. For instance, we might wish to avoid generating the sentence Other jewels designed by Jessie M.</Paragraph>
      <Paragraph position="14"> King include a brooch designed by Jessie M.</Paragraph>
      <Paragraph position="15"> King. To avoid such sentences, we place  the fact-id of the 'designer' fact into the Restrictions field for the mention of the brooch. The designer fact will not then be used as a referential restrictor.</Paragraph>
      <Paragraph position="16"> A sample NP-specification is shown below: Cat: unique-reference&amp;quot; Sem: J-999 Agenda: (FN-48 FN-56) Restrictions: (FN-59) ...where FN-48 and FN-56 are facts to include in the reference, and FN-59 is a fact to avoid. Problems of Modularity: One of the problems of a clean separation between NP-specification and NP-planning is that it might not be possible to incorporate all facts on the informing agenda into the NP. However, given that NPs syntactically allow any number of non-referring post-modifiers, our planner will handle any arbitrary number of facts on the agenda. However, in terms of intelligibility, too many post-modifiers will produce unintelligible NPs. We make the simplifying assumption that the text planner uses some metrics (simple or complex) to avoid overloading the NP. We are merely providing a mechanism to support NP-planning once the agenda is specified.</Paragraph>
    </Section>
    <Section position="2" start_page="48" end_page="49" type="sub_section">
      <SectionTitle>
3.2 Focus Spaces
</SectionTitle>
      <Paragraph position="0"> Apart from the text-planner's specification of the NP-form, the text-planner also maintains some variables concerning the current referring environment, mainly in terms of various focus spaces. These various spaces are:</Paragraph>
      <Paragraph position="2"> Mentioned Entities: entities mentioned at some point within the discourse.</Paragraph>
      <Paragraph position="3"> Shared Entitles: entities which the system assumes the addressee to know about. These entities include world-knowledge (e.g., Ronald Reagan), but also entities mentioned previously in the discourse (mentioned-entities), and entities in the immediate context (focal-objects). Entities in this space are potential confusers for definite reference with the.</Paragraph>
      <Paragraph position="4"> Focal Objects: the focal space includes a set of entities which may potentially be referred to as this z. Firstly, we have the Prior-Cb (backward looking centre, usually the subject of the prior sentence). Entities directly related to this may also be focal.</Paragraph>
      <Paragraph position="5"> This is also called the local focus in our system. Then there is the page-focus, the focus of the current object description in the ILEX system, e.g., this brooch. Other objects are also focal by being part of the immediate context of the reader/writer. In a web-browsing environment, this might include the current page (this page), or parts of the page (this picture).</Paragraph>
      <Paragraph position="6">  In addition to the focal spaces, there are also variables holding individual focal objects, including the Cb and Prior-Cb, Page-Focus and Discourse-Focus. We allow pronominalisation only when the object being referred to is Prior-Cb, which seems to produce coherent reference.</Paragraph>
    </Section>
  </Section>
  <Section position="5" start_page="49" end_page="52" type="metho">
    <SectionTitle>
4 NP Structure for Referring and
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="49" end_page="49" type="sub_section">
      <SectionTitle>
Informing
</SectionTitle>
      <Paragraph position="0"> The NP-Planner has distinct procedures for each of the NP functions, one for classifying, one for referring, one for eliciting, etc. Due to lack of space, we will focus from now only on NPs primarily serving a referring function. Other nominal functions will be covered in a later paper. null The issue of how to construct a referring NP is well explored. The issue remaining is how non-referring information can best be incorporated into referring NPs. This section will look at the locations in the NP which can express non-referring information, without interfering in the reference of the NP. The following section will describe the algorithm which allows referring and informing to be integrated.</Paragraph>
    </Section>
    <Section position="2" start_page="49" end_page="49" type="sub_section">
      <SectionTitle>
4.1 Nucleus and Satelite Composition
</SectionTitle>
      <Paragraph position="0"> of the NP We consider the basic structure of the NP to fall into two components: a nucleus, which performs the nominal function of the NP, and optional satelites, where additional information can be placed. 1 The nucleus of the NP consists of all slots before the head, and the defining post-modifying slots (e.g., defining relative clauses, or prepositional phrases). The satelite elements are typically realised by indefinite NPs, or non-defining relative clauses (or complexes of such), e.g., \[Nue: this brooch\], \[Sat: designed by Jessie M. King\], or \[guc: King\], \[Sat: a Scottish designer\]. null In our model, all referring is performed by the nucleus - the satelite(s) are non-defining, i.e., perform only an informing role. However, as will be discussed below, the nucleus can also contain non-referring information.</Paragraph>
      <Paragraph position="1"> 1Nucleus and sateliPSe are terms taken from RST (Rhetorical Structure Theory, e.g., Mann &amp; Thompson, 1987), although usually applied to the relations between sentences.</Paragraph>
    </Section>
    <Section position="3" start_page="49" end_page="49" type="sub_section">
      <SectionTitle>
4.2 The Structure of the Nucleus
</SectionTitle>
      <Paragraph position="0"> The range of slots in a systemic analysis of the NP, in the order they typically appear (after Halliday 1994), appears below, and figure 3 shows a typical NP structure:</Paragraph>
      <Paragraph position="2"/>
    </Section>
    <Section position="4" start_page="49" end_page="51" type="sub_section">
      <SectionTitle>
4.3 Informing within the Nucleus
</SectionTitle>
      <Paragraph position="0"> While primarily for referring, non-referring information can sometimes be included in the nucleus without interfering with the referring function. For instance, we can add information to an already uniquely-referring NP, making the reference more explicit: The \[granny smith\] apple on the table; this \[enjoyable\] book. The degree to which informing and referring can be so integrated varies from domain to domain.</Paragraph>
      <Paragraph position="1"> The major constraint we seem to face is that there is a degree of expectation under conversational implicature that the speaker refers using information known to the addressee (see Dale &amp; Reiter 1996). Thus, in a situation where only one apple is visible, if I say pass me the Spanish apple, the addressee might be confused by the inclusion of the superfluous information, and perhaps think there must be another apple somewhere.</Paragraph>
      <Paragraph position="2"> However, in some registers this form of reference seems to offer no problems. Appelt (1985) mentions the case of the speaker pointing at some implement and saying use the wheelpuller. The addressee, not knowing the name, but having the item identified through pointing, accepts the naming. We thus have an NP whose head-noun is not serving a referring function, but rather an informing function, since the referring function was otherwise fulfilled.</Paragraph>
      <Paragraph position="3"> The newspaper genre is particularly strong on this type of reference, as shown by the newspaper article below: Student fights for life after flat fire: A young student was today fighting for  her life after fire ripped through her Edinburgh flat. Nicola Graham is in a &amp;quot;serious but stable&amp;quot; condition at the specialist burns unit in St John's Hospital, Livingston. Firefighters suspect the blaze may have been started by a dropped cigarette in Miss Graham's bedroom. The 19year-old was transferred from Edinburgh Royal Infirmary to St John's for emergency burns treatment .... &amp;quot; The sequence of references to the student successively add new information: A young student: Age and occupation; Nicola Graham: Name; Miss Graham: Marital status; the 19year-old: Age. This writer is not depending on assimilated information to refer, but, depending on the lack of potential confusors, is successfully referring with new information. While this style is more typical of newspaper reporting, where compact information delivery is important, it is still an issue which needs to be addressed in any NP-planner.</Paragraph>
      <Paragraph position="4"> In the register of museum object descriptions, it seems that the degree to which new information can be included in the nucleus is limited. New information seems not to be appropriate in the Deictic, Classifier, Thing or Qualifier slot, but is generally allowed in the Numerative and Epithet slots. This makes some degree of sense, since these slots are the least restrictive. The Numerative can be used restrictively when used contrastively, e.g., the five cups (butnot the set of three), but this is rare. Epithets generally add qualitative information, and are thus less restrictive. 2 Another approach is to examine the semantic types of pre-modifier elements, to see which, when inserted for informing reasons, seem to interfere with the referring function.</Paragraph>
      <Paragraph position="5"> We have found some of our fact-predicates interfere more, some less. As a result of this, we maintain a list of fact-predicates which are judged, for the current domain, to be suitable for pre-modifier slots without interfering with reference. This allows us to produce, for instance, this \[important\] designer; the |gold and enamel\] brooch designed by King; the \[quite in- null The tension in planning definite reference derives from the need to serve both the referring and informing functions. The referring function is mainly concerned with the Sere slot of the NP-specification: the task is to point uniquely at the filler of this slot, distinguishing it from all other entities. The planner may need to use any or all of the syntactic slots of the nucleus 2A study of 20 randomly chosen museum descriptions , undertaken by Cheng, from four museums and galleries revealed that only 1/3 of Epithets act restrictively.  to do this.</Paragraph>
      <Paragraph position="6"> On the other hand, the informing function is primarily concerned with the Agenda slot - the function is satisfied if all the facts in this slot are expressed somewhere in the NP (whether nucleus or satelite). While these facts can be placed in satelite position, it is often more coherent to place them within the nucleus.</Paragraph>
      <Paragraph position="7"> As such, the two functions are in competition for the syntactic slots, and structural decisions good for one function may be sub-optimal for the other. The usual approach is to allow the referring component to go first, generating the desired referring form. Then, the needs of the informing component are fitted into this structure. null However, we have found it far more rewarding to allow all syntactic decisions to be mutually negotiated between the two functions. Below, we describe the definite description algorithm as used in ILEX.</Paragraph>
    </Section>
    <Section position="5" start_page="51" end_page="51" type="sub_section">
      <SectionTitle>
5.1 Construction of NP
</SectionTitle>
      <Paragraph position="0"> The steps of building the NP are as follows: Build Nucleus: Since we wish to choose a referring expression which opportunistically serves some of the informing function, we will build the nucleus on the basis of i) the referential context, and ii) the agenda of things to say:  1. Location of Assimilated Agenda: the facts on the agenda need not all be new information (unassimilated) - the text-planner may place previously given information on  the agenda, perhaps for some pragmatic reason, e.g., to ensure that the addressee is aware of some fact at this point of the discourse.</Paragraph>
      <Paragraph position="1"> Assimilated facts on the agenda play an important role in our algorithm, since we will use them preferentially for referring.</Paragraph>
      <Paragraph position="2"> 2. Choose Referential form: we need to choose between refer-by-name, refer-by-pronoun and refer-by-type, on basis the referential context. Where the referential context allows more than one choice, we refer to the assimilated-agenda to help. For instance, if a Name fact is the agenda (and assimilated), we might favour refer-by-name, if Gender is the sole fact on the agenda, pronominal might be favoured. With several (assimilated or unassimilated) facts on the agenda, a common-group is preferred since it offers more opportunities for inclusion of facts (although proper-np expression also supports non-referring post-modification).</Paragraph>
      <Paragraph position="3"> 3. Choose Head Noun: for common nounphrases, the head noun will be chosen from the most specific assimilated isa fact about the object. However, an assimilated isa fact on the agenda is allowed to override the default.</Paragraph>
      <Paragraph position="4"> 4. Choose Determiner. for common nounphrases, the determiner will be chosen on the basis of the objects focal status (e.g., this if focal, the otherwise). If a fact specifying the owner of the entity being expressed is on the agenda (and assimilated), then a genitive deictic will be used.</Paragraph>
      <Paragraph position="5"> 5. Choose Restrictive Modifiers: if the common-noun-phrase form was selected, then we need to determine which modifiers are to be included to produce a unique reference. For instance, if we have the Style fact of a jewel on the agenda, and it is assimilated, then that will be preferred as a restrictive modifier. See below for more detail. null 6. Fill in Unused Slots: When we have a functioning referring form, then we can add information from the agenda into the unused slot, e.g., this book -b enjoyable ~ this enjoyable book. The system is provided with a list of fact-predicates which can be expressed in pre-modifier slots, e.g., in the Jewellery domain, Materials - this gold and enamel brooch, Fame- the famous designer called Jessie M. King.</Paragraph>
      <Paragraph position="6"> Add $atelites: Any information which was not consumed in the nucleus can now be placed into non-defining satelites, e.g., \[Nuc: Jessie M. King\], \[Sat: a Scottish designer\].</Paragraph>
    </Section>
    <Section position="6" start_page="51" end_page="52" type="sub_section">
      <SectionTitle>
5.2 The Confusor Set
</SectionTitle>
      <Paragraph position="0"> The confusor set is the set of entities which a partially constructed NP unintentionally refers to. For instance, if we have only selected the head noun, brooch, then the confusor set is all  brooches known to the system apart from the intended referent.</Paragraph>
      <Paragraph position="1"> The process of building a referring expression can be seen as successively reducing the confusor set until it is empty. Assuming a commonnp, the steps in this reduction are:  2. Restrict on class: Choose a head noun for the item, and eliminate all confusors which do not take the class.</Paragraph>
      <Paragraph position="2"> 3. Add Restrictive Modifiers: Choose a sub-set of assimilated facts which eliminates all confusors (see next sub-section).</Paragraph>
      <Paragraph position="3"> 4. Insert &amp;quot;One of&amp;quot; if needed: (not yet implemented) If insufficient assimilated facts to  eliminate all confusors, insert &amp;quot;one of' or &amp;quot;another off into the Pre-Deictic slot. &amp;quot;another of' is used if the confusor is already mentioned on the page.</Paragraph>
    </Section>
    <Section position="7" start_page="52" end_page="52" type="sub_section">
      <SectionTitle>
5.3 Choosing Restrictive Modifiers
</SectionTitle>
      <Paragraph position="0"> There are a number of strategies used to select the optimal set of restrictive modifiers to produce unique reference. There seems to be two main approaches. One attempts to select the smallest subset of modifiers which uniquely refers (e.g., Reiter 1990; Dale 1989). A solution which offers better ~omputational complexity is based on the premise that some fact-types are better suited as restrictive modifiers than others, and thus restrictive modifiers are chosen by incrementally taking the next modifier from the list (e.g., Reiter &amp; Dale 1992).</Paragraph>
      <Paragraph position="1"> In ILEX, we follow the incremental approach, adding restrictors in order from our (domaindependent) list (but only if the restrictor elim- null inates some confusors). We have found that ordering restrictors in terms of goodness, the NPs we generate are of better quality.</Paragraph>
      <Paragraph position="2"> The need to integrate informing into the process changes the process slightly. As stated above, the text-planner is allowed to place assimilated, as well as unassimilated, information on the agenda. If this has not happened, then we use the standard incremental strategy.</Paragraph>
      <Paragraph position="3"> However, if the text-planner has placed assimilated information on the agenda, then our planner places these at the front of the preferredrestrictors list.</Paragraph>
      <Paragraph position="4"> We note however, that there are cases where, while the text-planner may want the fact to be included, the fact is not a suitable restrictor.</Paragraph>
      <Paragraph position="5"> For instance, including the place-of-wearing fact on the agenda could result in an NP like the gold necklace that is worn around the neck. However, since the place-of-wearing does not actually discriminate (given all necklaces are worn around the neck), the fact was not used restrictively, and was later realised in a satelite of the NP, e.g., the gold necklace, which is worn around the neck. However, there may be facts which are partially restrictive, but nevertheless poor candidates for restriction. Our algorithm does not cater to these cases as yet. :</Paragraph>
    </Section>
  </Section>
  <Section position="6" start_page="52" end_page="53" type="metho">
    <SectionTitle>
6 An Example
</SectionTitle>
    <Paragraph position="0"> Agenda: Assume we are talking about an apple, and have the information as in figure 4 to express. In short, the facts on the agenda are: Owner, Variety, and Position.</Paragraph>
    <Paragraph position="1"> Referential Context: Assume also that we have several red apples, but only one on the table. The apple above has been mentioned, but not for a while, with other apples mentioned since.</Paragraph>
    <Paragraph position="2">  1. Choose Referential form: Since the item is not the Cb, we cannot use a pronoun. Since it doesn't have a proper-name, proper-noun reference is also out. We axe forced to use a common noun-phrase.</Paragraph>
    <Paragraph position="3"> 2. Choose Restrictive Modifiers: We have  a set of potential referential restrictors of: (Class Owner Color Variety Position Taste). Of these, we can only refer using assimilated roles, so we can use: (Class Color Position). We also have the agenda role-list of: (Owner Variety Position). of which the assimilated items axe: (Position). Since the Class fact is assimilated, we automatically take the class as the head of the referring NP, e.g., the apple. This is not however unique, so we need to add more restrictions. We use the first (and only) item in the assimilated agenda: Position: the apple on the table. This happens now to be unique, so we have a functional referring NP.</Paragraph>
    <Paragraph position="4"> 3. Fill in Unused Slots: This leaves two facts unexpressed: Owner and Variety. The Owner predicate can normally be expressed in one of two slots of the nucleus: * the Deictic slot e.g., John's apple on the table; or, * the Qualifier slot (after the Head noun, e.g., the apple that John owns on the table. (I assume here that non-restrictive relative clauses are always satelites, discussed below).</Paragraph>
    <Paragraph position="5"> In both of these slots, the inclusion of unassimilated Owner information seems to mess up the reference, seemingly because it implies the reader should already know the ownership. We thus leave the Owner role for expression in a satelite position (realised as a non-restrictive relative clause, e.g., the apple on the table, which john owns).</Paragraph>
    <Paragraph position="6"> The variety fact can be realised best through the Classifier slot, e.g., the Granny Smith apple on the table. This does not seem to interfere with the referring function, so this fact-type would occur on our list of facts which can appear in a pre-modifier slot without interfering with the referring function.</Paragraph>
    <Paragraph position="7"> This stage thus ends with the referring slot consisting of: the Granny Smith apple on the table. We have only one item left on the agenda, the Owner fact.</Paragraph>
    <Paragraph position="8"> Stage 2: Adding Satelites The Owner fact can be incorporated into the NP as a satelite (as a non-referring relative clause), e.g., the Granny Smith apple on the table, which John owns.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML