File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/90/w90-0113_intro.xml
Size: 36,730 bytes
Last Modified: 2025-10-06 14:04:58
<?xml version="1.0" standalone="yes"?> <Paper uid="W90-0113"> <Title>An Object Oriented Approach to Content Planning for Text Generation</Title> <Section position="3" start_page="0" end_page="99" type="intro"> <SectionTitle> 2. Background </SectionTitle> <Paragraph position="0"> The GENIE project at Columbia addresses the problem of how to extend users' expertise in interactive environments. A communicative problem arises in such environments because users tend to get stuck in a starter set \[Finin 83\] of commands and never progress to more sophisticated methods for accomplishing tasks. Question</Paragraph> <Section position="1" start_page="0" end_page="99" type="sub_section"> <SectionTitle> 9fi </SectionTitle> <Paragraph position="0"> answering in such an environment requires that a text generator not only respond to the specific question asked, but also attempt to add a little extra information in order to enrich the user's knowledge. The domain we consider is Berkeley Unix Mail, chosen because it is a microcosm of Unix, providing extremely rich and sophisticated functionality with a frustratingly hard to learn interface.</Paragraph> <Paragraph position="1"> The domain also allows us to focus on extending system expertise rather than domain problem solving since the activities, sending, receiving and managing messages, do not require high level thinking skills.</Paragraph> <Paragraph position="2"> Interactive environments can be characterized as computational tools that allow users to accomplish tasks.</Paragraph> <Paragraph position="3"> Examples range from desk top aides such as word processors, message systems and spread sheets, to cad/cam systems and multi-purpose programming environments. Typically a core set of functionality can be combined and manipulated to construct solutions to domain specific problems. A user approaches such an environment with a domain specific or computational goal, and constructs a plan to satisfy that goal with the functional mechanisms available in the environment.</Paragraph> <Paragraph position="4"> In such open-ended settings no two users will approach the system with the same background or needs.</Paragraph> <Paragraph position="5"> Consequently the degree of difficulty of functionality is more dependent upon the particular experience of a user, than on broad categories of functional difficulty or user expertise \[Chin 88\]. Users tend to develop their own highly personal repertory of commands. For example, an &quot;expert&quot; user who has extensive experience reading messages, and a &quot;novice&quot;, one who has almost no experience reading messages, may both need to be introduced to a method for storing mail in files. The level of expertise of the first user is less significant than whether his or her particular expertise contributes to how much more easily the new method can be communicated.</Paragraph> <Paragraph position="6"> Therefore a particular goal may be satisfied by more than one plan, where the criteria for which plan is &quot;best&quot; is dependent upon the context of the task at hand. This context can be viewed as a model of the user, and it is the primary influence on the choice of content and the structure in which that content appears as text. It consists of: * The discourse context w The type of question the user is asking, and the specificity of the question. For example, is the user asking for a plan, for an explanation for why a plan failed, or for a better plan than the one he or she is currently using 2.</Paragraph> <Paragraph position="7"> eventually this structure should represent the dialogue between the user and GENIE over time. We keep the more grand label to remind ourselves of this.</Paragraph> <Paragraph position="8"> situation&quot; the user is in, that is, the state of the environment and the user's position in it.</Paragraph> <Paragraph position="9"> This includes background knowledge about domain entities such as what some one's email (electronic mail) address is.</Paragraph> <Paragraph position="10"> * Beliefs about the User's Task Expertise w The computational goals GENIE believes the user knows about and the plans the user has for satisfying them.</Paragraph> <Paragraph position="11"> Finally, since a user's primary goal is to complete the task, and only secondarily to get new information, he or she would prefer a succinct, informative answer.</Paragraph> <Paragraph position="12"> However the answer should also be sufficiently grounded, that is, it should contain advanced organizers \[Gagne & Briggs 79\] that explicitly articulate any of its goals, assumptions, and decisions in choosing a course of domain action that are not obvious to the user.</Paragraph> <Paragraph position="13"> An example scenario might help illustrate these points. Consider the user model defined at the top of Figure 2-1. The user asks the question, &quot;how do I send a message&quot;, which implies a question intent to receive a plan for the goal &quot;send a message.&quot; The discourse context contains the question intent &quot;given a goal give me a plan&quot;, and the expectation that such a plan exists, and that GENIE will provide it to the user. The statement of the goal further indicates that the user is asking about sending a single message, in contrast to &quot;sending mail&quot; where the quantity of messages is ambiguous. The goal statement does not however does not say anything about who the recipient might be. The details of the goal statement can have a profound effect on how GENIE selects content, and chooses to present it. Physically, the user is in the Unix shell, and has no new mail. In the past the user has never initiated the sending of a message, but has only sent messages in reply to messages s/he has received. Finally, as background, in Berekely Unix mail, one cannot &quot;get into&quot; the mail environment with the command &quot;mail&quot; unless one has new mail. The text GENIE produces as a result of this scenario appears in the lower part of the figure.</Paragraph> <Paragraph position="14"> Sentences 1 - 3 are produced as a result of responding to the user by introducing a plan for sending mail. Sentence 1 is an advanced organizer, which makes sure GENIE is answering the question the user asks. Note that if GENIE had true dialogue capabilities, it might pose this as a question. Only ff the user responded in the affirmative would it proceed with the remainder of the text as is.</Paragraph> <Paragraph position="15"> Sentences 4 and 5 are enrichment, elucidating why the user's usual plan won't work. Note too, how discourse goals are merged. For example, sentence 3 is both an example and a definition of syntax.</Paragraph> <Paragraph position="16"> Text generation for interactive environments therefore requires addressing the following goals: 1. Provide information about the relationship between Input: Discourse: How can I send a message to message? Situation: User is in Unix, with no new mail.</Paragraph> <Paragraph position="17"> Kathy is a user with email address &quot;kathy.&quot; User knowledge of domain tasks: User has never sent a message from Unix shell, but has only initiated sending from &quot;read mode&quot; afar reading new mail.</Paragraph> <Paragraph position="18"> Output: (1) I assume you are in unix, your recipient is an individual, and the address is a local address. (2) You must supply the email address of the individual. (3)For example, to send mail to Kathy, type mail kathy since kathy is her email address.</Paragraph> <Paragraph position="19"> (4) You usual method is to read your mail, then use the mail command inside the mail environment. (5) Since you do not have any new mail, your usual method will not work. Figure 2-1: An example scenario users' task (their computational goal) and the methods ((plans) that may be used to accomplish them. Do this within the current context of the task at hand.</Paragraph> <Paragraph position="20"> 2. Respond informatively, but also attempt to provide enriching material.</Paragraph> <Paragraph position="21"> 3. Provide advanced organizers, but do it in a manner that is concise, clear and cohesive.</Paragraph> <Paragraph position="22"> Taken together these goals suggest that a large amount of information must be presented in as short a text as possible. Evidence from three informal studies indicates that natural occurring texts satisfy these goal. First a set of textual materials, including manuals, tutorials, texts and canned text on-line resources were studied. Tutorials and on-line resources tended to focus on responding directly to the problem of how to do tasks. Reference manuals, text books and especially &quot;advanced user&quot; manuals focused on enrichment. In all, 20 passages covering information about Unix, Lisp, Pascal, Logo and a number of word processors were analyzed. These texts have strongly influenced our approach to content selection and structuring. In particular, we found that four types of strategies occurred through which information about the relationship between domain plans and goals was given: * Introducing: Presenting plans that the user has not encountered before.</Paragraph> <Paragraph position="23"> * Reminding: Briefly describing plans to which the user has been exposed, but may have forgotten.</Paragraph> <Paragraph position="24"> * Clarifying Distinctions: Explaining distinctions and options about plans.</Paragraph> <Paragraph position="25"> * Elucidating Misconceptions: Clearing up misunderstandings that have developed about plans to which the user has been exposed.</Paragraph> <Paragraph position="26"> These strategies had a consistent general structure that included sub-strategies such as summarizing or elaborating on a plan that had their own linguistic structure. See \[Wolz et al. 90\] for a thorough discussion of these strategies. The &quot;Introduce&quot; strategy will be discussed extensively in the following sections.</Paragraph> <Paragraph position="27"> The second analysis looked at 30 unix related question answering sessions in which the correspondence occurred through electronic mail. We found that the responses still fell within the four strategies. The respondent made an attempt to respond informatively, and in 8 cases explicitly included contextual information that acted as an advanced organizer, or that might not have been obvious to the questioner. In 7 cases the respondent included information that was not merely in response to the question, but that could extend the questioner's expertise. Most importantly however, we found instances where sub-strategies, and sometimes the four basic strategies themselves, were combined to produce very concise utterances. We also noticed that sub-strategies were absent when their contents was not critical, or when it was obvious from the question. The last of these points have led us to the goal of conciseness.</Paragraph> <Paragraph position="28"> A third study of human-to-human tutoring in a computer lab also corroborated the strategies, the incidence of enriching behavior, the use of advanced organizers.</Paragraph> <Paragraph position="29"> However, verbal responses tended to be significantly more long winded, exhibiting more of a stream of consciousness than the mail messages. Since GENIE produces interactive text, our goal is to most closely match the mail message behavior, and aim for conciseness.</Paragraph> <Paragraph position="30"> 3. Limitation of Current Approaches GENIE was developed as a result of the design goals enumerated above, namely to answer a question within the task at hand, satisfy the dual discourse goals of responding and enriching, and do this clearly and concisely. These goals presented some special problems for capturing textual structure, and for defining choice points for refining that structure into natural language clauses and vocabulary.</Paragraph> <Paragraph position="31"> In order to cover two simultaneous discourse goals through clear and concise text, we discovered that schemas \[McKeown 85, Paris 87\] provided too rigid a structure with insufficient definition of choice points, while plan operators \[Hovy 88, Moore & Paris 89\] provided too many choice points without sufficient structure. In order to answer a question within the task at hand, GENIE needed to make choice point decisions based on intentional \[Appelt 85, Moore & Paris 89\], domain and textual \[McKeown 85, Paris 87\] knowledge. Previous work on text generation has tended to put the emphasis on one of the three, incorporating the other two implicitly. The problem can be best illustrated by looking at the informal description of the strategy of &quot;introducing&quot; presented in Figure 3-1. The structure of this strategy was derived from principles of instructional design \[Gagne & Briggs 79\] and on the analysis of text. The informal definition contains both structure, in the form of how to expand an introduction into sub-structures such as &quot;summarize&quot;, and under what circumstances to do so, for example whether the goal is satisfied by an action or another plan, or whether the user knows or does not know the subgoal. This strategy assumes a recursive definition of a goal, where the goal can be satisfied by an action or indirectly by a plan that itself consists of a set of subgoals. This definition of a goal follows that used in STRIPS \[Fikes and Nilsson 71\] where the term goal refers to a state difference. It is to be distinguished from the goal state, or end state which is sometimes referred to as &quot;the goal.&quot; Casting this informal description as a formal schema looses the explicit information about what circumstances motivate expansion into particular sub-structures. For example, the schema definition presented in Figure 3-2 does not contain choice point knowledge.</Paragraph> <Paragraph position="32"> Conversely, casting the description as a set of plan operators looses the explicit structure in the informal description. The structure emerges only during planning. Researchers in text generation using plan operators (Appelt85,Hovy88, Moore-paris89) have adapted the hierarchical definition of a plan operator developed by Sacerdoti \[Sacerdoti 77\] A discussion of the merits and disadvantages of the subtle differences between definitions is not appropriate here. Instead, we must focus here on the components of plan operators that are critical to representing choice points and sub-structure, namely a constraint/action-body pair. Constraints or preconditions define choice points, while the body of the operator defines actions that are taken if the constraints are met. Figure 3-3 shows informal definitions of some of the operators that would be required to model the sub-structures that appear in Figure 3-1.</Paragraph> <Paragraph position="33"> Plan operators introduce an even more insidious problem. Note that both Introduce-plan and Introduce-action contain the operator &quot;state-goal&quot; in their bodies. Other high-level strategies such as Remind Plan or Elucidate Misconception also include this sub-structure. Recall that GENIE will initiate two strategies, one in response and one to enrich. For example, to produce the text in Figure 2-1 GENIE chose to respond by introducing a new plan to the user, and enrich, by elucidating why the user's usual plan won't work in the current situation. The fundamental 1. Informally, introducing a plan consists of a. Stating the goal.</Paragraph> <Paragraph position="34"> b. If the goal is satisfied by an action, introducing the action, otherwise: c. Stating any assumptions that affected the choice of plan.</Paragraph> <Paragraph position="35"> d. Summarizing the sub-goals for the plan.</Paragraph> <Paragraph position="36"> e. For each sub-goal either introducing or reminding about the plan for the sub-goal depending on whether the model of user task expertise indicates the user knows how to satisfy the sub-goal.</Paragraph> <Paragraph position="37"> f. If the plan is not the top-level plan, reviewing the steps in the plan through an example.</Paragraph> <Paragraph position="38"> g. Relating each step in the example to a sub-goal.</Paragraph> <Paragraph position="39"> 2. Introducing an action consists of: a. If the goal which the plan satisfies is the top level goal, stating the goal.</Paragraph> <Paragraph position="40"> b. Presenting the syntax.</Paragraph> <Paragraph position="41"> c. Describing the parameters.</Paragraph> <Paragraph position="42"> d. Describing any preconditions that must exist for it to work.</Paragraph> <Paragraph position="43"> e. Describing the effects (which is not the same as stating the goal).</Paragraph> <Paragraph position="44"> f. If the goal which the plan satisfies is the top level problem with having two discourse strategies within a single text is that when each is instanfiated as a sequence of speech acts, redundancy and conflicts may appear unless the instantiafions of those structures are coUaboratively developed. Furthermore, in GENIE, the highest level strategies (remind, introduce, elucidate, clarify) all contain at least one instance of recursion on another top level strategy. For example, introducing a plan requires introducing or reminding the substeps of the plan. Published descriptions of cur~nt content planners don't show how this kind of open-ended recursion is handled. Examples tend to only show how a top level discourse goal is elaborated through a refinement process. When runaway recursion might occur, computational tricks such as limiting the number of levels of deepening is suggested. This is not sufficient for GENIE'S purposes. For example consider a domain goal G that is satisfied by plan P, that expands into subgoals, that are satisfied by plans that themselves expand until a set of actions is reached. Clearly one can't limit the expansion since the actions are a critical component of the text. If one blindly follows the informal description of introducing the top level plan P, then all of the subgoals and sub-plans will be included recursively. Merely stating the top level goal and plan, and the resulting actions would seem to be a natural intuitive solution. But this would require different operators for &quot;high&quot; level strategies, than for those further down the structure. This thwarts the elegant nature of independent, generalized structures such as schema's and plan operators. To further aggravate the situation, consider that many of those actions may have the same preconditions. Again, if one blindly follows a generalized &quot;introduce action&quot; operator, then for all those actions that include precondition P, a speech act stating P will be included. One would prefer that P be stated at most once.</Paragraph> <Paragraph position="45"> Avoiding this problem through &quot;special case&quot; operators is insufficient. For example, consider two plan operators that handle when to state the goal.</Paragraph> <Paragraph position="46"> 1. If introducing in response, include a statement of the goal unless it is explicitly mentioned in the question. 2. If introducing as enrichment, do the same thing except if you already mentioned the goal in response, don't mention it again.</Paragraph> <Paragraph position="47"> The second operator is dependent upon the result of the first one, and although superficially they seem to be independent entities, they are not. Now a third special case is introduced when &quot;elucidating a misconception&quot;, because here too, there are circumstances in which the goal should be stated. All three operators know about each other implicitly through their constraints. Because plan operators based on Sacerdoti's hierarchical model assume a set of abstraction spaces, &quot;high level&quot; operators always precede lower-level ones, and the higher ones consequently have an implicit influence on the subordinate ones.</Paragraph> <Paragraph position="48"> A final problem with implementing the informal structure of Figure 3-1 as plan operators is that a significant amount of domain specific knowledge is embedded in the operators, The operators in Figure 3-3 illustrate this point. The &quot;state-...&quot; operators are essentially &quot;inform&quot; speech acts, but they imply that a specific sort of domain object is to be the subject of the informing. Although text plan operators are supposed to be textual in nature, it is far too easy to implicitly embed domain knowledge in them. This compounds the complexity in developing constraints because textual, domain, and intentional knowledge can be haphazardly intertwined. A similar problem occurs in creating the tests on schemas, since the test may be described by an arbitrary lisp expression. In developing GENIE, we found ourselves mired in layers of operators that contained such complex inter-related constraints. This suggested a different sort of architecture that would allow us to deal with the complexity of inter-related, but distinct influences, and at the same time model both structure and choice of structure in one formalism.</Paragraph> <Paragraph position="49"> 4. An Object-Oriented Architecture for Content Planning Given the problems described in the previous section, we will focus here on the aspects of GENIE'S object-oriented architecture that address the problems of text structuring and content choice. The over-all architecture is described in \[Wolz 90a\]. In particular, this section will illustrate how special case operators are avoided by cleanly dividing the structuring process into two stages. First, rhetorical strategies are refined with minimal constraints to the point of rhetorical acts. Second, the rhetorical acts contend for inclusion in the set that is realized as text. Some acts may have cause to be explicitly included or excluded, and among those that are included some may be merged with others, while some may need to be explicitly kept separate. Before these processes can be described however, it is necessary to articulate the nature of object-orientedness as it applies to GENIE, and describe the input to the text structuring phase.</Paragraph> <Paragraph position="50"> 4.1. Object Classes for Content Planning GENIE employs three overlapping classes of objects; domain objects, intentional objects and discourse objects. Domain Objects classify domain entities, including both the commands such as &quot;mail&quot; or &quot;copy&quot; that the user invokes, and the entities manipulated by commands such as flies, messages and users. Intentional Objects allow GENIE tO construct and analyze plans, and include computational goals, plans to satisfy them, and actions (which contain commands as a subclass). Discourse Objects include rhetorical strategies, rhetorical plans, rhetorical acts, syntactic structures and vocabulary. Note that these classes are not mutually exclusive. For example, a goal such as &quot;wanting to send mail&quot; is both an intentional and domain object since it is a domain specific goal.</Paragraph> <Paragraph position="51"> Each object class has explicit relationships to other classes. For example the class &quot;goal&quot; contains an attribute &quot;who-satisfies-me&quot;, whose value can point to either a plan or an action. A plan in turn is an object class that is related to a rhetorical strategy &quot;talk-about.&quot; The strategy in turn knows under what circumstances the object about which it talks (the plan) should be introduced, reminded about, clarified or elucidated, that is, it is related to these other rhetorical strategies. Finally, rhetorical strategies may be instantiated either by rhetorical plans, that recurse to other strategies, or by rhetorical acts such as &quot;stating&quot; that can be manifested as English texL</Paragraph> </Section> <Section position="2" start_page="99" end_page="99" type="sub_section"> <SectionTitle> 4.2. Constructing Domain Objects for Inclusion </SectionTitle> <Paragraph position="0"> in the Text Recall that GENIE answers questions about how to do tasks. In other words, it provides descriptions of the relationships between goals, plans, actions and the effects of actions in the domain. Providing an answer involves understanding the user's question in a situational context by instantiating a set of domain objects, generating the set of objects (domain goals, plans, actions, effects) that are expected in reply, and selecting the discourse objects that talk about those objects.</Paragraph> <Paragraph position="1"> For example, consider the question from Figure 2-1: &quot;How do I send a message?&quot; As a result of parsing this sentence, the discourse context will include an instantiation of the very specific goal &quot;send-mail.&quot; The discourse context will also include the question intent that the user expects to be told the best plan for the goal in the current context. A specialized plan class knows how to construct an instantiation of such a plan, calling it best-plan. It creates a relationship between the instantiation of the &quot;send-mail&quot; goal, and best-plan and relates best-plan to an instantiation of an abstract plan for satisfying that goal. It also instantiates discriminator objects that capture the assumptions and decisions that were made in constructing the abstract plan. Like other objects that might be included in the final text, these discriminator objects, which are domain dependent, have rhetorical strategies that can talk about them. best-plan is then refined, by instantiating the subgoals of its abstract plan until the process bottoms out as actions.</Paragraph> <Paragraph position="2"> Figure 4-1 shows some of the itentional objects that are instantiated for best-plan. Describing the actual process here would take us too far afield. Note that the discriminators include which part of the user model affected the choice. Decisions that are based on the discourse and situational context are &quot;stronger&quot; than those that are based on the model of user task knowledge. The former are deduced from facts, the latter, like default heuristics for object types, are deduced from weaker beliefs. The strength of the deduction will affect how the decision is described in the text.</Paragraph> <Paragraph position="3"> The best plan by itself is not sufficient for GENIE'S design goals. In particular, its construction offers little insight into what the user knows about it, or for that matter whether the user knows any plan for satisfying the goal, and whether that plan works in the current context. Therefore, in this context it is also necessary to construct a User Model Plan, user-plan, that has a similar structure. The two plans are &quot;compared&quot; by annotating differences in the sets of objects produced for each. More i00 importantly, the best-plan and user-plan are annotated as to whether each is valid in the current situation, and if so, whether they are the same plan. The information is critical to how each will be &quot;talked about.&quot; In the scenario of Figure 2-1 the user-plan will not work because it requires that new mail exist, which is not the case in the scenario. Once the relationship between the goal, best-plan and user-plan have been established, text structuring can begin.</Paragraph> </Section> <Section position="3" start_page="99" end_page="99" type="sub_section"> <SectionTitle> 4.3. Structuring Content: Instantiating Rhetorical Strategies </SectionTitle> <Paragraph position="0"> Structuring content consists of selecting two rhetorical strategies, one for responding, and one for enriching, and instantiating a rhetorical plan for each strategy. The selection is based on the nature of the question intent, the domain plans that have been constructed, whether those domain plans satisfy the goal, and whether they are identical. The selection rules were compiled from our design goals, and result in one plan being described in response and possibly a second being described as enrichment. The plans for the two strategies are then reeursively expanded until the process bottoms out as a set of rhetorical acts.</Paragraph> <Paragraph position="1"> Structuring begins when the domain plans such as best-plan and user-plan are sent messages to &quot;talk-about&quot; themselves 3. In the example scenario, best-plan decides that it should be &quot;introduced in response to the question&quot; because it is not identical to user-plan. Had the plans been identical, best-plan would have chosen to be reminded instead.</Paragraph> <Paragraph position="2"> user-plan decides that it should be &quot;elucidated as enrichment&quot; because it doesn't work in the current context.</Paragraph> <Paragraph position="3"> The strategies map to rhetorical plans that are a sequences of message passing instructions with simple control structures, such as interating on a list, and binary branching. A rhetorical plan is refined by sending those messages to other rhetorical strategies or to rhetorical acts that cannot not be further refined. The method for expanding the rhetorical plan is dependent upon the object type of the entity to be talked about. Classes of objects may use the same rhetorical strategy through inheritance, however, when appropriate two classes may require the same strategy, but have their own private rhetorical plans for those strategies. For example, given the informal descriptions of introducing a plan or an action in Figure 3-1, it is clear that the text structure of introducing these two kinds of things is dissimilar. The rhetorical strategy associated with introducing a plan is formally described as: This plan says, to introduce a plan, state the goal it satisfies. For each choice made to construct it, describe the reason for the choice. For each subgoal, summarize the subgoal, then for each subgoal, if the subgoal is known by the user, remind the user about how it can be satisfied, otherwise, introduce how it can be satisfied. A particular object, like a domain plan, has a set of attributes or slots that link it to other objects. For example, a plan has a &quot;goal-l-satisfy&quot; slot that relates it to the goal it satisfies, and the goal in turn has a &quot;who-satisfies-me&quot; slot that points to the plan. The statement &quot;(state-it goal-I-satisfy)&quot; says to send a &quot;state-it&quot; message to the object related to me by my &quot;goal-l-satisfy&quot; attribute. When this message is sent, a rhetorical object of class &quot;state-it&quot; is instantiated that links the calling rhetorical object with the called object, creating ties between them and the domain objects.</Paragraph> <Paragraph position="4"> Of particular significance is the fact that rhetorical plan are minimally constrained. For example, the plan for introducing a domain plan has no constraints on when to include the statement of the domain goal, so this plan can be used both to describe the best-plan, or one of its sub-plans, or the user-plan without complex constraints. More importantly, this rhetorical plan does not need to keep track of which operators are appropriate for different kinds of objects. For example, when expanding its subgoals, it merely sends a message to the object related to the subgoal by the &quot;who-satisfies-me&quot; slot. The object, either a plan or an action, upon receiving the message instantiates its localized rhetorical plan for reminding or introducing. This process continues until a set of rhetorical acts is constructed.</Paragraph> </Section> <Section position="4" start_page="99" end_page="99" type="sub_section"> <SectionTitle> 4.4. Contending for Inclusion: Communication Between Rhetorical Acts </SectionTitle> <Paragraph position="0"> The structuring stage produces an initial ordering of rhetorical acts that contain functional descriptions which when given to the surface generator can produce text. The form of these descriptions is based on the theory of functional grammar \[Halliday 85\]. Due to the minimally constrained refinement employed during the expansion of the strategies, the set of rhetorical acts produce redundant, extraneous or obvious information that would be terribly verbose if instantiated as text. For example, Figure 4-2 shows the text that would be produced if the first 6 rhetorical acts instantiated in the send mail example were i01 given to the surface generator. Note that these 6 acts are only 1/5 of the rhetorical acts that are produced in this example.</Paragraph> <Paragraph position="1"> You want to send mail.</Paragraph> <Paragraph position="2"> I know you are in Unix.</Paragraph> <Paragraph position="3"> You want to specify a recipient.</Paragraph> <Paragraph position="4"> I assume your recipient is an individual.</Paragraph> <Paragraph position="5"> You want to specify a single recipient.</Paragraph> <Paragraph position="6"> I assume the address is a local address.</Paragraph> <Paragraph position="7"> Figure 4-2: Text Produced Before Content Filtering some distance may find themselves next to each other, or even merged, if the acts between them are excluded. Once processing is complete, the remaining objects apply knowledge for constructing clause level structures which are then passed to the surface generator.</Paragraph> <Paragraph position="8"> Figure 4-3 shows how the 6 acts from Figure 4-2 contend to be included, and are eventually merged to produce the first sentence in Figure 2-1. The first act of stating the main goal can be excluded because it is in the discourse, and is probably obvious to the user. The second and third statements of goals may be excluded because they are part of a goal-plan-goal chain, that is, they supply unnecessary detail about the relationship between the top level goal and the actions. The result of these deletions makes the two &quot;weak&quot; discriminator statements adjacent, at which point they can be merged. This newly merged object can then be merged with the &quot;strong&quot; discriminator that follows it. In general, when weak assertions out number strong ones, the weak assertions can subsume the strong assertions, if the significance of the strong assertion isn't important. For example, it is acceptable to say &quot;I assume X and Y&quot; even if you &quot;know X&quot; provided the distinction between knowing X and assuming X is not critical to the response.</Paragraph> <Paragraph position="9"> The contention stage reviews this list and specifically includes, excludes, merges or keeps separate acts within this list. For example information derived from either the situational context, such as the mode the user is in, should be included because it may not be obvious to the user.</Paragraph> <Paragraph position="10"> Information derived from the discourse context, such as an explicit reference to the goal may be excluded as obvious because it was referred to in the question. In the process of both reminding and clarifying, the same preconditions may be introduced twice. These can be merged into one utterance. However, if a component of an utterance is critical, such as a failed precondition when elucidating a misconception, then it should be kept separate.</Paragraph> <Paragraph position="11"> The rhetorical acts contain the methods for determining how they can be filtered. This includes specific rules for when they may be excluded, when they must be included, what other objects they may or must be merged with and which one is dominant, and when they are to be kept separate. Contention for inclusion in the text begins when the set of rhetorical acts is sent messages to &quot;execute&quot; themselves. Each examines its methods for how it may be merged etc. Processing continues until all objects return successfully. Some objects will have been excluded and some will have instantiated new &quot;merged&quot; objects, subordinating themselves to the merged object.</Paragraph> <Paragraph position="12"> Restructuring occurs during merges and excludes. A merge may move an act up or down depending on whether the dominant act comes before or after the submissive one. Two acts which are initially separated by</Paragraph> </Section> </Section> class="xml-element"></Paper>