File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/92/c92-2093_metho.xml
Size: 28,594 bytes
Last Modified: 2025-10-06 14:12:59
<?xml version="1.0" standalone="yes"?> <Paper uid="C92-2093"> <Title>INTEGRATING QUALITATIVE REASONING AND TEXT PLANNING TO GENERATE CAUSAL EXPLANATIONS</Title> <Section position="1" start_page="0" end_page="0" type="metho"> <SectionTitle> INTEGRATING QUALITATIVE REASONING AND TEXT PLANNING TO GENERATE CAUSAL EXPLANATIONS </SectionTitle> <Paragraph position="0"> Several works IMcKeowu 86, Suthers 88\] have emphasized the common aspects of Explanation Production in expert systems and Text Generation. The work described in this paper deals with text generation applied to a particular type of explanations: causal explanations of physical systems. They have akeady motivated influential developments in the field of qualitative reasoning about physical systems. A central goal of the theories developed in \[De Kleer 84\] and \[Forbus 84\] was to elaborate conceptual frameworks for providing causal accounts of physical systems, sensitive to our commonsense understanding of the physical world. Those qualitative causal models constitute an adequate starting point as we are interested in how people present such causal explanations in verbal form.</Paragraph> <Paragraph position="1"> We will describe our approach for text generation, based ou the study of texts collected in encyclopedia and textbooks, and currently developed in a system intended to be associated to the qualitative simulation system SQUALE \[J6z6quel & Zimmer 92\]. Our conceptual model, which constitutes the input to the text generation process, is based on Qualitative Process Theory IForbus 84\].</Paragraph> <Paragraph position="2"> According to the &quot;traditional&quot; division of tasks in text generation, the transition from conceptual representation of causal behaviour to causal explanation in natural laugouge is viewed as a three-stage process: content specifica tion, text organization and surface generation.</Paragraph> <Paragraph position="3"> The content specification task aims at posting communicative goals described by means of communicative acts on conceptual entities. In particular, the causal explanation to be produced is often restricted to ,some particular events of the causal behaviour. We will show how relevant information and appropriate communicative acts are identified.</Paragraph> <Paragraph position="4"> Text organization is the most elaborate part of our model and is also divided into three tasks.</Paragraph> <Paragraph position="5"> The first is concerned with the construction of a textual structure from a set of communicative acts established during content specification. This structure, which takes an intermediary place between communicative acts and surface realizations, specifies essentially prescriptions on grouping and ordering of textual units. This process is achieved through the application of discourse strategies which control local Iransitions from communicative acts to possible organizational preseriptions. We dcseribe three strategies used for structuring causal explanations: a causal chain strategy (for organizing simple causal chains), a parallel strategy (to impose a parallel structure on the text), and a concessive strategy (for performing concessive acts). The second task segments the textual structure into sentential contents. Several factors are revolved, mainly communicative form of textual relations and number of textual unite expressed in a sentence. 7'he stylistic adjustment task deals with stylistic considerations related to lexico-syntactic choice, s. We want to ensure an inter-sentential compatibility, from a stylistic point of view, between linguistic realizations. null Concerning surface generation, two types of systems have been considered: a sentence geuerator \[Fournier 91\] based on the Meaning-Text Theory \[Mel'cuk 88a\] and a syntactic component based on equivalenee classes between sentence structures.</Paragraph> <Paragraph position="6"> Our approach suggests a clear separation of knowledge sources involved in each of these stages. We will start with a presentation of the conceptual model, where we propose a characterization of causal interactions betweeo physical events (essentially, physical parameter changes). The next sections deal with the different stages of the text generation process.</Paragraph> <Paragraph position="7"> AcrEs DE COLING-92, NA~res, 23-28 no(n' 1992 6 1 7 PRoc. OF COLING-92, NANTES, AUO. 23-28, 1992 H- THE CONCEPTUAL FRAMEWORK Qualitative Process Theory helps to capture commonsense knowledge implicit in mathematical models of traditional physics. Besides, it provides an ontological framework for expressing physical causality, by expliciting the physical mechanisms. In order to describe physical situations, two kinds of structures are used : individuals views, for representing physical objects and behavioural abstractions, and physicalprocesses such as fluid and heat flows. Gradual parameters are associated to identified entities (individual views and processes) and causal dependencies are expressed by means of two types of qualitative relations: influences, for direct effects of processes, and qualitative proportionalities, which propagate direct effects of proeesses on the other parameters. It follows that the initial cause of a parameter change is always a process.</Paragraph> <Paragraph position="8"> Qualitative relations can aim at opposite effects. In die physical situation of figure 1, the mercury expansion tends to increase the mercury level while the tube expansion tends to lower it. The first influence is predominant, since an increase of the mercury level is observed in that situation, and thus the second is inhibited 1. In order to represent different types of causal interactions, we have introduced the causal primitives cause, contrary-cause and opposite-effects which are defined according to influences between events: ,, cause: causal relation between two events linked with a predominant influence (for example, mercury expansion is the cause of mercury level rise) * contrary-cause: contrastive relation between two events linked with an inhibited influence. One of the events is the &quot;contrary cause&quot; of the other (tube expansion is a contrary cause of mercury level rise).</Paragraph> <Paragraph position="9"> * opposite-effects: contrastive relation between two events which aim at opposite effects (mercury expansion and tube expansion).</Paragraph> <Paragraph position="10"> The last two primitives are called causal opposition relations. In that framework, physical system behaviours are represented with influence graphs, where physical events are linked with causal primitives. An influence graph encloses all events and causal interactions identified in the given physical situation.</Paragraph> <Paragraph position="12"> The content specification is primarily concerned with selection of relevant physical events and causal interactions from the influence graph. But the problem should not be narrowed to factual knowledge selection, for the conceptu1. It does not mean that the mercury expansion has no effect on the mercury level but that no effect is perceived at that level of a~tr~ction.</Paragraph> <Paragraph position="13"> ul content of the text cannot be considered independently of the speaker's communicative intentions. Communicative roles assigned to conceptual entities have crucial implications at each stage of the text generation process. Hence, they should be made explicit at the content specification level as well as their related conceptual units. In our model, the content specification task results in a setofcommunicafive acts, which represents the illocutionary structure of the message. These acts are performed so as to produce some effects on the hearer, as modifying his belief system or affecting his emotional states. They may be characterized according to these in tended effects. We foc us here on three types of acts: inform, explain and concede, and especially in the context of causal explanation production. Roughly speaking, an act of informing aims at making the hearer believe some proposition while an act of explaining, viewed as a discourse act, has the effect of Supporting a previously introduced proposition in order to favour its acceptance. (1) and (2) are respective realizations of acts of informing and explaining: (1) La dilatation du mercure provoque la hausse du niveau de mercure, (The mercury expansion causes the rise of the mercury level.) (2) Le niveau de mercure monte, paree que le mercure se dilate.</Paragraph> <Paragraph position="14"> (The mercury level rises, because of the mercury expansion.) (1) is a possible achievement of the act:</Paragraph> <Paragraph position="16"> In terms of speech acts, (1) i~ a single assertion, which informs the hearer of the causal connection between the two events. In contrast, (2) contains two assertions, the second assuming an explicative function. Note that in beth eases the same conceptual content is involved. Some differences between sentences (1) and (2) as the order of information units are due to distinct communicative role assignments.</Paragraph> <Paragraph position="17"> We will show in the next .section how these intended &quot;rhetorical&quot; effects influence choices at the text organization level.</Paragraph> <Paragraph position="18"> But now let us turn to the content specification procedures.</Paragraph> <Paragraph position="19"> The determination of communicative acts highly depends on the problem solving activity (here, qualitative simula2. In the formalism of Qualitative Process Theory, the Ds function provides the derivative sign of the parameter given as argument. Ds\[X\]=-I means that X decreases, Ds\[X\]=0 that X is steady and Ds\[X\]=I that X increases.</Paragraph> <Paragraph position="20"> Ac'rns DE COtiNG-92, NANTES, 23-28 AOt\]'r 1992 6 1 8 PROC. OF COLING-92, NANTES, AUG. 23-28, 1992 cause ~rary--cause cause w contained-liquid(m, tube, mercury) Ds\[temperature(w)\]=-I Ds\[tempe\[ature(tube)\]=l Ds\[tempe\[ature(m)\]=l liquid(w, water) heatjlow(hfl, w, tube) C/~-/degPpdegslte-~n'ea~Ts~ heat-flow( hf2 , tube, m) D s\[ volume( tube )\] =1 D s lvolume(m) \]= l talnlrary~cause cause fig I : An example of physical system (a thermometer immersed in hot water ) and its behavioural description with an influence graph</Paragraph> <Paragraph position="22"> augmente. Cette dilatation provoque la hausse du niveau tie mercure.</Paragraph> <Paragraph position="23"> (There is heat flow from the tube to the mercury. As a consequence, the mercury expands,for its temperature rises. This expansion causes the mercury level to rise.) fig 2 : Application of the causal chain strategy</Paragraph> <Paragraph position="25"> car sa quantitE de chaleur diminue, et celle du tube augmente, car sa quantitE de chaleur augmente.</Paragraph> <Paragraph position="26"> (There is a heat flow from the water to the tube. As a consequence, the temperature of the water decreases,for its heat decreases and the temperature of the tube increases, as its heat increases.) fig 3 : Application of the parallel strategy AcrEs DE COLING-92, NAgrp.s, 23-28 AO~r 1992 6 1 9 Paoc. OF COLING-92, NANTES, Auo. 23-28. 1992 tion) andthus is governed by both general rhetorical principles and domain dependent considerations. It proceeds as an incremental process and each step determines more precisely the relevant information. First the notion of themeobject, similar to global focus \[Grosz 86\] (and adapted to generation by Mc Keown), is used to carry out a rough selectiou of relevant causal interactions: events which are not in (direct or indirect) causal interaction with any theme-object are ignored. The theme-object identification depends on the user interaction mode with the system. In the simplest cases, theme--objects can be identified with privileged properties of the physical system, such as outputs of an artifact. But in a dialogue system, an analysis of user queries is necessary.</Paragraph> <Paragraph position="27"> Second, the selection is refined and emnmunicative acts are selected according to user's beliefs and expectations expressed in his request. Esr, ccially, communicative acts on causal opposition relations are introduced in two kind of situation: (a) if a complete description of causal behaviour is required or (b) within a concessive strategy, which seem s appropriate when the user focuses on an expected but yet unoeccurred event. For example, in order to reply to the question &quot;why doesn't the mercury level drop ?&quot;, causal opposition relations are not ignored for it suggests that the user is interested with inhibited influences which might have caused a mercury drop. Similar considerations are expressed in Mc Coy's shemata for correcting misconceptions about object classifications \[McCoy 89\].</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="metho"> <SectionTitle> IV- TEXT ORGANIZATION </SectionTitle> <Paragraph position="0"> Text organization deals with multiple communicative acts simultaneously. This is one of the main features of our approach for it provides the ability to impose a global structure on the text. Global coherence cannot be ensured if communicative acts are considered independently of one another. Additionally, there is not a one to one correspondence between communicative acts and sentences of the text. A single utteranceoften realizes multiplecomm unicalive acts and inversely a single act may be spread on several utterances. For example, sentence (1) may also be uttered so as to achieve the three following acts: informing the hearer of mercury expansion and mercury level rise, and that these events are causally related. This conception in language generation has been initiated by Appelt in the KAMP system \[Appelt 85\]. He showed how multiple illocutionary acts can be achieved in a single utterance.</Paragraph> <Paragraph position="1"> Communicative acts are not directly mapped to linguistic forms. The textual structure is introduced as an intermediary level of abstraction between the specified content and the text. It is mainly composed of textual units, inter-clausal relations, which indicate how these units should be related, and order prescriptions.</Paragraph> </Section> <Section position="3" start_page="0" end_page="0" type="metho"> <SectionTitle> IE.I - Inter-Clausal Relations </SectionTitle> <Paragraph position="0"> Several text generation systems use inter-clausal relations which have been introduced in textual linguistics under various forms \[Grimes 75\], \[Van Djik 81\], \[Halliday 761.</Paragraph> <Paragraph position="1"> R hetorical Structure Theory (RST) \[Mann &Thompson 87\] is a signi fican t approach of this trend and it has been partially used in text planning systems \[Hovy 88\], \[Moore & Swartout 89\]. In RST, the structure of a text is obtained by composition of rhetorical relations expressing various kindsof dependencies, ~metimes semantic (&quot;subject matter&quot; relations in RST terminology), sometimes purely rhetoricai (presentational relation in RST). In contrast, in our approach the intended rhetorical effects are considered as communicative goals and are specified at the content specification level. As a consequence, inter-clausal relations involved in our system are exclusively semantic. Furthermore, they are characterized in a way similar to the paratactic/hypotactic distinction of rhetorical predicates \[Grimes 75\] so that they can take different communicative forms. Under the equiponderant form, the same interest is assigned to the arguments of the relation while under the preponderant form, one of them is presented as more central. null Arguments of relations are composed of textual units, which are semantic translations of conceptual entities. An argument may be simple (limited to a single textual unit) or complex (composed of several textual units).</Paragraph> <Paragraph position="2"> At present our work on inter-clausal relations focuses on causal and adversative relations (i.e. on relations involved in the expression of our causal primitives).</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> IV.I.I - Causal Inter-Clausal Relations </SectionTitle> <Paragraph position="0"> In the cause-effect inter-clausal relation, events denoted in the first argument are presented as being the cause of events denoted in tile second. According to the communicative form (equipondorant or preponderant) of the relation, different types of communicative acts may be achieved. The following elementary transitions (in a simplified form) both lead to the intrc~luction of a cause-effect relation (in PROLOG notation, arguments in capitalized letters represent variables): transition(inform_of_a_causal link, inform(cause(C,t~ ), cause-effect(equi,C,E)).</Paragraph> <Paragraph position="1"> transition(explain with a causal link, \[explain(C,E), cause(C, E)I, cau.~'e-effect(pre, C, E)).</Paragraph> <Paragraph position="2"> But there is not necessarily a direct correspondence between relations of conceptual level and relations of textual level, as in the above transitions. Hence, the following transition may hold in some context, even if no causal interaction exists between on the one hand C 1, E 1 and on the other</Paragraph> <Paragraph position="4"> As mentioned in section H, behavioural description of a physical system contains not only causal links between events but also causal opposition relations. These relations are often phrased in natural language with concessive connectives (but, though, however,...) which, following Piaget \[Piaget 78\], express a discordance between cause am1 effect. The sentences (3) and (4) illustrate respectively expressions of contrary~cau.~e and opposite-effects relations: null (3) Le tube se dilate uu~is le niveau de mercure raonte. (The tube expands but the mercury level rises.) (4) Le tube se dilate raais le mercure se dilate aussi. (The tube expands but the mercury expands too.) The study of linguistic connectives within the linguistic Theory of Argumentation \[An~ombre&Ducrot 83\], \[Aascombre 84\] and especially the French connective mais (but) has showed that two kinds of semantic opposition may be expressed by concessive connectives: direct opposition, where one of the two linked propositions is presented as an argument for a conclusion oppo~d to the other proposition, and indirect opposition, where the two propositions are presented as arguments for opposite conclusions. (3) conveys a direct opposition since the conclusion (the mercury level drop) aimed by the first proposition is opposed to the second one (the mercury level rise) and (4) conveys an indirect opposition since the two propositions aim at opposite conclnsions (mercury level drop and mercury level rise).</Paragraph> <Paragraph position="5"> In order to represent these semantic oppositions, we have introduced the adversative inter~zlausal relations directopposition and indirect-opposition. Acts of informing on causal opposition relations are then realized by means of these adversative relations:</Paragraph> <Paragraph position="7"> indirect.~opposition(equi,Cl ,C2)).</Paragraph> <Paragraph position="8"> But also concessive acts, taking advantage of the concessive nature of adversative relations:</Paragraph> <Paragraph position="10"> A s we diseussed earlier, corn manicafi ve acts should not be examined independently ofoneanother if one wants topro~ duce well-structured texts. A local transition represents a possible treatment of a limited number of communicative acts at textual level. Hence, choice and application of local transitions are governed by discourse strategies, They determine transitions which may be applied after an analysis of the overall set of communicative acts. In particular, they cxploit the underlying conceptual structure ofcomnurnicalive acids. For cau~l explanations, the mainly used strategies are the following: * Causal Chain Strategy: If the underlying conceptual structure is a causal chain and communicative acts are of the inform or explain type then lollow causal order and apply transitions informofa_causallink and explain with a causal link (cffig 2).</Paragraph> <Paragraph position="11"> * Parallel Strategy: It is a form of parallelism rhetorical &quot;figure&quot; which may be used when the underlying conceptual structure is comlx~sed of two causal chains with a common initial cause. This strategy also exploit the causal order and transitions inform of common cause link and inform of multiple causal_links hold the highest priorities (cffig 3).</Paragraph> <Paragraph position="13"> * Concessive Strategy: This strategy deals with concessive acts which involve causal opposition relations.</Paragraph> <Paragraph position="14"> A preferential order is suggested: the conceded fact pre-cedes the other units.</Paragraph> <Paragraph position="15"> Priorities are assigned to the strategies so that conflicts can be solved.</Paragraph> </Section> <Section position="2" start_page="0" end_page="0" type="sub_section"> <SectionTitle> IVJ - Textual Structure Segmentation </SectionTitle> <Paragraph position="0"> The purpose of the segmentation task is to delimit the content of the ~utences which will constitute the final text.</Paragraph> <Paragraph position="1"> The determination of a sentential content involves several heuristics. Some of them aim at increasing the scope of the sentence while others aim at reducing it. One of the main heuristics deals with the communicative nature of inter-clausal relations: since preponderant relations often appear in subordinate forms at syntactic level, equiponderant relations are privileged. Hence, delimitation of sentential content starts with the choice of equiponderant relations. The content is then completed with preponderant relations. In addition, the number of textual units of a sentence is limited and introduced relations may be removed from the sentential content if it contains too many textual units. The segmentation is &quot;also coustralued by the conceptual nature of inlormation units. For instance, an initial cause may be realized in a single sentence.</Paragraph> <Paragraph position="2"> AcrEs DE COLING-92, NAIVrEs, 23-28 APt)l&quot; 1992 6 2 1 Prtoc. ov COLING-92, NANTES, AUO. 23-28, 1992 V- SURFACE GENERATION VI - RELATED WORKS The text planner is independent of the surface generation component. (except the stylistic adjustment task). Two different types of sentence generation approaches have been examined.</Paragraph> <Paragraph position="3"> Oar main approach aims at coupling the text planning system with a generic sentence generator \[Fournier 91 \] based on the Meaning-Text Theory \[Mel'cuk 88\]. In the Meaning-Text Model, the input representation is a semantic network enriched with several structures, mainly the bipartitions theme/rheme, asserted/presupposed and given/new.</Paragraph> <Paragraph position="4"> The construction of semantic networks is based on a conceptual-semantic dictionary, which specifies correspondences between concepts and semantic definitions. The given~new structure has already been integrated in text generation systems (e.g. INirenburg 88\]). It denotes speci fic linguistic constructions and will constrain choices during realization. For instance, a predicative structure will be rather realized as an assertion if it belongs to the new part or as a nominalization if it belongs to the given part. This characterization of semantic representations plays a crucial role from a textual perspective. The generation process must be sensitive to progression of new information through the text, even at the realization stage. In this spirit, distribution of semantic units among given and new parts is carried out with regard to conceptual units introduced in previous sentences. Predicative structures of the semantic network whose referents have been previously evoked are systematically marked as given and the others as new. Once the semantic representation is built (only the semantic network and the given/new structure have been considered), the realizer produces surface forms and paraphrases may be generated, essentially by lexical substitutions.</Paragraph> <Paragraph position="5"> The second approach we are experimenting relies on equivalence classes between sentence structures, which may be compared to Discourse Grammars described in \[Danlos 85\]. An equivalence class represents potential realizations of a kind of semantic content. Our investigation is currently limited to classes related to causality. We have defined classes for expressing causal and opposition relations. This categorization of linguistic expressions provides a suitable basis for stylistic adjustment mechanisms. Syntactic and lexical constraints arising from stylistic considerations govern the choice of sentence structures within classes. In particular, we have implemented a &quot;principle of stylistic variety&quot; which prevents excessive use of similar syntactic constructions. This principle is particularly useful in causal explanation generation because concepts involved are of similar types. To illustrate this point, let us consider the text in figure 2. When generating the last sentence, stylistic adjustment recommends the realizer to avoid coordinative structures (with connectives such as thus, consequently, ...) since such a construction has been used in the previous sen-tence. A structure with a verb conveying causality (e.g. to cause, to result in, ...) is then selected.</Paragraph> <Paragraph position="6"> Our system benefits from a number of earlier works on language generation, as those we have mentioned along the above sections. We will focus the comparison on two particular points. The first concems the overall architecture and modularization in text generation systems; the second deals with text planning applied to causal explanation generation. Text planning and surface generation have often been conceived as separated components. But separation of tasks is not always extended within text planning. For example, rhetorical schemata used in TEXT \[McKeown 85\] specify in the same structure not only information units which may be introduced bat also their orders in the final text. Other systems as GOSSIP \[Carcagno & Iordanskaja 89\] and PAULINE \[Hovy 88b\] have isolated content determination from text structuring. The approach described in this paper also proposes a clear separation of the content specification and the textual organization skills. The main motivation behind this division is that it is convenient to decide what to say (more precisely, to set (almost) all communicative goals to achieve) before taking any organizational decisions. Hence, content specification proceeds regardless of structuring considerations. However, we think that a more cooperative interaction between these two major components is necessary to allow goal revisions at the text organization level.</Paragraph> <Paragraph position="7"> Oar system can also be compared to TAILOR \[Paris 87\] which generates natural language descriptions of physical devices. A significant advantage of TAILOR is its ability to combine structural and causal description strategies. Nevertheless, causal interactions are restricted to simple causal links and there is no attempt to explicit the roles they can play la discourse.</Paragraph> </Section> </Section> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> VII - FUTHER DEVELOPMENTS </SectionTitle> <Paragraph position="0"> First a better conceptual characterization of physical systems would contribute to improve the quality of causal explanations. We need a more precise description of causal interactions which allows, for instance, to discern enable~ ment conditions from other causal links.</Paragraph> <Paragraph position="1"> With regards to text planning, number of extensions are possil)le. We intend to define strategies for structural descriptions and also enhance the control mechanisms of discourse strategies. Furthermore, practical validation of the overall approach requires a larger coverage of communicative acts. Another interesting extension would consist in relating stylistic adjustment mechanisms to pragmatic features \[Hovy 89\] in order to strengthen context sensitivity.</Paragraph> </Section> class="xml-element"></Paper>