File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/03/n03-2011_metho.xml
Size: 9,787 bytes
Last Modified: 2025-10-06 14:08:15
<?xml version="1.0" standalone="yes"?> <Paper uid="N03-2011"> <Title>Rhetorical Parsing with Underspecification and Forests</Title> <Section position="3" start_page="0" end_page="0" type="metho"> <SectionTitle> 2 Parse forests and underspecification </SectionTitle> <Paragraph position="0"> We will illustrate the underspecification of ambiguities with the following example: &quot;(1) Yesterday the delegates elected their new representative by a narrow margin. Even though (2) Smith got only 234 votes, (3) he accepted the position. But (4) his predecessor was rather irritated by the results.&quot; We take it that even though unambiguously marks a CONCESSION between the embedded clause (2, satellite) and the matrix clause (3, nucleus). For the purpose of illustration, we also assume that &quot;but&quot; can only signal a bi-nuclear CONTRAST relation with the second nucleus (4); the span of the first nucleus is in this case ambiguous (1-3 or 2-3). For linking (1) to the remaining material, we suppose that either ELABORATION (with nucleus (1)) or SEQUENCE holds. Further relations are possible, which will add to the possibilities, but our points can be made with the situation as just described.</Paragraph> <Paragraph position="1"> Instead of enumerating all possible rhetorical trees for our example text, we use a parse forest representation which compactly encodes the different analysises. A parse forest is basically an attributed And-Or-graph with the properties of subtree sharing and containment of ambiguities. The first property means that a subtree, which plays different roles in some bigger structure, is represented only once. The second property ensures that two subtrees which have in common the same category and the same terminal yield, but which differ in the first step of a leftmost derivation are unified together.</Paragraph> <Paragraph position="2"> Fig. 1 shows a simplified parse forest for the example text.</Paragraph> <Paragraph position="3"> Fig.1: Parse forest for the input text Subtree sharing is indicated by nodes (e.g. &quot;1&quot;) which have several incoming edges. Containment of ambiguities is exemplified in fig. 1 by the upper left contrast node which represents a disjunctive hypothesis concerning the span of the relation.</Paragraph> <Paragraph position="4"> Reitter and Stede (to appear) developed an XML-DTD scheme to represent such parse forests in XML notation.</Paragraph> </Section> <Section position="4" start_page="0" end_page="0" type="metho"> <SectionTitle> 3 Discourse structure parsing </SectionTitle> <Paragraph position="0"> In our approach, we combine a standard chunk parser which identifies the relevant units for discourse processing with a feature-based grammar which builds larger rhetorical trees out of these chunks. The categories and features we use are summarized in table 1.</Paragraph> <Paragraph position="1"> Cat. Feat.</Paragraph> <Section position="1" start_page="0" end_page="0" type="sub_section"> <SectionTitle> Values Comment </SectionTitle> <Paragraph position="0"> RST-tree cat macro_seg, s, ip, pp, ... The category of the RST-tree: macro segments, phrases sentences etc.</Paragraph> <Paragraph position="1"> type ns, nn, term Type of RST-tree: nuc-sat, multi-nuclear or terminal role nuc, sat Nucleus or satellite relation elaboration, contrast, cause, ...</Paragraph> <Paragraph position="2"> The relation which combines the daughters of the RST-tree.</Paragraph> <Paragraph position="3"> rst dp no_dp, but, although, null ...</Paragraph> <Paragraph position="4"> The discourse particle triggering the relation, or no_dp, if absent.</Paragraph> <Paragraph position="5"> There are three groups of grammar rules: 1. Rules combining chunks to terminal RST-trees 2. Rules combining discourse particles and sentence fragments to non-primitives RST-trees 3. Rules combining sentences or groups of sentences (so called macro segments) to non-primitive RST- null trees.</Paragraph> <Paragraph position="6"> An example for a rule in group 1 is the one which builds a terminal RST-tree of category mc (main clause) out of a discourse particle, and sentence fragment and a full stop (all examples are given in Prolog-style notation, with curly brackets indicating feature structures):</Paragraph> <Paragraph position="8"> punct({cat:fullstop}).</Paragraph> <Paragraph position="9"> Rules like this one are used to build terminal RST-trees for sentences like (4) in our example text. The second group of rules is exemplified by a rule which combines two terminal RST-trees - a subordinate clause containing a conjunction like even though and another clause - to a hypotactic RST-tree: (2) rst({cat:mc, rel:concession, dp:no_dp, type:ns}) ---> rst({cat:sc, dp:even_though, role:sat}), rst({cat:mc, dp:no_dp, role:nuc}).</Paragraph> <Paragraph position="10"> The macro segment building rules of the third group can be divided into two subclasses. The first class is constituted by rules which construct RST-trees on the basis of a relation that is triggered by a discourse particle. An example of this type is the possible contrastrelation between segments 4 and 2-3 in (1), which is triggered by the discourse particle but.</Paragraph> <Paragraph position="12"> rst({cat:macro_seg, role:nuc, dp:but}).</Paragraph> <Paragraph position="13"> The other subclass contains rules which freely construct branching RST-trees without the overt evidence of discourse particles. The relations which are typically involved here are SEQUENCE and ELABORATION. Relations which have in common the same type of nucleussatellite-configuration are unified into a single rule using the list-valued form of the relation-feature:</Paragraph> <Paragraph position="15"> rst({cat:macro_seg, role:nuc, dp:no_dp}), rst({cat:macro_seg, role:nuc, dp:no_dp}).</Paragraph> <Paragraph position="16"> Fig. 2 shows a parse tree which reflects one analysis of our example text. Note that the segments into which the input is broken usually smaller than sentences.</Paragraph> <Paragraph position="17"> as they can be used to combine partial structures without any structure triggering discourse particles. Furthermore, rules of the kind shown in (4) are on the one hand necessary to produce all possible branching structure over a given sequence of terminal elements. On the other hand they introduce massive ambiguities into the grammar which causes the number of analyses to grow according to the Catalan numbers (cf. Aho and Ullman, 1972, p. 165).</Paragraph> <Paragraph position="18"> It is therefore crucial that during parsing the construction of parse trees is strictly avoided because that would turn an otherwise polynomial parsing algorithm like chart parsing into an exponential one. Instead we incrementally build the parse forest mentioned in section 2. This is done by assigning a unique id to each edge introduced into the chart and by storing the ids of the immediate daughters within the edge. After parsing the parse forest is constructed by partitioning the set of edges into equivalence classes. Two chart edges E1 and E2 are in the same equivalence class if they a) have identical start and end positions and b) the categories of E1 and E2 subsume each other. For the subsumption test it is necessary to ignore the role-feature, because this feature is an attribute of the parse forest edges and not of the parse forest nodes.</Paragraph> <Paragraph position="19"> Besides keeping the parsing algorithm polynomial it is of equal importance to keep the grammar constant low. For example, rule (4) which establishes a SEQUENCE/ELABORATION relation between two macro segments also connects two simple clauses (of category mc), a macro segment and a simple clause, or a simple clause and a macro segment. The standard move to avoid this kind of rule multiplication is to introduce an unary chain rule of the form rst({cat:macro_seg}) ---> rst({cat:mc}) which ensures the desired level shifting.</Paragraph> <Paragraph position="20"> Because of the inherent relational nature of RST trees this solution is blocked. Instead we use an inheritance hierarchy like that in fig. 3 and replace rule (4) with the following one, which is underspecified w.r.t to the category feature.</Paragraph> <Paragraph position="21"> (5) rst({cat:macro_seg, rel:[sequence,elaboration], dp:no_dp, type:nn}) ---> rst({cat:rst_tree, role:nuc, dp:no_dp}), rst({cat:rst_tree, role:nuc, dp:no_dp}).</Paragraph> </Section> </Section> <Section position="5" start_page="0" end_page="0" type="metho"> <SectionTitle> 4 Related work </SectionTitle> <Paragraph position="0"> Similar to Marcu (2000) we assume discourse markers as indicators for rhetorical relations.</Paragraph> <Paragraph position="1"> But contrary to Marcu (1999) and also to Schilder (2002) we use a full-fledged discourse grammar and a standard parsing algorithm, which makes it, in our opinion, unnecessary to propose special rhetorical tree building operations, as suggested e.g. by Marcu (1999).</Paragraph> <Paragraph position="2"> By using the chart parsing algorithm combined with the construction of an underspecified parse forest, it can easily be shown that our method is of cubic complexity.</Paragraph> <Paragraph position="3"> This is a crucial property, because it is commonly assumed that the number of distinct structures that can be constructed over a sequence of n discourse units is exponential in n, (as it is for example implicit in the DCG based algorithm proposed by Schilder, 2002).</Paragraph> <Paragraph position="4"> Our system is robust in the same way as the one in Schilder (2002) because the grammar admits under-specified rhetorical trees in the absence of overt discourse markers.</Paragraph> </Section> class="xml-element"></Paper>