File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/06/p06-2047_intro.xml
Size: 5,188 bytes
Last Modified: 2025-10-06 14:03:41
<?xml version="1.0" standalone="yes"?> <Paper uid="P06-2047"> <Title>Graph Branch Algorithm: An Optimum Tree Search Method for Scored Dependency Graph with Arc Co-occurrence Constraints</Title> <Section position="4" start_page="0" end_page="362" type="intro"> <SectionTitle> 2 Optimum Tree Search in a Scored DG </SectionTitle> <Paragraph position="0"/> <Section position="1" start_page="0" end_page="361" type="sub_section"> <SectionTitle> 2.1 Basic Framework </SectionTitle> <Paragraph position="0"> Figure 1 shows the basic framework of the optimum dependency tree search in a scored DG. In general, nodes in a DG correspond to words in the sentence and the arcs show some kind of dependency relations between nodes. Each arc has a preference score representing plausibility of the relation. The well-formed dependency tree constraint is a set of well-formed constraints which should be satisfied by all dependency trees representing sentence interpretations. A DG and a well-formed dependency tree constraint prescribe a set of well-formed dependency trees. The score of a dependency tree is the sum total of arc scores. The optimum tree is a dependency tree with the highest score in the set of dependency trees.</Paragraph> </Section> <Section position="2" start_page="361" end_page="361" type="sub_section"> <SectionTitle> 2.2 Dependency Graph </SectionTitle> <Paragraph position="0"> DGs are classified into some classes based on the types of nodes and arcs. This paper assumes three types of nodes, i.e. word-type, WPP-type2 and concept-type3. The types of DGs are called a word DG, a WPP DG and a concept DG, respectively.</Paragraph> <Paragraph position="1"> DGs are also classified into non-labeled and labeled DGs. There are some types of arc labels such as syntactic label (ex. &quot;subject&quot;,&quot;object&quot;) and semantic label (ex. &quot;agent&quot;,&quot;target&quot;). Various types of DGs are used in existing systems according to these classifications, such as</Paragraph> </Section> <Section position="3" start_page="361" end_page="361" type="sub_section"> <SectionTitle> 2.3 Well-formedness Constraints and Graph Search Algorithms </SectionTitle> <Paragraph position="0"> There can be a variety of well-formedness constraints from very basic and language-independent constraints to specific and language-dependent constraints. This paper focuses on the following four basic and language-independent constraints which may be embedded in data structure and/or the optimum tree search algorithm.</Paragraph> <Paragraph position="1"> (C1) Coverage constraint: Every input word has a corresponding node in the tree No two arcs in a tree occupy the same valence of a predicate (C1) and (C2), collectively referred to as &quot;covering constraint&quot;, are basic constraints adopted by almost all dependency parsers. (C3) is adopted by the majority of dependency parsers which are called projective dependency parsers. A projective dependency parser fails to analyze non-projective sentences. (C4) is a basic constraint for valency but is not adopted by the majority of dependency parsers.</Paragraph> <Paragraph position="2"> Graph search algorithms, such as the Chu-Liu-Edmonds maximum spanning tree algorithm (Chu and Liu, 1965; Edmonds, 1967), algorithms based on the dynamic programming (DP) principle (Ozeki, 1994; Eisner, 1996) and the algorithm based on the B&B method (Hirakawa, 2001), are used for the optimum tree search in scored DGs. The applicability of these algorithms is closely related to the types of DGs and/or well-formedness constraints. The Chu-Liu-Edmonds algorithm is very fast (C7B4D2BEB5 for sentence length D2), but it works correctly only on word DGs. DP-based algorithms can satisfy (C1)-(C3) and run efficiently, but seems not to satisfy (C4) as shown in 2.4.</Paragraph> <Paragraph position="3"> (C2)-(C4) can be described as a set of co-occurrence constraints between two arcs in a DG. As described in Section 2.6, the DF can represent (C2)-(C4) and more precise constraints because it can handle co-occurrence constraints between two arbitrary arcs in a DG. The graph branch algorithm described in Section 3 can find the optimum tree from the DF.</Paragraph> </Section> <Section position="4" start_page="361" end_page="362" type="sub_section"> <SectionTitle> 2.4 SVOC and DP </SectionTitle> <Paragraph position="0"> (Ozeki and Zhang, 1999) proposed the minimum cost partitioning method (MCPM) which is a partitioning computation based on the recurrence equation where the cost of joining two partitions is the cost of these partitions plus the cost of combining these partitions. MCPM is a generalization of (Ozeki, 1994) and (Katoh and Ehara, 1989) which compute the optimum dependency tree in a scored DG. MCPM is also a generalization of the probabilistic CKY algorithm and the Viterbi algo5Another condition for projectivity, i.e. &quot;no arc covers top node&quot; is equivalent to the crossing arc constraint if special root node , which is a governor of top node, is introduced at the top (or end) of a sentence.</Paragraph> </Section> </Section> class="xml-element"></Paper>