File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/i05-2002_abstr.xml
Size: 1,463 bytes
Last Modified: 2025-10-06 13:44:12
<?xml version="1.0" standalone="yes"?> <Paper uid="I05-2002"> <Title>A Hierarchical Parsing Approach with Punctuation Processing for Long Chinese Sentences</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In this paper, the usage and function of Chinese punctuations are studied in syntactic parsing and a new hierarchical approach is proposed for parsing long Chinese sentences. It differentiates from most of the previous approaches mainly in two aspects.</Paragraph> <Paragraph position="1"> Firstly, Chinese punctuations are classified as 'divide' punctuations and 'ordinary' ones. Long sentences which include 'divide' punctuations are broken into suitable units, so the parsing will be carried out in two stages. This 'divide-and-rule' strategy greatly reduces the difficulty of acquiring the boundaries of sub-sentences and syntactic structures of sub-sentences or phrases simultaneously in once-level parsing strategy of previous approaches. Secondly, a grammar rules system including all punctuations and probability distribution is built to be used in parsing and disambiguating.</Paragraph> <Paragraph position="2"> Experiments show that our approach can significantly reduce the time consumption and numbers of ambiguous edges of traditional methods, and also improve the accuracy and recall when parsing long Chinese sentences.</Paragraph> </Section> class="xml-element"></Paper>