<?xml version="1.0" standalone="yes"?>
<Paper uid="W00-1211">
  <Title>Statistics Based Hybrid Approach to Chinese Base Phrase Identification</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> Decomposing syntactic analysis into several phases in order to reduce its difficulty is a recent trend in NLP research. The success of POS tagging has encouraged researchers to explore further possibilities for resolving sub-problems in parsing (Zhou et al., 1999). Typical examples are the recognition of BaseNP in English and Chinese.</Paragraph>
    <Paragraph position="1"> In English, a BNP (base noun phrase) is defined as a simple, non-nesting noun phrase, i.e. a noun phrase that does not contain other noun phrase descendants (Church, 1988). Subsequent research on BNP identification has reported promising results for this task in English. Observing that the Chinese BNP differs from the English one, Zhao &amp; Huang (1999) put forward a definition of the Chinese BNP in terms of the combination of a determinative modifier and a head noun. According to them, a BNP in Chinese can be recursively defined as:  Inspired by this research, we extend the concept of BNP to the Base Phrase in Chinese. The extension rests on the observation that there are many structures, not only NPs, in which minor components attach closely to their central words and constitute a basic phrase in a Chinese sentence. Resolving all these base phrases will greatly benefit a Chinese parser by relieving it of some pre-processing (though non-trivial) and enabling it to focus on the most subtle syntactic structures.</Paragraph>
    <Paragraph position="2"> Since the whole system of Chinese base phrases is still under discussion, this paper presents only some tentative research results on a statistics-based hybrid model for Chinese base phrase identification. For the seven types considered at present, our algorithm yields promising results and smooths the way for a better Chinese parser.</Paragraph>
  </Section>
</Paper>