File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/96/c96-1039_abstr.xml

Size: 1,238 bytes

Last Modified: 2025-10-06 13:48:29

<?xml version="1.0" standalone="yes"?>
<Paper uid="C96-1039">
  <Title>Identification and Classification of Proper Nouns in Chinese Texts</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> Various strategies are proposed to identify and classify three types of proper nouns in Chinese texts. Clues from character, sentence and paragraph levels are employed to resolve Chinese personal names.</Paragraph>
    <Paragraph position="1"> Character, Syllable and Frequency Conditions are presented to treat transliterated personal names, To deal with organization names, keywords, prefix, word association and parts-of-speech are applied. For fair evaluation, large scale test data are selected from six sections of a newspaper.</Paragraph>
    <Paragraph position="2"> The precision and the recall for these three types are (88.04%, 92.56%), (50.62%, 71.93%) and (61.79%, 54.50%), respectively.</Paragraph>
    <Paragraph position="3"> When the former two types are regarded as a category, the performance becomes (81.46%, 91.22%). Compared with other approaches, our approach has better performance and our classification is automatic.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML