<?xml version="1.0" standalone="yes"?>
<Paper uid="W06-0117">
  <Title>Posts and Telecommunications yuandong@bupt.edu.cn</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> This paper presents two word segmentation (WS) systems and a named entity recognition (NER) system developed at France Telecom R&amp;D Beijing. One WS system, built for the open tracks, is based on an n-gram language model; the other, built for the closed tracks, is based on a maximum entropy approach. The NER system uses a hybrid algorithm that combines a class-based language model with rule-based knowledge. All of these systems are augmented with a set of post-processors.</Paragraph>
  </Section>
</Paper>