File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/w06-0117_abstr.xml
Size: 731 bytes
Last Modified: 2025-10-06 13:45:19
<?xml version="1.0" standalone="yes"?> <Paper uid="W06-0117"> <Title>Posts and Telecommunications yuandong@bupt.edu.cn</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> This paper presents two word segmentation (WS) systems and a named entity recognition (NER) system developed at France Telecom R&amp;D Beijing. The WS system for the open tracks is based on an n-gram language model, and the one for the closed tracks is based on a maximum entropy approach. The NER system uses a hybrid algorithm that combines a class-based language model with rule-based knowledge. All of these systems are augmented with a set of post-processors.</Paragraph> </Section> </Paper>