File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/w00-1219_abstr.xml
Size: 819 bytes
Last Modified: 2025-10-06 13:41:56
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1219"> <Title>Extraction of Chinese Compound Words - An Experimental Study on a Very Large Corpus</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> This paper introduces a statistical method for extracting Chinese compound words from a very large corpus. The method is based on mutual information and context dependency. Experimental results show that it is efficient and robust compared with other approaches. We also examine the impact of different parameter settings, corpus size, and corpus heterogeneity on the extraction results. Finally, we present information retrieval results to show the usefulness of the extracted compounds.</Paragraph> </Section> </Paper>