File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/p04-1080_intro.xml
Size: 1,099 bytes
Last Modified: 2025-10-06 14:02:24
<?xml version="1.0" standalone="yes"?> <Paper uid="P04-1080"> <Title>Learning Word Senses With Feature Selection and Order Identification Capabilities</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> This paper presents an unsupervised word sense learning algorithm, which induces senses of target word by grouping its occurrences into a &quot;natural&quot; number of clusters based on the similarity of their contexts. For removing noisy words in feature set, feature selection is conducted by optimizing a cluster validation criterion subject to some constraint in an unsupervised manner. Gaussian mixture model and Minimum Description Length criterion are used to estimate cluster structure and cluster number.</Paragraph> <Paragraph position="1"> Experimental results show that our algorithm can find important feature subset, estimate model order (cluster number) and achieve better performance than another algorithm which requires cluster number to be provided.</Paragraph> </Section> class="xml-element"></Paper>