File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/e06-1042_abstr.xml
Size: 1,297 bytes
Last Modified: 2025-10-06 13:44:48
<?xml version="1.0" standalone="yes"?> <Paper uid="E06-1042"> <Title>A Clustering Approach for the Nearly Unsupervised Recognition of Nonliteral Language[?]</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In this paper we present TroFi (Trope Finder), a system for automatically classifying literal and nonliteral usages of verbs through nearly unsupervised word-sense disambiguation and clustering techniques.</Paragraph> <Paragraph position="1"> TroFi uses sentential context instead of selectional constraint violations or paths in semantic hierarchies. It also uses literal and nonliteral seed sets acquired and cleaned without human supervision in order to bootstrap learning. We adapt a word-sense disambiguation algorithm to our task and augment it with multiple seed set learners, a voting schema, and additional features like SuperTags and extra-sentential context. Detailed experiments on hand-annotated data show that our enhanced algorithm outperforms the base-line by 24.4%. Using the TroFi algorithm, we also build the TroFi Example Base, an extensible resource of annotated literal/nonliteral examples which is freely available to the NLP research community.</Paragraph> </Section> class="xml-element"></Paper>