File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/92/h92-1047_abstr.xml
Size: 929 bytes
Last Modified: 2025-10-06 13:47:36
<?xml version="1.0" standalone="yes"?> <Paper uid="H92-1047"> <Title>The Acquisition of Lexical Semantic Knowledge from Large Corpora</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> ABSTRACT </SectionTitle> <Paragraph position="0"> Machine-readable dictionaries provide the raw material from which to construct computationally useful representations of the generic vocabulary contained within them. Many sublanguages, however, are poorly represented in on-line dictionaries, if represented at all. Vocabularies geared to specialized domains are necessary for many applications, such as text categorization and information retrieval. In this paper I describe research devoted to developing techniques for building sublanguage lexicons via syntactic and statistical corpus analysis coupled with analytic techniques based on the tenets of a generative lexicon.</Paragraph> </Section> </Paper>