File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/04/w04-1806_abstr.xml

Size: 1,048 bytes

Last Modified: 2025-10-06 13:43:56

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-1806">
  <Title>Automatically Inducing Ontologies from Corpora</Title>
  <Section position="2" start_page="37" end_page="37" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> The emergence of vast quantities of on-line information has raised the importance of methods for automatic cataloguing of information in a variety of domains, including electronic commerce and bioinformatics. Ontologies can play a critical role in such cataloguing. In this paper, we describe a system that automatically induces an ontology from any large on-line text collection in a specific domain. The ontology that is induced consists of domain concepts, related by kind-of and part-of links. To achieve domain-independence, we use a combination of relatively shallow methods along with any available repositories of applicable background knowledge. We describe our evaluation experiences using these methods, and provide examples of induced structures.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML