File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/04/n04-3012_intro.xml

Size: 2,509 bytes

Last Modified: 2025-10-06 14:02:18

<?xml version="1.0" standalone="yes"?>
<Paper uid="N04-3012">
  <Title>WordNet::Similarity - Measuring the Relatedness of Concepts</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> WordNet::Similarity implements measures of similarity and relatedness that are all in some way based on the structure and content of WordNet.</Paragraph>
    <Paragraph position="1"> Measures of similarity use information found in an is-a hierarchy of concepts (or synsets), and quantify how much concept A is like (or is similar to) concept B. For example, such a measure might show that an automobile is more like a boat than it is a tree, due to the fact that automobile and boat share vehicle as an ancestor in the WordNet noun hierarchy.</Paragraph>
    <Paragraph position="2"> WordNet is particularly well suited for similarity measures, since it organizes nouns and verbs into hierarchies of is-a relations. In version 2.0, there are nine separate noun hierarchies that include 80,000 concepts, and 554 verb hierarchies that are made up of 13,500 concepts.</Paragraph>
    <Paragraph position="3"> Is-a relations in WordNet do not cross part of speech boundaries, so similarity measures are limited to making judgments between noun pairs (e.g., cat and dog) and verb pairs (e.g., run and walk). While WordNet also includes adjectives and adverbs, these are not organized into is-a hierarchies so similarity measures can not be applied.</Paragraph>
    <Paragraph position="4"> However, concepts can be related in many ways beyond being similar to each other. For example, a wheel is a part of a car, night is the opposite of day, snow is made up of water,aknife is used to cut bread, and so forth. As such WordNet provides relations beyond is-a, including has-part, is-made-of, and is-an-attribute-of. In addition, each concept is defined by a short gloss that may include an example usage. All of this information can be brought to bear in creating measures of relatedness. As a result these measures tend to be more flexible, and allow for relatedness values to be assigned across parts of speech (e.g., the verb murder and the noun gun).</Paragraph>
    <Paragraph position="5"> This paper continues with an overview of the measures supported in WordNet::Similarity, and then provides a brief description of how the package can be used. We close with a summary of research that has employed WordNet::Similarity.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML