<?xml version="1.0" standalone="yes"?>
<Paper uid="C96-1013">
  <Title>Concept clustering and knowledge integration from a children's dictionary</Title>
  <Section position="2" start_page="55" end_page="55" type="metho">
    <SectionTitle>
2 Transforming definitions
</SectionTitle>
    <Paragraph position="0"> Our definitions may contain up to three general types of information, as shown in the examples in  information. Such information is frequently used for noun taxonomy construction (Byrd et al., 1987; Klavans et al., 1990; Barrière and Popowich, To appear August 1996).</Paragraph>
    <Paragraph position="1"> * general knowledge or usage: This gives information useful in daily life, such as how to use an object, what it is made of, what it looks like, etc. * specific example: This presents a typical situation using the word defined and it involves specific persons and actions.</Paragraph>
    <Paragraph position="2"> Cereal is a kind of food. [description] Many cereals are made from corn, wheat, or rice. [usage] Most people eat cereal with milk in a bowl. [usage] Ash is what is left after something burns. [usage] It is a soft gray powder. [description] Ray watched his father clean the ashes out of the fireplace.  The information given by the description and general knowledge will be used to perform the knowledge integration proposed in section 3. The specific examples are excluded as they tend to involve specific concepts not always deeply related to the word defined.</Paragraph>
    <Paragraph position="3"> Our processing of the definitions results in the construction of a special type of conceptual graph which we call a temporary graph. The set of relations used in temporary graphs comes from three sources. Table 1 shows some examples for each type.</Paragraph>
    <Paragraph position="4"> 1. the set of closed class words, ex: of, to, in, and; 2. relations extracted via defining formulas, ex: part-of, made-of, instrument; defining formulas correspond to phrasal patterns that occur often throughout the dictionary and suggest particular semantic relations (i.e. A is a part of B) (Ahlswede and Evens, 1988; Dolan et al., 1993).</Paragraph>
    <Paragraph position="5"> 3. the relations that are extracted from the syntactic structure of a sentence, ex: subject, object, goal, attribute, modifier.</Paragraph>
    <Paragraph position="6"> As some relations are defined using the closed class words, and many of those words are ambiguous, the resulting graph will itself be ambiguous. This is the main reason for calling our graphs temporary as we assume a conceptual graph, the ultimate goal of our translation process, should contain a restricted set of well-defined and non-ambiguous semantic relations. For example, by can be a relation of manner (by chewing), time (by noon) or place (by the door). By keeping the preposition itself within the temporary graph, we delay the ambiguity resolution process until we have gathered more information and we even hopefully avoid the decision process as the ambiguity might later be resolved by the integration process itself.</Paragraph>
    <Paragraph position="7"> Table 1 (fragment): example relations and their corresponding temporary graphs: [eat]-&gt;(agent)-&gt;[John]; [A]-&gt;(goal)-&gt;[B]; [eat]-&gt;(goal)-&gt;[grow].</Paragraph>
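As a rough illustration of the temporary-graph idea in this section, the Python sketch below stores a graph as a set of (concept, relation, concept) triples and keeps an ambiguous closed-class word as the relation label until it can be resolved. The names TemporaryGraph, AMBIGUOUS and candidate_senses are hypothetical and do not come from the paper; only the three candidate senses of "by" are taken from the text.

```python
from dataclasses import dataclass, field

# Candidate semantic relations for ambiguous closed-class words.
# The senses of "by" follow the text; the table itself is an assumption.
AMBIGUOUS = {"by": ("manner", "time", "place")}


@dataclass
class TemporaryGraph:
    """A temporary graph as a set of (source, relation, target) triples."""
    triples: set = field(default_factory=set)

    def add(self, source: str, relation: str, target: str) -> None:
        self.triples.add((source, relation, target))

    def candidate_senses(self, relation: str) -> tuple:
        # An unambiguous relation is its own single candidate.
        return AMBIGUOUS.get(relation, (relation,))

    def unresolved(self) -> list:
        """Triples whose relation still has more than one candidate sense."""
        return sorted(t for t in self.triples
                      if len(self.candidate_senses(t[1])) > 1)


g = TemporaryGraph()
g.add("eat", "agent", "John")   # [eat]->(agent)->[John], as in Table 1
g.add("eat", "by", "chew")      # made-up triple: preposition kept, resolved later
print(g.unresolved())
```

Keeping the preposition as the label means no decision is forced at parse time; a later integration step can replace it with one of the candidate senses.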
  </Section>
  <Section position="3" start_page="55" end_page="66" type="metho">
    <SectionTitle>
3 Knowledge integration
</SectionTitle>
    <Paragraph position="0"> This section describes how, given a trigger word, we perform a series of forward and backward searches in the dictionary to build a CCKG containing useful information pertaining to the trigger word and to closely related words. The primary building blocks for the CCKG are the temporary graphs built from the dictionary definitions of those words using the transformation process described in the previous section. Those temporary graphs express similar or related ideas in different ways and with different levels of detail.</Paragraph>
    <Paragraph position="1"> As we will try to put all this information together into one large graph, we must first find what information the various temporary graphs have in common and then join them around this common knowledge.</Paragraph>
    <Paragraph position="2"> To help us build this CCKG and perform our integration process, we assume two main knowledge structures are available, a concept hierarchy and a relation hierarchy, and we assume the existence of some graph operations. The concept hierarchy concentrates on nouns and verbs as they account for three quarters of the dictionary definitions. It has been constructed automatically according to the techniques described in (Barrière and Popowich, To appear August 1996). The relation hierarchy was constructed manually. A rich hierarchical structure between the set of relations is essential to the graph matching operations we use for the integration phase.</Paragraph>
    <Paragraph position="3"> As we are using the conceptual graph formalism to represent our definitions, we can use the graph matching operations defined in (Sowa, 1984). The two operations we will need are the maximal common subgraph algorithm and the maximal join algorithm. 3.1 Maximal common subgraph. Finding the maximal common subgraph between two graphs consists of finding a subgraph of the first graph that is isomorphic to a subgraph of the second graph. In our case, we cannot often expect to find two graphs that contain an identical subgraph with the exact same relations and concepts. Ideas can be expressed in many ways and we therefore need a more relaxed matching schema. We describe a few elements of this &quot;relaxation&quot; process and illustrate them by an example in Figure 2.</Paragraph>
    <Paragraph position="4"> (1) John makes a nice drawing on a piece of paper with the pen. [make]-&gt;(sub)-&gt;[John] -&gt;(obj)-&gt;[drawing]-&gt;(att)-&gt;[nice] -&gt;(on)-&gt;[piece]-&gt;(of)-&gt;[paper] -&gt;(with)-&gt;[pen] (2) John uses the big crayon to draw rapidly on the paper. [draw]-&gt;(sub)-&gt;[John] -&gt;(on)-&gt;[paper] -&gt;(instrument)-&gt;[crayon] (Figure 2: ... subgraph and maximal join algorithms.) Semantic distance between concepts. In the maximal common subgraph algorithm proposed by (Sowa, 1984), two concepts (C1, C2) could be matched if one subsumed the other in the concept hierarchy. We can relax that criterion to match two concepts when a third concept C which subsumes C1 and C2 has a high enough degree of informativeness (Resnik, 1995). The concept hierarchy can be useful in many cases, but it is generated from the dictionary and might not be complete enough to find all similar concepts.</Paragraph>
    <Paragraph position="5"> In the example of Figure 2, when using the concept hierarchy to establish the similarity between pen and crayon, we find that one is a subclass of tool and the other of wax; both are then subsumed by the general concept something. We have reached the root of the noun tree in the concept hierarchy and this would give a similarity of 0 based on the informativeness notion.</Paragraph>
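The informativeness criterion above can be sketched as follows, assuming the usual reading of (Resnik, 1995): the similarity of two concepts is the information content, log(1/p), of their most informative common subsumer. The hierarchy fragment below only reproduces the pen/crayon chain from the text; the concept "hammer", all counts, and the function names are made up for illustration.

```python
import math

# Toy fragment of a concept hierarchy (child -> parent). Only the pen/crayon
# chain comes from the text; "hammer" and every count are invented.
PARENT = {"pen": "tool", "hammer": "tool", "crayon": "wax",
          "tool": "something", "wax": "something"}
COUNT = {"pen": 4, "hammer": 2, "crayon": 2, "tool": 1, "wax": 1, "something": 0}


def ancestors(concept):
    """The concept itself followed by all of its subsumers up to the root."""
    chain = [concept]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def subtree_count(concept):
    """Occurrences of the concept plus those of every concept it subsumes."""
    return sum(n for c, n in COUNT.items() if concept in ancestors(c))


TOTAL = subtree_count("something")


def informativeness(concept):
    # log(1 / p(concept)); the root has p = 1 and therefore informativeness 0.
    return math.log(TOTAL / subtree_count(concept))


def similarity(c1, c2):
    """Informativeness of the most informative common subsumer."""
    common = [c for c in ancestors(c1) if c in ancestors(c2)]
    return max(informativeness(c) for c in common)


print(similarity("pen", "crayon"))   # only common subsumer is the root
print(similarity("pen", "hammer"))   # common subsumer "tool" is informative
```

With this toy data, pen and crayon meet only at the root, which is the situation that motivates graph-based matching instead of a purely taxonomic search.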
    <Paragraph position="6"> We extend the subsumption notion to the graphs. Instead of finding a concept that subsumes two concepts, we will try finding a common subgraph that subsumes the graph representation of both concepts. In our example, pen and crayon have a common subgraph [write]-&gt;(inst)-&gt;~. The notion of semantic distance can be seen as the informativeness of the subsuming graph. The resulting maximal common subgraph as shown in Figure 2 contains the concept label-1. This label is associated to a covert category as presented in (Barrière and Popowich, To appear August 1996).</Paragraph>
    <Paragraph position="7"> We can update the concept hierarchy and add this label-1 as a subclass of something and a superclass of pen and crayon. It expresses a concept of &quot;writ-</Paragraph>
    <Paragraph position="9"> Relation subsumption. Since we have a relation hierarchy in addition to our concept hierarchy, we can similarly use subsumption to match two relations. In Figure 2, with is subsumed by instrument, and by mapping them, we disambiguate with from corresponding to another semantic relation, such as possession or accompaniment. This is a case where an ambiguous preposition left in the temporary graph is resolved by the integration process.</Paragraph>
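A minimal sketch of this relation matching, under a simplifying assumption: the manually built relation hierarchy is flattened into a table listing, for each ambiguous preposition, the semantic relations it may stand for. The table name MAY_EXPRESS and the function match_relations are hypothetical; the three senses of "with" are the ones named in the text.

```python
# Semantic relations that an ambiguous preposition may stand for.
# The entry for "with" follows the text; the flat table is a simplification
# of the paper's relation hierarchy, not its actual structure.
MAY_EXPRESS = {"with": {"instrument", "possession", "accompaniment"}}


def match_relations(r1, r2):
    """Return the relation to keep if r1 and r2 can be matched, else None."""
    if r1 == r2:
        return r1
    if r2 in MAY_EXPRESS.get(r1, ()):
        return r2   # r1 was an ambiguous preposition, now resolved to r2
    if r1 in MAY_EXPRESS.get(r2, ()):
        return r1   # same case with the arguments swapped
    return None


print(match_relations("with", "instrument"))
print(match_relations("with", "goal"))
```

Matching "with" against "instrument" keeps the unambiguous relation, which is how the preposition left in a temporary graph gets resolved as a side effect of integration.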
    <Paragraph position="10"> Predictable meaning shift. A set of lexical implication rules was developed by (Ostler and Atkins, 1992) for relating word senses. Based on them, we are developing a set of graph matching rules. Figure 2 exemplifies one of them where two graphs containing the same word (or morphologically related words), here draw and drawing, used as different parts of speech can be related.</Paragraph>
    <Paragraph position="11"> Relation transitivity. Some relations, like part-of, in, from can be transitive. For example, we can map a graph that contains a concept A in a certain relation to concept B onto another graph where concept A is in the same relation with a part or a piece of B as exemplified in Figure 2. Transitivity in relations is in itself a challenging area of study (Cruse, 1986) and we have only begun to explore it.</Paragraph>
    <Section position="1" start_page="66" end_page="66" type="sub_section">
      <SectionTitle>
3.2 Maximal join
</SectionTitle>
      <Paragraph position="0"> The basic operation for the integration of temporary graphs is the maximal join operation where a union of two graphs is formed around their maximal common subgraph using the most specific concepts of each. We just saw how to relax the maximal common subgraph operation and we will perform the join around that &quot;relaxed&quot; subgraph. Figure 2 shows the result of the maximal join.</Paragraph>
      <Paragraph position="1"> The join operation allows us to bring new concepts into a graph by finding relations with existing concepts, as well as bringing new relations between existing concepts.</Paragraph>
    </Section>
    <Section position="2" start_page="66" end_page="66" type="sub_section">
      <SectionTitle>
3.3 Integration process
</SectionTitle>
      <Paragraph position="0"> Given the concept hierarchy, relation hierarchy and graph matching operations, we now describe the two major steps required to integrate all the temporary graphs into a CCKG.</Paragraph>
      <Paragraph position="1"> TRIGGER PHASE. Start with a central word, a keyword for the subject of interest that becomes the trigger word. The temporary graph built from the trigger word forms the initial CCKG. To expand its meaning, we want to look at the important concepts involved and use their respective temporary graphs to extend our initial graph. We deem words in the definition to be important if they have a large semantic weight.</Paragraph>
      <Paragraph position="2"> The semantic weight of a word or its informativeness can be related to its frequency (Resnik, 1995). Here, we calculate the number of occurrences of each word within the definitions of nouns and verbs in our dictionary. The most frequent word &quot;a&quot; occurs 2600 times among a total of 38000 word occurrences. Only 1% of the words occur more than 130 times, 5% occur more than 30 times but over 60% occur fewer than 5 times.</Paragraph>
      <Paragraph position="3"> Ordering the dictionary words in terms of decreasing number of occurrences, the top 10% of these words account for 75% of word occurrences.</Paragraph>
      <Paragraph position="4"> For our current investigation, we propose this as the division between semantically significant words and semantically insignificant ones. So a word from the dictionary is deemed to be semantically significant if it occurs fewer than 17 times. Note that constraining the number of semantically significant words is important in limiting the exploration process for constructing the concept cluster, as we shall soon see.</Paragraph>
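The frequency cut-off described above can be sketched in a few lines of Python. This is an illustration under stated assumptions: words are obtained by lowercasing and splitting on whitespace, and the function name significant_words and its max_count parameter are hypothetical. The default of 17 is the threshold given in the text; the second toy definition is invented.

```python
from collections import Counter


def significant_words(definitions, max_count=17):
    """Words occurring fewer than max_count times across all definitions."""
    counts = Counter(w for d in definitions for w in d.lower().split())
    return {w for w, n in counts.items() if max_count > n}


# Toy data: the first definition is quoted in the paper, the second is made up.
toy_definitions = [
    "A letter is a message you write on paper",
    "A message is a group of words",
]

# A tiny corpus needs a tiny threshold; 3 is chosen only for this example.
toy_significant = significant_words(toy_definitions, max_count=3)
print(sorted(toy_significant))
```

On a real dictionary the default threshold would apply; the point of the cut-off is to exclude high-frequency words such as "a" from driving the expansion.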
      <Paragraph position="5"> Trigger forward: Find the semantically significant words from the CCKG, and join their respective temporary graphs to the initial CCKG.</Paragraph>
      <Paragraph position="6"> Trigger backward: Find all the words in the dictionary that use the trigger word in their definition and join their respective temporary graphs to the CCKG.</Paragraph>
      <Paragraph position="7"> Instead of a single trigger word, we now have a cluster of words that are related through the CCKG. Those words form the concept cluster.</Paragraph>
      <Paragraph position="8"> EXPANSION PHASE. We try finding words in the dictionary containing many concepts identical to the ones already present in the CCKG but perhaps interacting through different relations allowing us to create additional links within the set of concepts present in the CCKG. Our goal is to create a more interconnected graph rather than sprouting from a particular concept. For this reason, we establish a graph matching threshold to decide whether we will join a new graph to the CCKG being built. We set this threshold empirically: the maximal common subgraph between the CCKG and the new temporary graph must contain at least three concepts connected through two relations.</Paragraph>
      <Paragraph position="9"> Expansion forward: For each semantically significant word in the CCKG, not already part of the concept cluster, find the maximal common subgraph between its temporary graph and the CCKG. If matching surpasses the graph matching threshold, perform integration (maximal join operation) and add the word to the concept cluster. Continue forward until no changes are made.</Paragraph>
      <Paragraph position="10"> Expansion backward: Find words in the dictionary whose definitions contain the semantically significant words from the concept cluster. For each possible new word, find the maximal common subgraph between its temporary graph and the CCKG. Again, if matching is over the graph matching threshold, perform integration and add the word to the concept cluster. Continue until no changes are made.</Paragraph>
      <Paragraph position="11"> We can set a limit to the number of steps in the expansion phase to ensure its termination. However, in practice, after two or three steps forward or backward, the maximal common subgraphs between the new graphs and CCKG do not exceed the graph matching threshold and thus are not added to the cluster, terminating the expansion.</Paragraph>
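The expansion forward step can be sketched as a loop over sets of triples. This is a simplified illustration, not the paper's implementation: the relaxed maximal common subgraph is replaced by exact triple intersection, the maximal join by set union, and the names common_subgraph, meets_threshold and expand_forward are hypothetical. The threshold test follows the text: at least three concepts connected through two relations. All triples in the example are invented.

```python
from itertools import combinations


def common_subgraph(g1, g2):
    """Exact-match stand-in for the relaxed maximal common subgraph."""
    return g1.intersection(g2)


def meets_threshold(common):
    """True if two shared triples are connected and cover three concepts."""
    for t1, t2 in combinations(sorted(common), 2):
        ends1, ends2 = {t1[0], t1[2]}, {t2[0], t2[2]}
        if ends1.intersection(ends2) and len(ends1.union(ends2)) >= 3:
            return True
    return False


def expand_forward(cckg, cluster, temp_graphs, significant, max_steps=3):
    """Join temporary graphs of significant words until nothing changes."""
    cckg, cluster = set(cckg), set(cluster)
    for step in range(max_steps):
        changed = False
        words = {c for source, rel, target in cckg for c in (source, target)}
        for word in sorted(words):
            if word in cluster or word not in significant:
                continue
            graph = temp_graphs.get(word, set())
            if meets_threshold(common_subgraph(cckg, graph)):
                cckg |= graph          # join simplified to a plain union
                cluster.add(word)
                changed = True
        if not changed:
            break
    return cckg, cluster


# Invented toy graphs around the trigger word "letter".
start = {("write", "object", "letter"), ("letter", "on", "paper"),
         ("send", "object", "letter")}
graphs = {"send": {("send", "object", "letter"), ("letter", "on", "paper"),
                   ("send", "through", "mail")}}
new_cckg, new_cluster = expand_forward(start, {"letter"}, graphs,
                                       significant={"send", "paper"})
print(sorted(new_cluster))
```

The max_steps argument plays the role of the step limit mentioned above; in this toy run the loop stops earlier because the second pass adds nothing.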
    </Section>
    <Section position="3" start_page="66" end_page="66" type="sub_section">
      <SectionTitle>
3.4 Example of integration
</SectionTitle>
      <Paragraph position="0"> Figure 3 shows the starting point of an integration process with the trigger word (TW) letter, its definition, its temporary graph (TG), the concept cluster (CC) containing only the trigger word, and the CCKG being the same as the temporary graph. Then we show the trigger forward phase.</Paragraph>
      <Paragraph position="1"> The number of occurrences (NOcc) of each word present in the definition of letter is given. Using the criteria described in the previous section, only the word message is a semantically significant word (SSW). We then see the definition of message, the new concept cluster and the resulting CCKG.</Paragraph>
      <Paragraph position="2"> The trigger backward phase would incorporate the temporary graphs for address, mail, post office and stamp. The expansion forward phase would further add the temporary graphs for the semantically significant words {send, package} during the first step and then would terminate with the second step, as no more semantically significant words not yet explored have a maximal common subgraph with the CCKG that exceeds the graph matching threshold. The expansion backward would finally add the temporary graphs for card and note, again terminating after two steps.</Paragraph>
      <Paragraph position="3">  The resulting cluster is: {letter, message, address, mail, post office, stamp, send, package, card, note}. The resulting CCKG shows the interaction between those concepts, which summarizes general knowledge about how we use those concepts together in a daily conversation: we go to the post office to mail letters or packages; we write letters, notes and cards to send to people through the mail, etc. Having such clusters and such knowledge of the relationship between words as part of our lexical knowledge base can be useful to understand or even generate a text containing the concepts involved in the cluster.</Paragraph>
      <Paragraph position="4"> STARTING POINT: TW: letter. Def: A letter is a message you write on paper. TG: same as CCKG.</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="66" end_page="66" type="metho">
    <SectionTitle>
4 Discussion
</SectionTitle>
    <Paragraph position="0"> Through this paper, we showed the multiple steps leading us to the building of Concept Clustering Knowledge Graphs (CCKGs). Those knowledge structures are built within the Lexical Knowledge Base (LKB), integrating multiple parts of the LKB around a particular concept to form a cluster and express the multiple relations among the words in that cluster. The CCKGs could be either permanent or temporary structures depending on the application using the LKB. For example, for a text understanding task, we can build beforehand the CCKGs corresponding to one or multiple keywords from the text. Once built, the CCKGs will help us in our comprehension and disambiguation of the text.</Paragraph>
    <Paragraph position="1"> By using the American Heritage First Dictionary as our source of lexical information, we were able to restrict our vocabulary to result in a project of reasonable size, dealing with general knowledge about day to day concepts and actions.</Paragraph>
    <Paragraph position="2"> The ideas explored using this dictionary can be extended to other dictionaries as well, but the task might become more complex as the definitions in adults' dictionaries are not as clear and usage oriented. In fact, an LKB built from a children's dictionary could be seen as a starting point from which we could extend our acquisition of knowledge using text corpora or other dictionaries. Certainly, if we envisage applications trying to understand children's stories or help in child education, a corpus of texts for children would be a good source of information to extend our LKB.</Paragraph>
    <Paragraph position="3"> The graph operations (maximal common subgraph and maximal join) defined on conceptual graphs, and adapted here, play an important role in our integration process toward a final CCKG.</Paragraph>
    <Paragraph position="4"> Graph matching was also suggested as an alternative to taxonomic search when trying to establish semantic similarity between concepts. As well, by putting a threshold on the graph matching process, we were able to limit the expansion of our clustering, as we can decide and justify the incorporation of a new concept into a particular cluster. Many aspects of the concept clustering and knowledge integration processes have already been implemented and it will soon be possible to test the techniques on different trigger words using different thresholds to see how they affect the quality of the clusters.</Paragraph>
    <Paragraph position="5"> Clustering is often seen as a statistical operation that puts together words &quot;somehow&quot; related. Here, we give a meaning to their clustering, we find and show the connections between concepts, and by doing so, we build more than a cluster of words. We build a knowledge graph where the concepts interact with each other, giving important implicit information that will be useful for Natural Language Processing tasks.</Paragraph>
  </Section>
  <Section position="5" start_page="66" end_page="66" type="metho">
    <SectionTitle>
5 Acknowledgments
</SectionTitle>
    <Paragraph position="0"> This research was supported by the Institute for Robotics and Intelligent Systems. The authors would like to thank the anonymous referees for their comments and suggestions, and Petr Kubon for his many comments on the paper.</Paragraph>
  </Section>
</Paper>