File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/93/w93-0310_intro.xml

Size: 2,380 bytes

Last Modified: 2025-10-06 14:05:30

<?xml version="1.0" standalone="yes"?>
<Paper uid="W93-0310">
  <Title>Computation of Word Associations Based on the Co-Occurences of Words in Large Corpora I</Title>
  <Section position="3" start_page="84" end_page="84" type="intro">
    <SectionTitle>
2 Model
</SectionTitle>
    <Paragraph position="0"> According to the law of association by contiguity the learning of associations can be described as follows: If two words i and j occur together, the association strength aid(t) between i and j is increased by a constant fraction of the difference between the maximum and the actual association strength. This leads for association strengths between 0 and 1 to the following formula:</Paragraph>
    <Paragraph position="2"> If word i occurs in another context, i. e. not in proximity to word j, the association strength between i and j is decreased by a constant fraction:</Paragraph>
    <Paragraph position="4"> Under the assumption that the learning rate 01 and the inhibition rate 02 are of identical size, the expected value aid of the association strength aid(t) from i to j for t ~ o~ is equal to the conditional probability of j given i (compare Foppa, 1965):</Paragraph>
    <Paragraph position="6"> From these assumptions it could be expected that a stimulus word i leads to those response j, for which the value of equation 3 is at a maximum.</Paragraph>
    <Paragraph position="7"> Rapp &amp; Wettler (1991) compared this with other predictions, where additional assumptions on learning and reproduction were taken into account. With equation 3, mainly words with high corpus frequencies, e. g. function words, were predicted as associative responses. The predictions were improved when the following formula was used with an exponent of a = 0.66, and the word with the highest rid was considered to be the associative response.</Paragraph>
    <Paragraph position="8"> p(jli) (4) = p(j)'-&amp;quot;7 The introduction of the denominator indicates that in the association experiment less frequent words are used than during language production. This inhibition of frequent words can be explained by the experimental situation, which furthers responses that are specific to the stimulus word. The exponential function can be interpreted as the tendency that subjective estimations are often found to be exponential functions of the quantities to be estimated.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML