File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/80/c80-1015_metho.xml

Size: 13,718 bytes

Last Modified: 2025-10-06 14:11:19

<?xml version="1.0" standalone="yes"?>
<Paper uid="C80-1015">
  <Title>A MODEL OF NATURAL LANGUAGE PROCESSING OF TIME-RELATED EXPRESSIONS</Title>
  <Section position="1" start_page="0" end_page="0" type="metho">
    <SectionTitle>
A MODEL OF NATURAL LANGUAGE PROCESSING
OF TIME-RELATED EXPRESSIONS
Yutaka Kusanagi
UNIVERSITY OF TSUKUBA
SAKURA-MURA, IBARAKI, JAPAN
</SectionTitle>
    <Paragraph position="0"> This paper proposes a model of automatic processing of time-related expressions by introducing the notion of focus in the cognition level. The linguistic categories of time are determined on the basis of the relationships among the time of an extra-linguistic situation, the focused time and the time of utterance.</Paragraph>
  </Section>
  <Section position="2" start_page="0" end_page="0" type="metho">
    <SectionTitle>
INTRODUCTION
</SectionTitle>
    <Paragraph position="0"> It is generally understood that there is no one-to-one correspondence between the extra-linguistic (real world) time of situations and the grammatical expressions of time in sentences of natural languages. If there is no correspondence of the extra-linguistic situation to the linguistic expressions, it is impossible to simulate this process by a computer model. This paper proposes a model of automatic processing of time-related expressions by introducing the notion of focus in the cognition level between the extra-linguistic time of situations and the linguistic expressions of time.</Paragraph>
    <Paragraph position="1"> As well, it points the way toward the computer-realization of the model. The computer programming language used for this model is SNOBOL.</Paragraph>
    <Paragraph position="2"> Failure of correspondence between an extra-linguistic situation and linguistic expressions of the time concept can easily be demonstrated in most languages, if not all. For example, suppose the extra-linguistic situation is the speaker's writing during a certain period in the past (say, between two and three o'clock yesterday). In reference to it, we can use any of the following sentences.</Paragraph>
    <Paragraph position="3">  (i) I wrote a letter.</Paragraph>
    <Paragraph position="4"> (2) I was writing a letter.</Paragraph>
    <Paragraph position="5"> (3) I have written a letter.</Paragraph>
    <Paragraph position="6"> (4) I had written a letter (when John came).</Paragraph>
    <Paragraph position="7">  The differences among sentences (i) through (4) are attributed to the speaker's different point of view.</Paragraph>
    <Paragraph position="8"> Comri@ mentions that 'aspects are different ways Qf viewing the internal temporal constituenc\[ ~ of situations.' The speaker's way of viewing, however, affects not only his choice of aspects but also his choice of tenses. Note the following examples from Japanese. (5) ah sokoni aru.</Paragraph>
    <Paragraph position="9"> 'There it isI' (6) ah sokoni atta.</Paragraph>
    <Paragraph position="10"> 'There it was (all the time)!' There is no difference in the denotative meaning of sentences (5) and (6). Both sentences can be used when someone has found something he has been looking for. Whether the speaker uses sentence (5) or sentence (6) depends, again, on his view point.</Paragraph>
  </Section>
  <Section position="3" start_page="0" end_page="0" type="metho">
    <SectionTitle>
FOCUS
</SectionTitle>
    <Paragraph position="0"> The difference in the speaker's view-point, or that of cognition, cannot be ignored, as seen in the above examples, when we consider the time-related expressions of natural language. Physical events in the extra-linguistic world present themselves as sense data, and are perceived not passively, but actively by an intervention of mind, and are subsequently stored in the momory as experiences. The experiences represent the material that can be formulated and articulated as the linguistic forms.</Paragraph>
    <Paragraph position="1"> We believe that unless we incorporate cognition into the process of generating linguistic expressions we cannot explain the relationship between the extra-linguistic situations and the linguistic expressions. And we consider the notion of focus the most important cognitive element in connection with time-related expressions. 'Focus' is defined as the period along the extra-linguistic time line upon which the speaker directs his attention.</Paragraph>
  </Section>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
MODEL
</SectionTitle>
    <Paragraph position="0"> By incorporating the speaker's focus into a model we can generate the time-related expressions rather -101 -automatically, that is, by one-to-one correspondence, and therefore it is possible to implement this model as a computer model to generate linguistic expressions.</Paragraph>
    <Paragraph position="1"> For the model we consider three levels; that is, i. the extra-linguistic (real world) level,  2. the cognition level, and 3. the linguistic level.</Paragraph>
    <Paragraph position="2">  Information on extra-linguistic situations observed by the speaker is stored in the extra-linguistic level in the form of the situation and its existing period of time. The speaker's focus in the cognition level is matched against the situation(s) in the extra-linguistic level and the way they match or do not match determines the selection of linguistic items such as verbs, adjectives, negation, aspect and tense.</Paragraph>
    <Paragraph position="3"> The linguistic level forms the sentences by syntactic, morphological and phonological rules.</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
EXTRA-LINGUISTIC LEVEL
</SectionTitle>
    <Paragraph position="0"> The computer model under consideration simulates this process by our creating three levels here also. The extra-linguistic level stores information on situations in the following form (Nos. 2-9 of the appended program).</Paragraph>
  </Section>
  <Section position="6" start_page="0" end_page="0" type="metho">
    <SectionTitle>
SITUATION START-DS END-DS END-SS
</SectionTitle>
    <Paragraph position="0"> The information includes situation in the form of verbal expressions and their arguments, the starting time of dynamic situations, the ending time of dynamic situations, and the ending time of static situations. In the number for the time, the first two digits stand for the month, the next two for the day, the next two for the hour, and the last two for the minute. This number system may be expanded to include the year, the second, etc.</Paragraph>
    <Paragraph position="1"> Extra-linguistic situations are divided into two categories; dynamic and static. Fox the distinction, let us quote LyonsJ(p. 483) .</Paragraph>
    <Paragraph position="2"> A static situation (or state-ofaffairs, or state) is one that is conceived of as existing, rather than happening, and as being homogeneous, continuous and unchanging throughout its duration. A dynamic situation, on the other hand, is something that happens (or occurs, or takes place): it may be momentary or enduring; it is not necessarily either homogeneous or continuous, but may have any of several temporal contours;...</Paragraph>
    <Paragraph position="3"> Dynamic situations are in turn divided into two types. Those such as 'reading' are situations occurring for a certain period of time. Therefore situations of this type have a starting time and an ending time. The second type of dynamic situation, like '(a light) going on', is momentary, and therefore it is understood that the starting time and the ending time are identical.</Paragraph>
    <Paragraph position="4"> Static situations, on the other hand, continue for a certain period of time. But they may be results of dynamic situations and they may also change into other situations again by dynamic situations. The starting time of a static situation is usually the ending time of a dynamic situation and its ending time, if any, is the starting time of another dynamic situation.</Paragraph>
    <Paragraph position="5"> It is worth mentioning here that what are stored in this level are not linguistic expressions but situations.</Paragraph>
    <Paragraph position="6"> Therefore, the time period of an existing situation is not influenced by the type of particular linguistic expressions of a particular natural language. Taking an English expression for example, '(a light) goes on' is considered to be momentary event.</Paragraph>
    <Paragraph position="7"> Therefore, we cannot say '~a light is going on.' However, 'lights are going on' is possible, in which case we can identify a different starting time and ending time. Further, if we could observe the period of duration of a light going on by a high-speed film for instance, the different starting and ending times could be recorded in the data by expanding time digits to second or millisecond.</Paragraph>
    <Paragraph position="8"> However, there are some phenomena peculiar to individual languages. For instance, the situation of 'coming' is  --102-considered to have a time duration in English. Therefore, 'John is coming' is a paraphrase of 'John is on the way.' But in Japanese it is considered to be momentary, that is, 'coming' is a sort of switch from 'is not here' to 'is here'. Therefore there is not Japanese expression equivalent to English 'John is coming.' Theoretically, this kind of difference among individual languages should be taken care of by elaborating the structure of situations in the data of this level, but in the current model it is taken care of by the linguistic level.</Paragraph>
  </Section>
  <Section position="7" start_page="0" end_page="0" type="metho">
    <SectionTitle>
COGNITION LEVEL
</SectionTitle>
    <Paragraph position="0"> Now in the cognition level, the focused situation and the period of focus as well as the time of utterance are inserted into the program (Program Nos. 10-27). The focused situation is in the form of verbal expression and its arguments. The period of focus is in the form of either a time adverbial such as 'now', 'this morning,, 'yesterday' and 'last month', or numbers for chronological time points similar to those of extra-linguistic situation time. When the input is a time adverbial, the program calculates it against the time of utterance and translates it into chronological numbers (Program Nos. 28-40).</Paragraph>
    <Paragraph position="1"> So far, data stored in the pro- null gram are: i. Extra-linguistic situations, each comprising: a. the starting time-point of a dynamic situation(A), b. the ending time-point of a dynamic situation(B), c. the ending time-point of a static situation(C); 2. Time of utterance(O): 3. Focused situation, comprising: a. the starting time-point of focus(S), and b. the ending time-point of  focus(E).</Paragraph>
    <Paragraph position="2"> Here it is checked whether a particular extra-linguistic situation exists within the focused period or moment and, if so, how it is focused. On the basis of the relationship among the three kinds of time-points of an extra-linguistic situation (A,B and C), and the two kinds of focus time-points of the same focused situation (S and E), the cagegories of aspects are determined (Program Nos. 41-62).</Paragraph>
    <Paragraph position="3"> The categories of aspects are determined in the following way:  and if the focus is on the point, and if the focus is completely inside the SS, then: STATIVE, otherwise, the Dynamic Situation(DS) is checked.</Paragraph>
    <Paragraph position="4"> If it is otherwise, and if the SS overlaps the focus completely, then: STATIVE, otherwise, the DS is checked.</Paragraph>
    <Paragraph position="5">  II. If the focus is completely outside the DS, then: NEGATIVE.</Paragraph>
    <Paragraph position="6"> III.</Paragraph>
    <Paragraph position="7"> 1.</Paragraph>
    <Paragraph position="8"> If the focus is on a point, and if the DS is on a point, then: PERFECTIVE.</Paragraph>
    <Paragraph position="9"> 2. If it is otherwise, and a. if the focus is on the beginning of the DS, then: PERFECTIVE (INGRESSIVE), b. if the focus is on the ending of the DS, then: PERFECTIVE (COMPLETED), c. if neither of the above, then: IMPERFECTIVE.</Paragraph>
    <Paragraph position="10"> IV.</Paragraph>
    <Paragraph position="11"> i.</Paragraph>
    <Paragraph position="12"> 2.</Paragraph>
    <Paragraph position="13"> adeg b.</Paragraph>
    <Paragraph position="14"> i.</Paragraph>
    <Paragraph position="15"> ii.</Paragraph>
    <Paragraph position="16">  iii.</Paragraph>
    <Paragraph position="17"> If the focus is on a period, and if the DS is on a point, then: PERFECTIVE.</Paragraph>
    <Paragraph position="18"> If it is otherwise, and the DS overlaps the focus completely, then: IMPERFECTIVE, if the DS overlaps the focus partially, and the beginning of the DS is on or before the ending of the focus, then: PERFECTIVE (INGRESSIVE), the ending of the DS is on or before the ending of the focus, then: PERFECTIVE (COMPLETED), it is neither of the above, then: PERFECTIVE.</Paragraph>
    <Paragraph position="19"> This aspect assignment is illustrated in the diagrams in the Appendix I. The categories of tenses are in turn determined on the basis of the relationship between the focused time and the time of utterance (Program Nos. 63-69). If the focus is on the time of utterance, the PRESENT is assigned. If the focus is before the time of utterance, the PAST is assigned~ if the reverse, the FUTURE. Japanese and Russian, for example, work in this way. In English one more  --103-category of tense is needed. That is, if the focus is on a period and the end of the focus coincides with the time of utterance, the PAST-PRESENT is assigned. This category will later cover expressions like 'I have been studying linguistics.'</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML