File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/84/p84-1011_metho.xml
Size: 8,607 bytes
Last Modified: 2025-10-06 14:11:38
<?xml version="1.0" standalone="yes"?> <Paper uid="P84-1011"> <Title>with Taigen as its Core, Working Group Report of Natural Language Processing in Information</Title> <Section position="2" start_page="0" end_page="42" type="metho"> <SectionTitle> 2. Coac_~pt of_~_Deoendencv Structure based on Case Gramma\[_/n Jap_a_D~ </SectionTitle> <Paragraph position="0"> In Japan, we have come to the conclusion that case grammar is most suitable grammar for Japanese syntactic analysis for machine translation systems. This type of grammar had been proposed and studied by Japanese linguists before Fillmore's presentation.</Paragraph> <Paragraph position="1"> As word order is heavily restricted in English syntax, ATNG~Augmented Transition Network Grammar) based on CFG~Context Free Grammar ) is adequate for syntactic analysis in English. On the other hand, Japanese word order is almost unrestricted and K~l!,jlio--shi play an important role as case labels in Japanese sentences. Therefore case grammar is the most effective grammar for Japanese syntactic analysis.</Paragraph> <Paragraph position="2"> In Japanese syntactic structure, the word order is free except for a predicate(verb or verb phrase) located at the end of a sentence. In case grammar, the verb plays a very important role during syntactic analysis, and the other parts of speech only perform in partnership with, and equally subordinate to. the verb.</Paragraph> <Paragraph position="3"> That is. syntactic analysis proceeds by checking the semantic compatibility between verb and nouns. Consequently. the semantic structure of a sentence can be extracted at the same time as syntactic analysis.</Paragraph> <Paragraph position="4"> 3. __ca.$_e_Er ame .~oYer n~ed ..by_ J:hu~/C_ll The case frame governed by !_bAag_<tn and having l~/_~Luio:~hi, case label and semantic markers for&quot; nouns is analyzed here to illustrate how we apply case grmlmlar to Japanese syntactic analysis in our system.</Paragraph> <Paragraph position="5"> }i~ff.TCil consists of vet b.</Paragraph> <Paragraph position="6"> ~'~9ou _.s'hi ~adjec:tive and L<Cigo~!d()!#_mh~ adjectival noun.. L~bkujo ,~hi include inner case and outer' case markers in Japanese syntax. But a single Iqol,'ujo ~/l; corresi:~ond.~ to several deep cases: for instance, &quot;.\'I&quot; indicates more than ten case labels including SPAce. Sp~:ee TO. TIMe, ROl,e, MARu,-:I . GOAl. PARtr,cu'. COl'~i,or~ent. CONdition. 9ANge ...... We analyze re\]atioP,<; br:twu,::n \[<~,kuj~, ,>hi anH cas,:, labels and wr.i..i,c thcii~ out, manu~,l\]y acc,.:,idii~, t,:, the ex~_m,;:\]e.s fotmd o;;t ill samr, te texts.</Paragraph> <Paragraph position="7"> ...................................</Paragraph> <Paragraph position="8"> * This project is being carried out with the aid of a specia\], gro~H for the promotion of scien,:.c ah,! technology from the Science and Techno\]ogy Agency of the Japane:ze GovoYf~: ~,t. As a result of categorizing deep cases, 33 Japanese case labels have been determined as shown in Table I.</Paragraph> <Paragraph position="9"> (13) h~ * $~.,~., Space-TO (14&quot;) hP~ - ~ Space-THrough (15) ~Z~ ~.~, SOUrce (16) ~,~,~. GOAl (17) \[~ ATTribute (18) ~.{:~ * iz~ CAUse (19) ~ * ii~. ~. TOO~ (20) $~ MATerial (21) f~ ~- '~ COMponent (22) 7\]~ MANner (23) ~= CONdition (24) ~\] ~ PURPOse (25) {~J ROLe (26) \[-~ ~ ~.~ COnTent (27) i~ \[~l ~. ~ RANge (28) ~ TOPic (29) \[Lg...~,, VIEwpoint (30) ,L'~ tt~ COmpaRison (32) ~ DEGree 5%~/~-@. 3 ~0@-~/-,5 (33l P~\]~ '~ PREdicative ~ &quot;~,.~ 8 Note: The capitalized letters form English acronym for that case label.</Paragraph> <Paragraph position="10"> the When semantic markers are recorded for nouns in the verbal case frames, each noun appearing in relation to l/2u(~'n and Kclkuio-shi in the sample text is referred to the noun lexicon.</Paragraph> <Paragraph position="11"> The process of describing these case frames for lexicon entry are given in Figure \].</Paragraph> <Paragraph position="12"> For each verb, l<ctkuio-Mtt and Keiuoudoi~-_.shi, Koktuo-shi and case labels able to accompany the verb are described, and the semantic marker for the noun which exist antecedent to that Kokuio-shL are described.</Paragraph> </Section> <Section position="3" start_page="42" end_page="43" type="metho"> <SectionTitle> 4. Sub-cat~or_ies of Parts of SDeech </SectionTitle> <Paragraph position="0"> accordiDg to their Syntactic Features The parts of speech are classified into 13 main categories: nouns, pronouns, numerals, affixes, adverbs. verbs. ~eiy_ou--~h~. Ke~uoudou-shi.</Paragraph> <Paragraph position="1"> Renlcli-shii~adnoun), conjunctions, auxiliary verbs, markers and ./o~shi(postpositional particles;. Each category is sub-classified and divided into 56 sub-categories(see Appendix A); those which are mainly based on syntactic features, and additionally on semantic features.</Paragraph> <Paragraph position="2"> For example, nouns are divided into 11 sub-categories; proper nouns, common nouns, action nouns I (S~!tC~!--~jc i sh i ), action nouns 2 (others }. adverbial nouns. ~bk:+-tio-shi-teki-i,~ishi (noun with case feature ~, ~l~:okuio-shi-teki-i~i~hi (noun with conjunction feature), unknown nouns, mathematical expressions, special symbols and complementizers. Action nouns are classified into ,~lhc(~-mc'ishi ia noun that can be a noun-plus-St~U,,doing> composite verb) and other verbal nouns, because action noun \] is also used as the word stem of a verb.</Paragraph> <Paragraph position="3"> Adverbs are divided into 4 sub-categories for modality , aspect and tense. In Japanese, the adverb agrees with the auxiliary verb.</Paragraph> <Paragraph position="4"> C~in~utsu-futu-shi agrees with aspect, tense and mood features of specific auxiliary verb, Joukuou-fz~u-shi agrees with aspect and tense, Teido-fuku-shi agrees with gradability.</Paragraph> <Paragraph position="5"> Auxiliary verbs are divided into 5 sub-catagories based on modality, aspect, voice, cleft sentence and others.</Paragraph> <Paragraph position="6"> Verbs may be classified according to their case frames and therefore it is not necessary to sub-classify their sub-categories.</Paragraph> </Section> <Section position="4" start_page="43" end_page="43" type="metho"> <SectionTitle> 5. Semantic Markimz of Nouna </SectionTitle> <Paragraph position="0"> We analyze semantic features, and assign semantic markers to Japanese words classified as nouns and pronouns. Each word can give five possible semantic markers.</Paragraph> <Paragraph position="1"> The system of semantic markers for nouns is made up of tO conceptual facets based on 44 semantic slots, and 38 plural filial slots at the</Paragraph> <Paragraph position="3"/> <Section position="1" start_page="43" end_page="43" type="sub_section"> <SectionTitle> 5.1 Concept of semantic markers </SectionTitle> <Paragraph position="0"> The tO conceptual facets are listed below.</Paragraph> <Paragraph position="1"> I) Thing or Object This conceptual facet contains things and objects; that is, actual concrete matter. This facet consists of such semantic slots as Nation/Organization, Animate object, Inanimate object, etc.</Paragraph> <Paragraph position="2"> 2) Commodity or Ware This conceptual facet contains commodity and wares; that is, artificial matter useful to humans. This facet consists of such semantic slots as Material. Means/Equipment, Product. etc.</Paragraph> </Section> </Section> <Section position="5" start_page="43" end_page="43" type="metho"> <SectionTitle> 3) Idea or Abstraction </SectionTitle> <Paragraph position="0"> This conceptual facet contains ideas and abstractions: that is. non-matter as the result of intellectual activity in the human brain. This facet contsists of such semantic slots as Theory, Conceptual object. Sign/Symbol, etc.</Paragraph> </Section> <Section position="6" start_page="43" end_page="44" type="metho"> <SectionTitle> 4) Part </SectionTitle> <Paragraph position="0"> This conceptual facet contains parts: that is, structural parts, elements and contents of things and matter.</Paragraph> <Paragraph position="2"/> </Section> <Section position="7" start_page="44" end_page="44" type="metho"> <SectionTitle> 5 Attribute </SectionTitle> <Paragraph position="0"> This conceptual facet contains attributes: that is, properties, qualities or features representative of things. This facet consists of semantic slots such as Property Characteristic.</Paragraph> <Paragraph position="1"> Status Figure, Relation, Structure, etc.</Paragraph> </Section> class="xml-element"></Paper>