File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/86/c86-1091_intro.xml
Size: 2,421 bytes
Last Modified: 2025-10-06 14:04:34
<?xml version="1.0" standalone="yes"?> <Paper uid="C86-1091"> <Title>Towards the Automatic Acquisition of Lexical Data</Title> <Section position="3" start_page="0" end_page="387" type="intro"> <SectionTitle> 2. The Morphologic Classification Scheme </SectionTitle> <Paragraph position="0"> Out: classification schem(; is based primarily on the sets el suffixes that call be attached to certain stems.</Paragraph> <Paragraph position="1"> Every different set: constitutes a morphological class, and every lexicon entry Calls exactly into one of these classes. ALtogether there are about 70 different ones.</Paragraph> <Paragraph position="2"> For each class two lists are stored: One contaJnLng the set of suffixes belonging to the class, and another one containing the syntactic interpretation for each oF these suffixes. Super\] reposed on this scheme is information about 'umlaut' and prefix 'ge'. rPhey occult oii\] y J n a few pos i tions, dependi ng on the word cat_egory. \]):very possJ b\]e comb\] nation \] s represented by a col: ta i n hUmOr ( call vn I uo stored a\] ong wi th each \] oxJ con entry.</Paragraph> <Paragraph position="3"> We di st\] rlgni sh between morpho\] ogic and syntactic Jnfornlation, the latter being a CeatLlrO e\[ the \]exeme itself (and not expressed by inflection). Mnrphologic information consists of the Following features:</Paragraph> <Paragraph position="5"> 'Phe syntactic inIYormation is stored in the feature SY.</Paragraph> <Paragraph position="6"> it consists of the following data: word category (verb, noun, pronoun, etc. ) - gender o \[ nouns subcategory (auxJlJa17y, modal, proper name, etc.) - case (for prepositions) - auxiliary for present and past: pel:foct ('habon' or 'soin' ) - separab\]e vorbadjuncts This Jnformatlon is coded into a number, the Ci~sL digit roprosent:ing the word category, the other ones depending on Jt (e.g. gender only for nouns). As an example Jolt's look at: the entries \[7oE the verb 'geben' (to give). Three forms are to be considered, 'gob' Js the stem for present tense and PPP, 'glb' ffor 2nd and 3rd person sg present tense indicative, and 'gab' for past tense. The correspond\] ng dJ eti onary entries have the following form: The suffix list g~ves the possible endings of the words in class 22, the interpretation \].ist gives tile (:ode of a\]\] forms expressed by ally one of these endings.</Paragraph> </Section> class="xml-element"></Paper>