File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/98/p98-2144_intro.xml
Size: 2,843 bytes
Last Modified: 2025-10-06 14:06:40
<?xml version="1.0" standalone="yes"?> <Paper uid="P98-2144"> <Title>HPSG-Style Underspecified Japanese Grammar with Wide Coverage</Title> <Section position="3" start_page="0" end_page="876" type="intro"> <SectionTitle> 2 Architecture of Japanese Grammar </SectionTitle> <Paragraph position="0"> In this section we describe the architecture of the HPSG-style Japanese grammar we have developed. In the HPSG framework, a grammar consists of (i) immediate dominance schemata (ID schemata), (ii) principles, and (iii) lexical entries (LEs). All of them are represented by typed feature structures (TFSs) (Carpenter, 1992), the fundamental data structures of HPSG. ID schemata, corresponding to rewriting rules in CFG, are significant for constructing syntactic structures. The details of our ID schemata are discussed in Section 2.1. Principles are constraints between mother and daughter feature structures. 4 LEs, which compose the lexicon, are detailed constraints on each word.</Paragraph> <Paragraph position="1"> In our grammar, we do not always assign LEs to each word. Instead, we assign lexical entry templates (LETs) to POSs. The details of our LEs and LETs are discussed in Section 2.2.</Paragraph> <Section position="1" start_page="876" end_page="876" type="sub_section"> <SectionTitle> 2.1 ID Schemata </SectionTitle> <Paragraph position="0"> Our grammar includes the 6 ID schemata shown in Table 1. Although they are similar to the ones used for English in standard HPSG, there is a fundamental difference in the treatment of relative clauses. Our grammar adopts the head-relative schema to treat relative clauses instead of the head-filler schema. More specifically, our grammar does not have SLASH features and does not use traces. Informally speaking, this is because SLASH features and traces are really necessary only when there are more than one verb between the head and the filler (e.g., Sentence (1)). But such sentences are rare in real-world corpora in Japanese. Just using a Head-relative schema makes our grammar simpler and thus less ambiguous.</Paragraph> <Paragraph position="1"> Basically, we assign LETs to POSs. For example, common nouns are assigned one LET, which has general constraints that they can be complements of predicates, that they can be a compound noun with other common nouns, and so on. However, we assign LEs to some single functional words which behave in a special way. For example, the verb 'suru' can be adjacent to some nouns unlike other ordinary verbs. The solution we have adopted is that we assign a special LE to the verb 'suru'.</Paragraph> <Paragraph position="2"> Our lexicon consists of 68 LEs for some functional words, and 63 LETs for POSs. A functional word is assigned one or more LEs, and a POS is also assigned one or more LETs.</Paragraph> </Section> </Section> class="xml-element"></Paper>