File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/00/c00-2119_concl.xml
Size: 2,177 bytes
Last Modified: 2025-10-06 13:52:44
<?xml version="1.0" standalone="yes"?> <Paper uid="C00-2119"> <Title>Using a Broad-Coverage Parser for Word-Breaking in Japanese</Title> <Section position="3" start_page="823" end_page="826" type="concl"> <SectionTitle> 5. Conclusion </SectionTitle> <Paragraph position="0"> We have shown that a practical, broad-coverage parser can be implemented without requiring the word-breaking component to return a single segmentation analysis, and that it can at the same time achieve high accuracy in POS-labeled word-breaking. Separating the tasks of word ident~/'l'cation and best sequence selection offers flexibility in enhancing both recall and precision without sacrificing either at the cost of the other. Our results show that morphological and syntactic information alone can resolve most word-breaking ambiguities. Nonetheless, some ambiguities require semantic and contextual information. For example, the following sentence allows two parses corresponding to two word-breaking analyses, of which the first is semantically preferred: (1) ocha-ni haitte-irtt arukaroido tea-in contain-ASP alkaloid &quot;the alkaloid contained in lea&quot; (2) ocha-ni-ha itte-iru arukatwido tea-in-TOP go-ASP alkaloid * ? the alkaloid that has gone to the tea&quot; Likewise, the sentence below allows two different interpretations of the morpheme de, either as a locative marker (1) or as a copula (2). Both interpretations are syntactically and semantically wflid; only contextual information can resolve the ambiguity.</Paragraph> <Paragraph position="1"> (1) minen-ha isuraeru-de aru next year-TOP Israel-LOC be-held &quot;It will be held in Israel next year&quot;. (2) rainen-ha isuraeru de-artt next year-TOP Israel be-PP, ES &quot;It will be Israel next year&quot;.</Paragraph> <Paragraph position="2"> In both these sentences, we create syntactic trees for all syntactically valid interpretations, leaving the ambiguity intact. Such ambiguities can only be resolved with semantic and contextual information eventually made available by higher processing components. This will be Ihe focus of our ongoing rese.arclt.</Paragraph> </Section> class="xml-element"></Paper>