File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/86/c86-1067_abstr.xml
Size: 1,431 bytes
Last Modified: 2025-10-06 13:46:20
<?xml version="1.0" standalone="yes"?> <Paper uid="C86-1067"> <Title>A Kana-Kanji Translation System for Non-Segmented Input Sentences Based on Syntactic and Semantic Analysis</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> This paper presents a disambiguation approach for translating non-segmented-Kana into Kanji. The method consists of two steps. In the first step, an input sentence is analyzed morphologically and ambiguous morphemes are stored in a network form. In the second step, the best path, which is a string of morphemes, is selected by syntactic and semantic analysis based on case grammar.</Paragraph> <Paragraph position="1"> In order to avoid the combinatorial explosion of possible paths, the following heuristic search method is adopted.</Paragraph> <Paragraph position="2"> First, a path that contains the smallest number of weighted-morphemes is chosen as the quasi-best path by a best-first-search technique. Next, the restricted range of morphemes near the quasi-best path is extracted from the morpheme network to construct preferential paths.</Paragraph> <Paragraph position="3"> An experimental system incorporating large dictionaries has been developed and evaluated. A translation accracy of 90.5% was obtained. This can be improved to about 95% by optimizing the dictionaries.</Paragraph> </Section> class="xml-element"></Paper>