File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/05/i05-3008_intro.xml

Size: 1,640 bytes

Last Modified: 2025-10-06 14:02:58

<?xml version="1.0" standalone="yes"?>
<Paper uid="I05-3008">
  <Title>Word Meaning Inducing via Character Ontology: A Survey on the Semantic Prediction of Chinese Two-Character Words Shu-Kai Hsieh Seminar f&amp;quot;ur Sprachwissenschaft</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1 Introduction
</SectionTitle>
    <Paragraph position="0"> This paper describes the theoretical consideration concerning with the interaction of ontology and morpho-semantics, and an NLP experiment is performed to do semantic class prediction of unknown two-character words based on the ontological and lexical knowledge of Chinese morphemic components of words (i.e., characters).</Paragraph>
    <Paragraph position="1"> The task that the semantic predictor (or classifier) performs is to automatically assign the (predefined) semantic thesaurus classes to the unknown two-character words of Chinese.</Paragraph>
    <Paragraph position="2"> Among these types of unknown words, Chen and Chen (2000) pointed out that compound words constitute the most productive type of unknown words in Chinese texts. However, the caveat at this point should be carefully formulated, due to the fact that there are no unequivocal opinions concerning with some basic theoretical settings in Chinese morphology. The notion of word, morpheme and compounding are not exactly in accord with the definition common within the theoretical setting of Western morphology. To avoid unnecessary misunderstanding, the pre-theoretical term two-character words will be mostly used instead of compound words in this paper.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML