<?xml version="1.0" standalone="yes"?> <Paper uid="I05-3005"> <Title>Morphological features help POS tagging of unknown words across language varieties</Title> <Section position="11" start_page="38" end_page="38" type="concl"> <SectionTitle> 6 Conclusion </SectionTitle> <Paragraph position="0"> Previous research in part-of-speech tagging has produced taggers that perform well when the training set and test set are drawn from the same corpus.</Paragraph> <Paragraph position="1"> Unfortunately, for many potential real-world applications, such an arrangement is not possible.</Paragraph> <Paragraph position="2"> Our results show that sophisticated morphological features can help solve this robustness problem. These features would presumably also be applicable to other languages and NLP tasks that could benefit from morphological information. Besides these tagging results, our research provides valuable analytic results on the nature of unknown words cross-linguistically. Our finding that unknown words in Chinese are not proper nouns, as they are in English, but rather common nouns and verbs suggests a similarity to German. We suggest this is because both German and Chinese, despite their huge differences in genetic, areal, and other typological characteristics, tend to form unknown words through a similar word formation rule, compounding.</Paragraph> </Section> </Paper>