File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/04/w04-2706_concl.xml
Size: 2,377 bytes
Last Modified: 2025-10-06 13:54:26
<?xml version="1.0" standalone="yes"?> <Paper uid="W04-2706"> <Title>Deep Syntactic Annotation: Tectogrammatical Representation and Beyond</Title> <Section position="4" start_page="0" end_page="0" type="concl"> <SectionTitle> 3 Conclusions </SectionTitle> <Paragraph position="0"> Up to now, the framework has been checked on a large amount of running text segments from the Czech National Corpus (as for the valency classification 55,000 utterances, as for the topic-focus structure 20,000 ones). In several cases, it was found that a more detailed classification is needed (e.g. with the differentiation of the General Actor vs. Unspecified, cf. the difference between One can cook well with this oven and At this pub they cook well). However, it has been confirmed that good results can be achieved with the chosen classification of about 40 valency types and of 15 other grammatical attribute types (such as (Semantic) Number, Tense, Modalities, etc., but also different values of Location, such as those corresponding to the preferred functions of in, at, on, under, over, etc., or of Benefactive (positive vs. negative), and so on). It can be supposed that the core of language corresponds to underlying sentence structures and to their unmarked morphemic and phonemic counterparts. The marked layers have to be described by specific sets of rules, most of which concern irregularities of morphemics, including differences between the underlying order of nodes and the surface (morphemic) word order, especially in cases in which the latter does not directly meet the condition of projectivity (with no crossing of edges, cf. the discontinuous constituents of other frameworks).</Paragraph> <Paragraph position="1"> The prototypical varieties of sentence structure can thus be characterized by projective rooted trees, which points to the possibility to describe the core of language structure on the basis of a maximally perspicuous pattern that comes close to patterns present in other domains (primitive logic, arithmetics, and so on) which are normally mastered by children. Structures of this kind are not only appropriate for computer implementation, but they also help understand the relative easiness of mastering the mother tongue, without a necessity to assume complex innate mechanisms specific for the language faculty. null</Paragraph> </Section> class="xml-element"></Paper>