File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/06/w06-2103_concl.xml
Size: 2,275 bytes
Last Modified: 2025-10-06 13:55:41
<?xml version="1.0" standalone="yes"?> <Paper uid="W06-2103"> <Title>A Quantitative Approach to Preposition-Pronoun Contraction in Polish</Title> <Section position="5" start_page="21" end_page="21" type="concl"> <SectionTitle> 4 Summary and Outlook </SectionTitle> <Paragraph position="0"> In this paper, the current results of our ongoing corpus-based study on the distribution of prepositions and pronouns within Polish PPCs were presented. At this point, conclusions can be drawn that, according to corpus evidence, there seem to exist more pronominal forms being able to contract with prepositions than traditionally assumed.</Paragraph> <Paragraph position="1"> On the other hand, corpus data provide fewer prepositions contracting with pronouns than do Polish dictionaries. To verify these results for the purpose of a possible revision of the traditionally assumed in ectional paradigms of TPPPs, as well as for lexicographic purposes, a quantitative analysis was proposed which draws on the calculation and comparison of ratios of the total frequency of all accented postprepositional forms to the total frequency of their unaccented counterparts. The analysis will be completed within the next project phase.</Paragraph> <Paragraph position="2"> In future work, other corpora of Polish, such as the PWN Corpus of Polish11 or the PELCRA Corpus12 will be examined with respect to the distribution of pronouns and prepositions within PPCs, and the results will be compared with those achieved using the IPI PAN Corpus.13 Further on, meta data will be analyzed with respect to the dis- null pus has been provided to us by Magdalena Derwojedowa (personal communication). According to this list, the following PPCs appear in the PWN Corpus: dla*n 'for_TPPP', do*n 'to_TPPP', nade*n 'above_TPPP', na*n 'on_TPPP', ode*n 'from_TPPP', o*n 'above_TPPP', po*n 'after_TPPP', przede*n 'behind_TPPP', przeze*n 'by_TPPP', we*n 'in_TPPP', ze*n 'with_TPPP' / 'from_TPPP'.</Paragraph> <Paragraph position="3"> This set of PPCs does not fully correspond to that found of the IPI PAN Corpus. Thus, such a comparison seems to be reasonable.</Paragraph> <Paragraph position="4"> tribution of TPPPs. Finally, all results will be evaluated by human judges.</Paragraph> </Section> class="xml-element"></Paper>