File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/94/w94-0103_abstr.xml
Size: 1,647 bytes
Last Modified: 2025-10-06 13:48:17
<?xml version="1.0" standalone="yes"?> <Paper uid="W94-0103"> <Title>References</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> The title of this paper playfully contrasts two rather different approaches to language analysis. The &quot;Noisy Channel&quot; 's are the promoters of statistically based approaches to language learning. Many of these studies are based on the Shannons's Noisy Channel model. The &quot;Braying Donkey&quot; 's are those oriented towards theoretically motivated language models. They are interested in any type of language expressions (such as the famous &quot;Donkey Sentences&quot;), regardless of their frequency in real language, because the focus is the study of human communication.</Paragraph> <Paragraph position="1"> In the past few years, we supported a more balanced approach. While our major concern is applicability to real NLP systems, we think that, after aLl, quantitative methods in Computational Linguistic should provide not only practical tools for language processing, but also some linguistic insight.</Paragraph> <Paragraph position="2"> Since, for sake of space, in this paper we cannot give any complete account of our research, we will present examples of &quot;linguistically appealing&quot;, automatically acquired, lexical data (selectional restrictions of words) obtained trough an integrated use of knowledge-based and statistical techniques. We discuss the pros and cons of adding symbolic knowledge to the corpus linguistic recipe.</Paragraph> </Section> class="xml-element"></Paper>