File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/ackno/00/w00-0901_ackno.xml
Size: 1,262 bytes
Last Modified: 2025-10-06 13:50:03
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-0901"> <Title>Comparing Corpora using Frequency Profiling</Title> <Section position="7" start_page="4" end_page="4" type="ackno"> <SectionTitle> Acknowledgements </SectionTitle> <Paragraph position="0"> Our thanks go to Geoffrey Leech and the anonymous reviewers who commented on earlier versions of this paper. The REVERE project is supported under the EPSRC Systems Engineering for Business Process Change (SEBPC) programme, project number GR/MO4846.</Paragraph> <Paragraph position="1"> This paper has described a method of comparing corpora which uses frequency profiling. The method has been shown to discover key items in the corpora which differentiate one corpus from another. It has been applied at the word level, part-of-speech tag level, and semantic tag level. It can be used as a quick way in to find the differences between the corpora and is shown to have applications in the study of social differentiation in the use of English vocabulary: profiling of learner English and document analysis in the software engineering process.</Paragraph> <Paragraph position="2"> Future directions in which we aim to research include a more precise specification of the</Paragraph> </Section> class="xml-element"></Paper>