File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/02/w02-1115_concl.xml
Size: 1,158 bytes
Last Modified: 2025-10-06 13:53:26
<?xml version="1.0" standalone="yes"?> <Paper uid="W02-1115"> <Title>Selecting the Most Highly Correlated Pairs within a Large Vocabulary</Title> <Section position="9" start_page="0" end_page="0" type="concl"> <SectionTitle> 13 Conclusion </SectionTitle> <Paragraph position="0"> This paper describes a method for selecting correlated pairs in a0a8a1a4a3a15a14 memory space and a0a2a1a4a3 a5 a7a10a9a12a11 a1a4a3a17a14a16a14 computation time, where a3 is the number of documents in a corpus, provided that there is an upper boundary in the number of different words/labels in one document/record. We have observed that a corpus usually has this kind of upper boundary, and have shown that we can uses a sequential file for most of our memory requirements.</Paragraph> <Paragraph position="1"> This method is useful not only for confidence but also for other functions whose values are decided by a21a23a22 a24 , a21a29a22a25a28 , a21a23a22 a30 , a21a23a22a12a32 . Examples of these functions are mutual information, the dice coefficient, the confidence measure, the phi coefficient and the complimentary similarity measure.</Paragraph> </Section> class="xml-element"></Paper>