File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/05/p05-1077_abstr.xml

Size: 659 bytes

Last Modified: 2025-10-06 13:44:28

<?xml version="1.0" standalone="yes"?>
<Paper uid="P05-1077">
  <Title>Randomized Algorithms and NLP: Using Locality Sensitive Hash Function for High Speed Noun Clustering</Title>
  <Section position="2" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> In this paper, we explore the power of randomized algorithms to address the challenge of working with very large amounts of data. We apply these algorithms to generate noun similarity lists from 70 million pages. We reduce the running time from quadratic to practically linear in the number of elements to be computed.</Paragraph>
  </Section>
</Paper>