File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/93/p93-1041_concl.xml
Size: 1,426 bytes
Last Modified: 2025-10-06 13:57:04
<?xml version="1.0" standalone="yes"?> <Paper uid="P93-1041"> <Title>TEXT SEGMENTATION BASED ON SIMILARITY BETWEEN WORDS</Title> <Section position="6" start_page="287" end_page="287" type="concl"> <SectionTitle> CONCLUSION </SectionTitle> <Paragraph position="0"> This paper proposed LCP, an indicator of segment changing, which concentrates on lexical cohesion of a text segment. The experiment proved that LCP closely correlate with the segment boundaries captured by the human judgments, and that lexical cohesion plays main role in forming a sequence of words into segments.</Paragraph> <Paragraph position="1"> Text segmentation described here provides basic information for text understanding: * Resolving anaphora and ellipsis: Segment boundaries provide valuable restriction for determination of the referents.</Paragraph> <Paragraph position="2"> * Analyzing text structure: Segment boundaries can be considered as segment switching (push and pop) in hierarchical structure of text.</Paragraph> <Paragraph position="3"> The segmentation can be applied also to text summarizing. (Consider a list of average meaning of segments.) In future research, the author needs to examine validity of LCP for other genres -- Hearst (1993) segments expository texts. Incorporating other clues (e.g. cue phrases, tense and aspect, etc.) is also needed to make this segmentation method more robust.</Paragraph> </Section> class="xml-element"></Paper>