File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/06/e06-1014_abstr.xml
Size: 1,027 bytes
Last Modified: 2025-10-06 13:44:43
<?xml version="1.0" standalone="yes"?> <Paper uid="E06-1014"> <Title>Improving Probabilistic Latent Semantic Analysis with Principal Component Analysis</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> (PLSA) models have been shown to provide a better model for capturing polysemy and synonymy than Latent Semantic Analysis (LSA). However, the parameters of a PLSA model are trained using the Expectation Maximization (EM) algorithm, and as a result, the trained model is dependent on the initialization values so that performance can be highly variable.</Paragraph> <Paragraph position="1"> Inthispaper wepresent amethodforusing LSA analysis to initialize a PLSA model.</Paragraph> <Paragraph position="2"> We also investigated the performance of our method for the tasks of text segmentation and retrieval onpersonal-size corpora, and present results demonstrating the efficacy of our proposed approach.</Paragraph> </Section> class="xml-element"></Paper>