File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/w00-0725_abstr.xml
Size: 801 bytes
Last Modified: 2025-10-06 13:41:49
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-0725"> <Title>A Comparison of PCFG Models*</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> In this paper, we compare three different approaches to build a probabilistic context-free grammar for natural language parsing from a tree bank corpus: 1) a model that simply extracts the rules contained in the corpus and counts the number of occurrences of each rule 2) a model that also stores information about the parent node's category and, 3) a model that estimates the probabilities according to a generalized k-gram scheme with k -- 3. The last one allows for a faster parsing and decreases the perplexity of test samples.</Paragraph> </Section> class="xml-element"></Paper>