File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/00/c00-2100_abstr.xml
Size: 846 bytes
Last Modified: 2025-10-06 13:41:38
<?xml version="1.0" standalone="yes"?> <Paper uid="C00-2100"> <Title>Automatic Extraction of Subcategorization Frames for Czech*</Title> <Section position="2" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We present some novel nmchine learning techniques for the identilication of subcategorization infornmtion for verbs in Czech. We compare three different statistical techniques applied to this problem. We show how the learning algorithm can be used to discover previously unknown subcategorization frames from the Czech Prague 1)ependency Treebank. The algorithm can then be used to label dependents of a verb in the Czech treebank as either arguments or adjuncts. Using our techniques, we are able to achieve 88% precision on unseen parsed text.</Paragraph> </Section> class="xml-element"></Paper>