File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/91/e91-1038_concl.xml
Size: 2,965 bytes
Last Modified: 2025-10-06 13:56:39
<?xml version="1.0" standalone="yes"?> <Paper uid="E91-1038"> <Title>The Semantics of Collocational Patterns for Reporting Verbs</Title> <Section position="6" start_page="0" end_page="0" type="concl"> <SectionTitle> VI Summary </SectionTitle> <Paragraph position="0"> Reported speech is an important phenomenon that cannot be ignored when analyzing newspaper articles. We argue that the lexicai semantics of reportiug vcrbs plays all important part in extracting information from large on-iiine tcxt bases.</Paragraph> <Paragraph position="1"> Based oil extensive studies of two corpora, the 250,000 word TlMEcorpus and the 7 million word Wall Street Journal Corpus we identified that semantic coilocalious must be represented ill the lexicon, expanding thus on current trends to indude syntactic collocations in a word based lexicon \[Smadj~d~M cKeown90\].</Paragraph> <Paragraph position="2"> We further discovered that logical metonymy is pervasive in subject position of reporting verbs, but that reporting verbs differ with respect to their preference for different kinds of logical metonymy. A careful analysis of seven reporting verbs in the TIMEcorpus suggested that there are three features that divide the reporting verbs into classes according to the preference for metonymy in subject position, namely whether the subject NP refers to the source as a single person, a group of people, or an institution.</Paragraph> <Paragraph position="3"> The analysis of the source NPs of seven reporting verbs further allowed us to formulate a specialized se-SGrimshaw \[Grimshaw79\] argues that verbs also select for their complements on a semantic basis. \[;'or the sake of coneiscncss tim whole issue of the form of the complement and its semantic connection has to be omitted here.</Paragraph> <Paragraph position="4"> - 220 mantic grammar for source NPs, which constitutes an important interface between lexical semantics, syntax, and compositional semantics used by an application program. We are currently testing the completeness of this grammar on a different corpus and are planning to implement a noun phrase parser.</Paragraph> <Paragraph position="5"> We have imbedded the findings in the framework of Pustejovsky's Generative Lexicon and qualia theory \[Pustejovsky89\] \[Pustejovsky91\]. This rich knowi' edge representation scheme allows us to represent explicitly the underlying structure of the lexicon, ineluding the clustering of entries into semant.ic types (i.e. I,CPs) with inheritance and the representation of information which wa.s previously considered presuppositional and not part of the lexicai entry itself. In this process we observed that the analysis of semantic collocations can serve as a measure of semantic closeness of words.</Paragraph> <Paragraph position="6"> Acknowledgements: I would like to thank I.ily advisor, James Pustejovsky, for inspiring discussions and irlany critical readings.</Paragraph> </Section> class="xml-element"></Paper>