File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/00/w00-1428_intro.xml
Size: 4,305 bytes
Last Modified: 2025-10-06 14:01:05
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1428"> <Title>Integrating a Large-scale, Reusable Lexicon with a Natural Language Generator</Title> <Section position="2" start_page="0" end_page="209" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> Natural language generation requires lexical, syntactic, and semantic knowledge in order to produce meaningful and fluent output. Such knowledge is often hand-coded anew when a different application is developed. We present in this paper the integration of a large-scale, reusable lexicon with a natural language generator, FUF/SURGE (Elhadad, 1992; Robin, 1994); we show that by integrating the lexicon with FUF/SURGE as a tactical component, we can reuse the knowledge encoded in the lexicon and automate to some extent the development of the lexical realization component in a generation application. null The integration of the lexicon with FUF/SURGE also brings other benefits to generation, including the possibility to accept a semantic input at the level of WordNet synsets, the production of lexical and syntactic paraphrases, the prevention of non-grammatical output, reuse across applications, and wide coverage.</Paragraph> <Paragraph position="1"> We present the process of integrating the lexicon with FUF/SUR(;E. including how to represenl the lexicon in FUF format, how to unify input with the lexicon incrementally to generate more sophisticated and informative representations, and how to design an appropriate semantic input format so that the integration of the lexicon and FUF/SURGE can be done easily.</Paragraph> <Paragraph position="2"> This paper is organized as follows. In Section 2, we explain why a reusable lexical chooser for generation needs to be developed. In Section 3, we present the large-scale, reusable lexicon which we combined from multiple resources, and illustrate its benefits to generation by examples. In Section 4, we describe the process of integrating the lexicon with FUF/SURGE, which includes four unification steps, with each step adding additional lexical or syntactic information. Other applications and comparison with related work are presented in Section 5. Finally, we conclude by discussing future work.</Paragraph> <Paragraph position="3"> 2 Building a reusable lexical chooser for generation While reusable components have been widely used in generation applications, the concept of a &quot;reusable lexical chooser&quot; for generation remains novel.</Paragraph> <Paragraph position="4"> There are two main reasons why such a lexical chooser has not been developed in the past: 1. In the overall architecture of a generator, the lexical chooser is an internal component that depends on the semantic representation and for.:malism and onthe syntactic realizer used by the application.</Paragraph> <Paragraph position="5"> 2. The lexical chooser links conceptual elements to lexical items. Conceptual elements are by definition domain and application dependent (they are the primitive concepts used in an application knowledge base). These primitives are not easily ported from application to application.</Paragraph> <Paragraph position="6"> The emergence of standard architectures for generators (RAGS, (Reiter, 1994))and the possibility to use a standard syntactic realizer answer the first issue.</Paragraph> <Paragraph position="7"> To address the second issue, one must realize that if the whole lexical chooser can not be made domainindependent, major parts can be made reusable. The main argument is that lexical knowledge is modular. Therefore, while choice of words is constrained by domain-specific conceptual knowledge (what information the sentences are to represent) on the one hand, it is also affected by several other dimensions: * inter-lexical constraints: collocations among words o pragmatic constraints: connotations of words o stylistic constraints: familiarity of words * syntactic constraints: government patterns of words, e.g., thematic structure of verbs.</Paragraph> <Paragraph position="8"> We show in this paper how the separation of the syntactic and conceptual interfaces of lexical item definitions allows us to reuse a large amount of lexical knowledge across appli.cations.</Paragraph> </Section> class="xml-element"></Paper>