File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/00/w00-1418_intro.xml
Size: 3,469 bytes
Last Modified: 2025-10-06 14:01:05
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1418"> <Title>Optimising text quality in generation from relational databases</Title> <Section position="2" start_page="0" end_page="0" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> As the problems of how we generate text are gradually solved, a new problem is gaining prominence - where do we obtain the information which feeds the generation. Many domain models for existing generation systems are hand-crafted for the specific system. Other systems take advantage of existing information sources.</Paragraph> <Paragraph position="1"> A good information source for text generation resides in the vast number of relational databases which are in use around tile world. These resources have usually been provided for some reason other than text generation, such as inventory management, accounting, etc. However, given that the information is on hand, it can be of value to conuect these databases to text generation facilities.</Paragraph> <Paragraph position="2"> The benefits include natural language access to information which is usually accessed in tabular form, which can be difficult to interpret. Natural Language descriptions are easier to read, can be tailored to user types, and can be expressed in different languages if properly represented.</Paragraph> <Paragraph position="3"> This paper outlines the domain specification language for the ILEX text g~neration system, (for Intelligent Labelling Explorer). 1 ILEX is a tool for *dynamic browsing of databasedefined information: it allows a user to browse through the information in a database using hyper1Earlier ILEX papers have been based on Ilex 2.0, which was relatively domain-dependent. This paper is based around version 3.0 of ILEX, a re-draft to make the system domainindependent, and domain acquisition far easier. The ILEX project was supported by EPSRC grant GR/K53321.</Paragraph> <Paragraph position="4"> text. ILEX generates descriptions of database objects on the fly, taking into account the user's context of browsing. Figure 1 shows the ILEX web interface, as applied to a museum domain, in this case the Twentieth Century Jewellery exhibition at the the National Museum of Scotland. 2 The links to related database objects are also automatically generated. ILEX has been applied to other domains, including personnel (Nowson, 1999), and a sales catalogue for computer systems and peripherals (Anderson and Bradshaw, 1998).</Paragraph> <Paragraph position="5"> One of the advantages of using NLG for database browsing is that the system can keep track of what has already been said about objects, and not repeat that information on later pages. Appropriate referring expressions can also be selected on the basis of the discourse history. The object descriptions can be tailored to the informational interests of the user. See Knott et al. (1997) and Mellish et al. (1998) for more information on these aspects of ILEX.</Paragraph> <Paragraph position="6"> In section 2, we consider some systems related to the ILEX system. Section 3 describes the form of relational database that ILEX accepts as input. Section 4 outlines what additional information - domain semantics - needs to be provided for coherent text production from the database, while section 5 describes additional information which can be provided to improve the quality of the text produced.</Paragraph> </Section> class="xml-element"></Paper>