File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/86/c86-1124_intro.xml

Size: 2,702 bytes

Last Modified: 2025-10-06 14:04:34

<?xml version="1.0" standalone="yes"?>
<Paper uid="C86-1124">
  <Title>FORMAL SPECIFICATION OF NATURAL LANGUAGE SYNTAX</Title>
  <Section position="2" start_page="0" end_page="0" type="intro">
    <SectionTitle>
1. INTRODUCTION
</SectionTitle>
    <Paragraph position="0"> Formal specifications of natural language syntax should serve as a standard definition for the syntax of the subject language. The specification must be complete, concise, consistent, precise, unambiguous, understandable, and useful to language scholars, users, and implcmentors who wish to develop a parser for the tanguagc to run on a computer.</Paragraph>
    <Paragraph position="1"> Furthermore the specification should be mathematically rigorous to the degree that an implementation of the language can be automatically derived from the specification {10\]. Unfortunately many of these aims arc difficult to accomplish primarily because of the dynanric and informal nature of natural language. Formal specification is still a worthy goal to the degree allowed by present knowledge about natm'al language and iu this paper we propose a mctalanguage for specifying both syntax and semantics of natural language that has potential for satisfying these goals.</Paragraph>
    <Paragraph position="2"> The mctalanguage we propose is the two-levd grammar \[16} (also called W-grammars and tlgs). Two-level grammars have been used extensively for specifying the syntax and semantics of programming languages \[2\] but their use in specifying natural language was first introduced by the authors \[7, 8, 9\].</Paragraph>
    <Paragraph position="3"> Existing formal specification mcthods for natural language syntax take many forms. Of these, some of the more common are augmented transition network grammars \[181, transformational grammars \[1\], and generalized phrase-structure grammars \[5\]. These methods and others arc also surveyed in \[17\]. The degree to which any formal specification method satisfies the above stated goals is sometimes difficult to evaluate and relies on subjectivity. The authors do not intend to evaluate these existing methods with respect to the requirements of formal specification languages but will instead concentrate on why two-level grammars satisfy the necessary goals in a mathematically rigorous but readable and easy to understand way. In this paper, the two-level grammar mctalanguage will be used to define a large classification of English declarative sentences, extending work described in \[8\] and \[9\]. We will emphasize the method of using two-level grammars for this purpose and the advantages gained rather than any particular characteristics of the given grammar.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML