File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/intro/96/p96-1013_intro.xml
Size: 5,603 bytes
Last Modified: 2025-10-06 14:06:03
<?xml version="1.0" standalone="yes"?> <Paper uid="P96-1013"> <Title>Parsing for Semidirectional Lambek Grammar is NP-Complete</Title> <Section position="2" start_page="0" end_page="95" type="intro"> <SectionTitle> 1 Introduction </SectionTitle> <Paragraph position="0"> Categorial Grammar (CG) and in particular Lambek Categorial Grammar (LCG) have their well-known benefits for the formal treatment of natural language syntax and semantics. The most outstanding of these benefits is probably the fact that the specific way, how the complete grammar is encoded, namely in terms of 'combinatory potentials' of its words, gives us at the same time recipes for the construction of meanings, once the words have been combined with others to form larger linguistic entities. Although both frameworks are equivalent in weak generative capacity -- both derive exactly the context-free languages --, LCG is superior to CG in that it can cope in a natural way with extraction and unbounded dependency phenomena. For instance, no special category assignments need to be stipulated to handle a relative clause containing a trace, because it is analyzed, via hypothetical reasoning, like a traceless clause with the trace being the hypothesis to be discharged when combined with the relative pronoun.</Paragraph> <Paragraph position="1"> Figure 1 illustrates this proof-logical behaviour. Notice that this natural-deduction-style proof in the type logic corresponds very closely to the phrase-structure tree one would like to adopt in an analysis with traces. We thus can derive Bill misses ~ as an s from the hypothesis that there is a &quot;phantom&quot; np in the place of the trace. Discharging the hypothesis, indicated by index 1, results in Bill misses being analyzed as an s/np from zero hypotheses. Observe, however, that such a bottom-up synthesis of a new unsaturated type is only required, if that type is to be consumed (as the antecedent of an implication) by another type. Otherwise there would be a simpler proof without this abstraction. In our example the relative pronoun has such a complex type triggering an extraction.</Paragraph> <Paragraph position="2"> A drawback of the pure Lambek Calculus !_ is that it only allows for so-called 'peripheral extraction', i.e., in our example the trace should better be initial or final in the relative clause.</Paragraph> <Paragraph position="3"> This inflexibility of Lambek Calculus is one of the reasons why many researchers study richer systems today. For instance, the recent work by Moortgat (Moortgat 94) gives a systematic in-depth study of mixed Lambek systems, which integrate the systems L, NL, NLP, and LP. These ingredient systems are obtained by varying the Lambek calculus along two dimensions: adding the permutation rule (P) and/or dropping the assumption that the type combinator (which forms the sequences the systems talk about) is associative (N for non-associative).</Paragraph> <Paragraph position="4"> Taken for themselves these variants of I_ are of little use in linguistic descriptions. But in Moortgat's mixed system all the different resource management modes of the different systems are left intact in the combination and can be exploited in different parts of the grammar. The relative pronoun which would, for instance, receive category (np\np)/(np --o s) with --o being implication in LP, 1 i.e., it requires as an argument &quot;an s lacking an np somewhere&quot; .2. The present paper studies the computational complexity of a variant of the Lambek Calculus that lies between / and tP, the Semidirectional Lambek Calculus SDk. 3 Since tP derivability is known to be NPcomplete, it is interesting to study restrictions on the use of the I_P operator -o. A restriction that leaves its proposed linguistic applications intact is to admit a type B -o A only as the argument type in functional applications, but never as the functor. Stated prove-theoretically for Gentzen-style systems, this amounts to disallowing the left rule for -o. Surprisingly, the resulting system SD\[. can be stated without the need for structural rules, i.e., as a monolithic system with just one structural connective, because the ability of the abstracted-over formula to permute can be directly encoded in the right rule for --o. 4 Note that our purpose for studying SDI_ is not that it might be in any sense better suited for a theory of grammar (except perhaps, because of its simplicity), but rather, because it exhibits a core of logical behaviour that any richer system also needs to include, at least if it should allow for non-peripheral extraction. The sources of complexity uncovered here are thus a forteriori present in all these richer systems as well.</Paragraph> <Paragraph position="5"> collapse.</Paragraph> <Paragraph position="6"> 2Morrill (Morrill 94) achieves the same effect with a permutation modality /k apphed to the np gap: (s/Anp) SThis name was coined by Esther K6nig-Baumer, who employs a variant of this calculus in her LexGram system (KSnig 95) for practical grammar development.</Paragraph> <Paragraph position="7"> 4It should be pointed out that the resource management in this calculus is very closely related to the handhng and interaction of local valency and unbounded dependencies in HPSG. The latter being handled with set-valued features SLASH, QUE and KEL essentially emulates the permutation potential of abstracted categories in semidirectional Lambek Grammar. A more detailed analysis of the relation between HPSG and SD\[ is given in (KSnig 95).</Paragraph> </Section> class="xml-element"></Paper>