File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/80/c80-1007_metho.xml

Size: 35,589 bytes

Last Modified: 2025-10-06 14:11:17

<?xml version="1.0" standalone="yes"?>
<Paper uid="C80-1007">
  <Title>A CONTEXT-FREE GRAMMAR OF FRENCH*</Title>
  <Section position="1" start_page="0" end_page="0" type="metho">
    <SectionTitle>
A CONTEXT-FREE GRAMMAR OF FRENCH*
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
Morris Salkoff
</SectionTitle>
      <Paragraph position="0"> Universit&amp; de Paris 7, L.A.D.L.</Paragraph>
    </Section>
  </Section>
  <Section position="2" start_page="0" end_page="41" type="metho">
    <SectionTitle>
2 Place Jussieu, Paris 5, FRANCE
</SectionTitle>
    <Paragraph position="0"> Summarx. I present here a method that allows one to construct a CF grammar of a natural language that correctly accounts for verbal selection rules. This goes contrary to the prevailing opinion, following Chomsky 5, that such a construction is impossible. My method consists essentially in separating the semantic function of a selection rule, namely, the exclusion of certain noun sub-classes, from the syntactic relation between the elements (verb, subject, object:) linked by this relation of selection. When the verb and object (subject) are separated by intervening levels of complement construction, the selection can still be satisfied by a double classification of verbs: according to the kind of subject they take, and also according to the type of verb that can follow them (in the complement construction).</Paragraph>
    <Paragraph position="1"> Conjunctions and sentences with respectively can also be treated within the framework of the CF approximation proposed here.</Paragraph>
    <Paragraph position="2"> like the latter, is recursively extendable to SSO. Introduction sentences of theoretically unbounded length.</Paragraph>
    <Paragraph position="3"> It is now quite generally supposed that a natural language cannot be adequately described by a CF grammar. This opinion was first advanced by Chomsky 5 who discussed this problem from the point of view of phrase structure grammars. He presents there a fragment of a CF phrase structure grammar in terms of noun phrases NP, verb phrases, VP, etc., which are familiar from immediate constituent analysis. These rules cannot treat verbal selection rules properly; Chomsky ~ (ch. 8) had already tried himself to correct this defect within the framework of a CF phrase structure grammar, but the difficulties he encountered seem to have persuaded him that only a transformational grammar could handle such a problem.</Paragraph>
    <Paragraph position="4"> Harman 13 proposed another solution to the problem of treating verbal selection rules in a CF grammar; he added a set of subscripts to the CF rules used in Chomsky 5, which were chosen so that only those subjects and objects which satisfied the selection rules could appear with a given type of verb. Chomsky 8 showed that this method would not suffice if the sentences subscripted as Harman had suggested were themselves embedded in complement constructions. Thus, where Harman's system will not generate such aberrant sentences as *Bill elapsed, it will not be able to exclude the generation of such a sequence when it is embedded in a complement construction, as in *John persuaded Bill to elapse.</Paragraph>
    <Paragraph position="5"> Further arguments for the inadequacy of a CF grammar were adduced from the fact that sentences containing respectively cannot be assigned an appropriate structure in the framework of a CF grammar. This was noted by Chomsky 5 (SS4.2) in his discussion of the algebraic language w w; the relation between this language and sentences containing respectively was discussed by Bar-Hillel &amp; Shamir I, and then taken up again by Chomsky 8 together with examples taken from the comparative construction in English. Later, Postal 22 exhibited a construction in Mohawk which is similar to the one with respectively, and As a result of these considerations, Chomsky 8 concluded that a coherent description of recursively embedded sentences or of verbal selection rules could not be obtained in a natural way by any CF grammar, and that consequently no CF grammar could adequately describe a natural language. However, it turns out that this question is not so easily disposed of as it would appear, and recent work by Joshi &amp; Levy 18 shows that a CS grammar containing rich context-dependent rules can be used to analyze trees that describe a CF language. They did this by an extension of a theorem of Peters &amp; Ritchie 21 , who showed that CS rules of a certain type can be used not to generate sentences, i.e., not to characterize them, but only to verify their well-formedness, by applying the context-dependent parts of these rules as constraints on the set of trees that schematize these sentences. In this case, the language described by these trees is a CF language.</Paragraph>
    <Paragraph position="6"> Joshi &amp; Levy generalized the kinds of CS rules that can be used for this result and defined CS rules that can describe conditions on the context whose action is close to that of certain transformations. These rules are expressed as Boolean combinations of predicates that describe the left and/or right context of a node, or the upper and/or lower contexts (the nodes above and below a given node). Roughly speaking, a tree is said to be analyzable with respect to a grammar containing such rules if one of the rules is satisfied at each node of the tree. In that case, the language which consists of the terminal strings of all the trees analyzed by the grammar is a CF language, even though the rules take the context into account ~.</Paragraph>
    <Paragraph position="7"> Hence these terminal strings can be described by SSNote that the formalism used by Joshi &amp; Levy for displaying conditions on trees is close to the notation used for rewrite rules, and can lead to some confusion, lit need only be remembered that these context-dependent rules are not used to generate structures.</Paragraph>
    <Paragraph position="8">  some CF language.</Paragraph>
    <Paragraph position="9"> Now the string grammar proposed by Harris 15 and which analyzes English (Sager 23) and French (Salkoff 2~'25) can be shown to be of just the form described by Joshi &amp; Levy. It contains CS rules of the type described by them, and is used to analyze a tree, rather than to generate it.</Paragraph>
    <Paragraph position="10"> It would thus appear that English or French can be described by some CF language, although the string grammar gives no clear clue as to what its form would be. I shall show here that such a CF grammar can be written for French, and that it can treat, in a linguistically appropriate fashion, the problem of the expression of verbal selection rules in nested complement constructions. I have chosen French because systematic data giving a wide coverage of the French lexicon are available (Gross 12, Boons et al.3); however, the very nature of this construction makes quite plausible its extension to other natural languages. Only the method used will be outlined in this brief article, and an example of its application to embedded complement constructions; for more details, consult Salkoff 25 (chap. 3) I now construct CF rules that correctly describe sentences in which related pairs like verb-subject, verb-object, etc., that are linked by a relation of selection, may be separated by constructions of unbounded length. Each such CF rule is the expansion of a sentence schema S.</Paragraph>
    <Paragraph position="11"> The verbal selection rules are accounted for in this method by separating the semantic function of a selection rule, namely, the exclusion of certain noun sub-classes, from the syntactic relation between the pairs carrying this function (generally, a verb and a noun phrase). Each selection rule is decomposed into two independent parts: one part is the choice of a noun not classified in certain noun sub-classes, in such a way as to express the semantics of that selection rule; the second part is the use of the noun phrase containing this N for the subject or object of a given verb in ~ rule schema, which amounts to satisfying the complete verbal selection rule.</Paragraph>
    <Paragraph position="12"> Conjunctional sequences, including sentences containing respectivement can be handled by this method, but not within the strict mathematical framework of a CF language. The resulting CF grammar of French can be compared with a transformational grammar, and it is seen that the two are more similar than has been thought.</Paragraph>
    <Paragraph position="13"> SS|. The base rules In order to set forth the selection rules as clearly as possible, I shall begin by using in the rules developing S, noun phrases bearing three subscripts, i.e., complex symbols: (I) NP ; where x is a function F: subx,y,z ject s, object ~, or i~direct object io; y is the morphology M: singular, plural,...; and z is a semantic su~-class S~ these sets have no elements in common.</Paragraph>
    <Paragraph position="14"> With this notation, typical rules for S will have the following form:</Paragraph>
    <Paragraph position="16"> The verb is subscripted according to the complements it takes.</Paragraph>
    <Paragraph position="17"> In this notation, the CF rules no longer constitute a strict constituent grammar of the type discussed by Chomsky 5'8. My notation brings out the grammatical relations between the elements of the sentence schemata, which is not possibleSSin a direct way in a phrase structure grammar. The complex symbols are useful in order to explain clearly the process of sentence embedding; they will be eliminated in a second step and replaced by the noun phrases without subscripts used in the verbal selection rules.</Paragraph>
    <Paragraph position="18"> Main rule schema.</Paragraph>
    <Paragraph position="19"> For clarity, I shall use only the subscript F (s, o, or i o) in the rules for S. Only an abbreviated list of these rules can be given here; for a complete list, cf. Salkoff 25. A first sub-group of rules contains non-sentential objects:  (3) S ~ NP t V 1 (Max dort) S</Paragraph>
    <Paragraph position="21"> There are about ten such rules in French. A second group of rules contain a sentential complement clause: S + NP t V20 que S (Max sait que Paul a s fait cela) ; / NP s t V22 NP deg que S (J'~nforme Max que Paul est venu) A third group of rules yield embedded sentences. One example will be treated here, as it occurs in independent sentences and in relative clauses, to illustrate the method.</Paragraph>
    <Paragraph position="23"> The new notation oNPs denotes a noun phrase having a double function F: it must be an accept-SSTo do so, one has, for example, to reinterpret the tree structure of the sentence (cfo Chomsky7).</Paragraph>
    <Paragraph position="24">  able object of the verb V30 which precedes SS , and also an acceptable subject of the main verb of S I. The sentence schema for S 1 is a sentence deformation (in Harris '17 terminology); there are about ten such deformations in French. Another one is the following:  ... etc.</Paragraph>
    <Paragraph position="25"> Each such schema S i contains as many rules as S itself.</Paragraph>
    <Paragraph position="26"> With the schemata S l, I can account for the recursive embedding of sentences, like Luc convainc Paul d'apprendre C/ Max ~ dire aux dl~ves que...; other schemata are needed to account for sentence embedding in relative clauses: (6)a NP rl / NP que NP t V30 S I (l'homne o s s pro que Max convainc...</Paragraph>
    <Paragraph position="27"> b S 1 / o(t) s de V 1 .. de dormir) pro / o(O) s de V 2 NP deg .. d'oter cela) . . . etc.</Paragraph>
    <Paragraph position="28"> Here, the symbol o(@)s is a dummy element standing for the noun phrase, carrying the same subscripts, at the head of NP rl. It is marked by the same selectional features as oNPs and will be used to 'transmit' this selection through embedded sentences. Such dummy elements come close to certain pronouns found in relative clauses without antecedent, like ce in: J'ai achet$ ce que Max a sculpt@.</Paragraph>
    <Paragraph position="29"> A second type of relative clause is this:  (7) NP r2 / NP que S (le livre que... o pro Spr deg / NP s t V 2 (~)o ... Paul lit) + NP s t VL~ (~)o P NPio .Paul fair de . . . ces articles) / NP s t V30 S I'I .. Max convainc... pro S I'I / NP de V2 (0)o .. Paul de pro o s lire) / NP de V 4 (@)o P NP. 0 S iO ..Paul de faire de tee articles) / NP s t V32 S 3&amp;quot;I .. Max apprend... pro S 3.1 + d * NP d V 2 (~)o &amp;quot;&amp;quot; ~ Paul pro io s * . . ~ life) . etc.</Paragraph>
    <Paragraph position="30">  With these rules, it is possible to describe recursively embedded sentences inside relative clauses, although the complex symbols give us no SSSentences like Max emp$che que la table ne tombe / Max emp$che la table de tomber, in which the raised object (table) does not have to be compatible with the verb emp@oher, are accounted for by different rules* indication yet as to how the verbal selection rules are to be satisfied.</Paragraph>
    <Paragraph position="31"> SS2. Selection Rules According to the kind of noun allowed as subject, or as direct or indirect object, a verb is said to select for that sub-class* The majority of the selection rules thus concern the following three rules for S:</Paragraph>
    <Paragraph position="33"> The selection rules vary with the preposition Pi for verbs V 3 and V~. In the sentence analyzer based on the string grammar, these selection rules are contained in a system of contextual rules attached to each lexical entry for a verb that can appear in (8)* Experience shows that five noun sub-classes are needed for such a system of selection rules: N t, 'time'; Ns, sentential; Nh, 'human'; Nc, concrete; and Nnom, nominalizations. These sub-classes are used in the verbal entries to indicate the unacceptable contexts for a verb classified in V2, V3, or V~.</Paragraph>
    <Paragraph position="34"> The analyzer then uses these contextual rules to disallow an unacceptable decomposition in a sentence analysis.</Paragraph>
    <Paragraph position="35"> These contextual rules can be replaced by CF rules in the following way. Let  (9) N \ {N. + N. + ...} = N&amp;quot; i \] denote any noun except one belonging to sub-class Ni, or to N~, etc.; the bar \ means 'minus'. If N&amp;quot; is substituted for the noun N in any NP, and carried over into every rule developing NP, the terminal rule for the noun in NP will be (\]0) N&amp;quot; / N d , where N d is a lexical entry.  Each choice for N d is compared with the list of sub-classes Ni,N., ..., attached to N'. If N d belongs to any ~f these sub-classes, it is discarded; if N d doesn't belong to these sub-classes the conditions expressed in (9) are satisfied. Now, if the selection rule of a given verb is that sub-classes N., N~, , are unacceptable as i d &amp;quot;'&amp;quot; subject (object), then the noun phrase containing N d satisfies that selection rule, and will be the only noun phrase permitted in that syntactic position.</Paragraph>
    <Paragraph position="36"> I now define noun phrases GN containing all the combinations of excluded noun classes from the five named above (there are 31 such GN):</Paragraph>
    <Paragraph position="38"> Inserting the noun phrases GN~ { into (8), and replacing the subscripts i an~J\]_ ' by the single subscript ~, I obtain the following rule schemata: null  double verb classification: V~ ~,(Vj,~,,) is that verb sub-class which requiresJ'JGNi Jfor subject, when the direct (indirect) object is GN i, (GN i ). Lexicographlcal work shows that there are'about 40 different prepositions appearing in the objects P N and N P N. Since the double verb classification must be carried out for each value of Pi' this amounts to a triple classification of verbs.</Paragraph>
    <Paragraph position="39"> SS3. Elimination of the complex symbols The schema (12) generates only acceptable sentences; each verb in the lexicon is classified according to which of the sub-classes defined by (12) it belongs to; hence no verb will ever appear in a schema of type (12) unless it is acceptable there. Then, since the process defined by (I0) is such that only acceptable nouns can be chosen for the noun phrases GN (= N') in these schemata, each schema must in fact give rise to an acceptable sentence* The situation is quite different, however, for the rules containing oNPs, NP o or NPio.</Paragraph>
    <Paragraph position="40"> These cannot be developed as written, for two reasons: (1) only noun phrases of the type N&amp;quot; are available, so that verbal selection rules can be satisfied; (2) the syntactic functions expressed by the subscripts on these noun phrases can be obtained only by a sub-classification of the verbs appearing with them. Thus, in order for oNPs (in 4) to be an acceptable object of the verb V a that precedes, and also an acceptable subject for the verb V b of the embedded sentence containing it, the verb V b must be sub-classified according to type of subject, and V a has to be sub-classified according to the type of V b that may follow.</Paragraph>
    <Paragraph position="41"> An even more complex classification is needed to handle relative clauses like (7), which begin with NP o. This noun phrase must be an acceptable object for the last verb, say Vc, in the S which follows; however, S can contain an pro pro unbounded number of embedded verbs before V c appears. Hence, V c is not known at the moment when the lexical entry is chosen for the N&amp;quot; which represents NP o. The problem, then, is to 'transmit' the selectional characteristics of the noun in NP o to the rule that will later develop Ve, by using the embedded verbs as carriers for the selectional information* This transmission of selectional information necessitates a sub-classification both of embedded verbs and of the schemata of the type S pro&amp;quot; SS31 Elimination of oNPs I subdivide the sets S\]S 2, S 3, ... (cf.4 and 5) into S~ subsets, where i runs through the 31 possible values of the subject N&amp;quot; (which replaces oNPs). These subsets then constitute a classification of the schemata S I, .., according to the type of subject the verb of the schema:  This new way of ordering the rules is the basis for the sub-classification of verbs V30, which take the object S I. A verb V30 accepts only the sub-sets S~ whose subject N~ is an acceptable object forlthat verb* This is a selection rule between verbs: the verb V30 selects an object having a verb of a certain type t.</Paragraph>
    <Paragraph position="42"> The generation of reeursively embedded sentences which satisfy verbal selection rules is now obtained as follows. First, let us choose a rule developing the matrix sentence , for example (15) N\]t V30 S~ I Now the verbs in the sub-class V30 have been sub-classified in the lexicon according to the type of acceptable subject, N i, and also according to the type of acceptable complement S~. By choosing in (15) a verb in the sub-class (Nj,Si) , I obtain an acceptable sentence.</Paragraph>
    <Paragraph position="43"> %The selection between verbs mentioned here has already been suggested by Z. Harris 16 in the framework of a system of sentence generation based on the concept of the verb as an operator acting on its arguments (approximately, its sub-ject and object)* Selection between verbs was also used by M. Gross I0 in order to account for constructions like Je coul~s manger un g~teau, ?? Je co~2~s d~tester Max; here, the first verb (of movement) selects for the type of verb that can follow it.</Paragraph>
    <Paragraph position="44">  1 Next, S i is developed, using the schema (13), by one of two tynes of rules: (16)a S~l + N\]z de V31 S~; b S~l / N~ de V 2 NP o  If rule a is chosen, another sentence is embedded, anda verb V31 in the sub-class (N\[,S\]) is chosen from the lexicon. But if rule b is chosen sentence embedding terminates with that rule.</Paragraph>
    <Paragraph position="45"> The same method can be used for generating acceptable relative clauses NP rl (in 6). As an example, I rewrite one of the NP rl in terms of the noun phrases N':  (17)a NP rl / N~ que N\]t V30 S l pro,i b S~ro,i / (@)i de V 1 i) By choosing a verb V30 in the subclass (N\],Sp, i.e., one taking N'.. as its subject and as second verb (in S!ro )J one whose subject is N\[, I guarantee that the N i in a is both an acceptable object of V30 and an acceptable subject of the verb in S1 pro SS32 Elimination of N_P_o  The development sketched in SS31 will not do for relative clauses like NP r2 (in 7), which have the form NP o que Spr o. This can be schematized roughly as NP o que. * V i .... V~ V c , where Vi, Vj, .. are embedded verbs o~ the type V30, V31, ..., and V c is the last verb of 'qpro' the one for which NP o must be an acceptable obj ect.</Paragraph>
    <Paragraph position="46"> In order to transmit the selectional characteristics of NP o to the rule that develops Vc, and this within the framework of a CF grammar, I can proceed as follows. I subscript Spro in (7) by k, which is also the subscript on the noun phrase N~ that replaces NP o (just as S I, S 2, .... were subscripted for the type of subject); then the schema SI~ for embedded sentences will have two subscripts: one for ~, and a second one for the type of subject the verb takes. This yields the following kind of development:  (31 times, as above, once each for the subjects N~ ..... Nil) The typical rule for NP r2 is the following: sl.l,k (19) NP r2 ~ N~ @ue N~ t V30 pro,j Once more, acceptability is guaranteed by choosing a verb V30 in the sub-class (N~, S\]). Next, the symbol SI.i, I~ representing a possibly em- &amp;quot; pro,j ' bedded sentence, can be developed by the rules: * s3.l,k (20)a sl'l'kpro,J / N:j de V32 pro,j' b SI'I,. k / N: de V 2 (~)k pro,j j If rule b is chosen, sentence embedding terminates; then, choosing a V 2 in the sub-class taking an object of type N~ (as indicated by @k ) guarantees that N~ in (18) is an acceptable object for that V 2. If-rule a is chosen, sentence embedding continues; a verb V32 is chosen, in the sub-class (hi, S\],), until a rule of type _b is chosen.</Paragraph>
    <Paragraph position="47"> The reader will notice two features of this method of using the selection rules to generate relative clauses* (I) The subdivision of S I into a set of S~ rule schemata does not increase the number of rules in S. The same number of rules would be obtained by inserting the noun phrases Nk into S (or SI), and this must be done in any case in order to express the verbal selection rules (in whatever fashion)* In the decompositions of SI,..., used above, the point was only to present the original schemata so as to make the subject or object of the verb in the schema stand out, for further reference. (2) The two kinds of selection made explicit in these schemata, the one between verbs, and the other (better known) between verb and object (or subject), appear only once in the grammar. Both types of selection are used in each step of sentence embedding, but in no case does this entail rewriting the two kinds of selection in the grammar each time a deeper level of embedding is attained. null SS4. Conjunction; respectively It has been shown by Chomsky 5 that conjunctions can be described in a CF grammar only by using an infinite number of rules, represented by rule schemata; if one restricts oneself to strict CF grammar, one introduces an excessive structuring of the conjoined forms* An approximate sol-ution can nevertheless be given to this problem, in the framework of a finite CF grammar, in the following way. I construct a sequence of conjoined noun phrases: (21)a GN 1 + N&amp;quot; ; b GN 2 / N&amp;quot; et N&amp;quot; ; c GN 3 / N&amp;quot; et N&amp;quot; et N&amp;quot; d GN I / N&amp;quot; et N&amp;quot; ... et N&amp;quot; (! times) k 42 i Denoting by Gcf the CF grammar containing the rules GN i, GN i-I, ..., GN I, I can. set up the series of grammars G~f, G 2 l cf' &amp;quot;''' Gcf' each representing a better approximation to the infinite G~f, which contains a noun phrase of un- grammar bounded c length.</Paragraph>
    <Paragraph position="48"> For any practical purpose, such as generation (or analysSs) of sentences, it is clear i that one of the Gcf will be large enough to yield the desired precision. However, another approximation is available which is less costly, from the viewpoint of the number of rules required, and which yields the same result for G~f. This is the rule schema proposed by Chomsky &amp; Sch5tzenberger 9 for handling conjunction in a CF grammar. For the case of noun phrase conjunction, this schema is as follows: (22)a GN / N'; b GN + N&amp;quot; (et N')* The star indicates that the group (et N') can be iterated as many times as is necessary. This schema is therefore an abbreviation for an infinite number of rules.</Paragraph>
    <Paragraph position="49"> With such a rule schema in it, my grammar is no longer strictly CF; however, it is clearly f~ithful to the spirit of the approximation for l outlined above, since the language described Gcf by my grammar is the same as that reached asymptotically by the series of grammars G~f, 2 Gcf,.., G~f obtained with (21). The rule schema (22) can be compared to an alggrithm for generating any l by choosing the number one of the grammars Gcf of iterations.</Paragraph>
    <Paragraph position="50"> There exists a set of structures in natural language which cannot be described by the methods developed until now, namely those containing either respectivement, or the distributives qui or selon que: (23)a Les rats des groupes Aet B rSussissent et $chouent dans les labyrinthes L aet Lb, respectivement.</Paragraph>
    <Paragraph position="51"> b Les reporteurs ont parle qui aux ministres, qui aux d~l$gu~s, qui aux d~put~a.</Paragraph>
    <Paragraph position="52"> c Selon que tu es pauvre, bourgeois ou aristocrate, tu seras ouvrier, commergant ou patron.</Paragraph>
    <Paragraph position="53"> Although these strings cannot be generated by a CF grammarSS , a procedure is nevertheless avail-SS--The applicability-~f this argument to the linguistic case is not quite as simple as this brief formulation of the argument might lead one to suppose, in the way it is generally used in discussing sentences with respectivement. It is only the language containing just the sentences (23), and only those, that cannot be generated by a CF grammar. However, in order for this conclusion to apply to the generation of the entire French language by a CF grammar, it must be shown that there exists no sublanguage of French containing these sentences in respectivement as a subset that can be generated by a CF grammar.</Paragraph>
    <Paragraph position="54"> Cf. Gross II (SS8.1) for this argument.</Paragraph>
    <Paragraph position="55"> able for including this type of sentence in the CF approximation under discussion here.</Paragraph>
    <Paragraph position="56"> I add Kleene rules to the grammar, and a condition on these rules, as follows: (24)a N s (et Ns)* V N o (et No)* b N s (et N )* V (et V)* N (et No)* s o These rules contain all common conjunctions of subject, verbs and direct object. Moreover, they cover the sequences of classes observed in sentences containing respectivement. They don't have the structure one would like to associate with such sentences. In order to describe the respectivement sentences, I add the following condition to the starred parentheses: the number of iterations of each occurrence of the star is the same; and a structure, or rule of interpretation, is imposed on the starred groups, as follows: 1 I , No (etINo), (25) N s (et Ns)* V (et V) This grouping pairs the N s and the N o that are to be associated with each other via respectivement; (25) is equivalent to: (26) N 1 et N 2 .. V 1 et V 2 .. N 1 et N 2</Paragraph>
    <Paragraph position="58"> Thus, I am interpreting (25) as a sentence conjunction: N 1V 1N 1 ~ ~2 ~2 M2 ~ requir- O ~ L~S v ~0 ~ &amp;quot; &amp;quot; &amp;quot; ~ as ed by the adverb respeetivement.</Paragraph>
    <Paragraph position="59"> SS5. Conclusions The methods I have sketched here can certainly be applied to other natural languages and will account in a natural way for the general phenomena of verbal selection rules in embedded sentences. One may wonder why this work has not been carried out before.</Paragraph>
    <Paragraph position="60"> Historically, attacks against the adequacy of CF grammar for describing natural language arose at a moment when it was necessary to explore the nature of the transformational grammar just proposed. This new style of grammar seemed so much better adapted than CF phrase structure grammars to explaining sentence relations that any more effort towards developing a detailed CF grammar seemed fruitless. To discourage such efforts, Chomsky 5 (chap. 5) declared that &amp;quot;any grammar that can be constructed in terms of this theory \[CF phrase structure grammar\] will be extremely complex, ad hoc and 'unrevealing'&amp;quot;. These remarks were reaffirmed (Chomsky 8) and bolstered by an argumentation based on the inherent inadequacy of CF grammar for describing verbal selection rules.</Paragraph>
    <Paragraph position="61"> A second criticism arose from the analysis of constructions, like respectivement, whose description could not be obtained within the strict framework of a CF grammar. We have seen above that such a statement is at best unclear. It may be correct that a mathematically rigorous description of this construction is not possible in a  strict CF grammar; even so, we are under no obligation to transfer this observation bodily to the domain of linguistics. The type of description that I elaborated above, in which a rule of interpretation is added to a rule generating the form of sentences containing respectivement, is now used in recent work in generative semantics.</Paragraph>
    <Paragraph position="62"> Moreover, it can be seen that the CF grammar presented here is but a short step removed from a transformational grammar. In all transformational theories, a transformation includes (among other things) a relation between sentences. Most authors also include operations that deform one sentence into another, or which modify an abstract structure so as to derive sentences from it. The CF grammar I have proposed contains the information that establishes relationships between sentences, but it does not contain the operations or the metalinguistic assertions that make the transformation explicit. By a small extension of the CF framework I can also obtain the equivalent of a transformation, as follows. As an example, I consider the passive transformation.</Paragraph>
    <Paragraph position="63"> The passive transformation consists in matching an active phrase with its passive counterpart. The statement of the transformation can stop there, as does Harris 'I~, or one can add the specification of the computer operations needed to create the active and passive trees, as in generative grammar. In the CF grammar presented here, I have two independent rules, one for the active form, and another for the passive of the first: (27)a Sac t / NP s t V 2 NP o b Spa s / NP o t ~tre V2~ (par NP s) Each of these rules has an independent set of selection rules that are expressed in the choice of the N&amp;quot; for the NP. Adding these selection rules,  This is of course a wasteful repetition of identical selection rules. It was just to avoid this kind of useless duplication that justified the introduction of transformations. Suppose now that I factorize the selection rules from a set of forms that constitute an equivalence class, for example, from the 'active' and the 'passive' forms; I place a separator p between the forms of the equivalence class: (29) S / NP s t V 2 NPo/p/ NP o t ~tre V2~ (par NPs)/p/ I1 t ~tre V2e NP o (par NPs); + N~ &amp;quot; NP o + N\] NP s In this formulation, the selection rules are no longer duplicated; moreover, we can interpret the separator p between the members of the equivalence class as indicating a relation between the sentence schemas so separated. The factorization of the selection rules, together with the introduction of the separator can be read as the definition of a transformational rule between the sentence schemata.</Paragraph>
    <Paragraph position="64"> Of course, rule (29) is no longer CF, but it represents a rather natural extension of the CF framework which makes the latter much more similar to a transformational grammar than one might have thought possible up til now. However, the reader will note that the concept of a transformation is indispensable as a tool for the construction of this CF grammar, and then for its extension towards a transformational grammar by means of the factorization of selection rules.</Paragraph>
    <Paragraph position="65"> Furthermore, this CF grammar does not generate the sentences of the language 'weakly', in the meaning given this word by Chomsky; in fact, it provides them an 'adequate' grammatical structure as well as a linguistically justifiable relationship to other sentences of the language.</Paragraph>
    <Paragraph position="66"> Finally, let us note that although the entire set of rules of the CF grammar proposed here is large (of the order of 109 rules), it is nonetheless finite. Furthermore, the size of the grammar is of no theoretical consequence, since it could be stored quite handily, not in some static memory (e.g., a pile of discs), but in a dynamic form (that is, in the form of schemata) where each rule is generated at the moment when the program of syntactic analysis (or generation) requires it. In this way, the set of rules would be reduced to a series of sub-programs that can generate either one rule, or a sub-set of rules, or all the rules. During analysis or generation, a call for rules would activate their synthesis by the appropriate sub-program.</Paragraph>
    <Paragraph position="67"> Such a program of analysis by synthesis reduces the number of rules to a smaller number of sub-programs, but a string grammar reduces them still more, down to a set of about 150 strings (the rewrite rules) together with about 200 restrictions (the CS portions attached to the CF rules).</Paragraph>
    <Paragraph position="68"> The size of the CF grammar required to describe selection rules adequately also explains why all attempts at automatic syntactic analysis by means of strictly CF grammars undertaken until now have failed. The authors of these CF grammars limited their effort to including some rudimentary linguistic facts; the average size of this sort of CF grammar was of the order of several thousand rules (cf. Kunolg,20). Under these conditions, there was no question of providing only linguistically acceptable analyses. However, in the last few years, other CS variants of a CF grammar have been proposed, and partly worked out. In particular, the augmented transition network grammar of Bobrow &amp; Fraser 2, especially in the form given it by Woods 28, has predicates associated with the transitions, predicates that are so many context-sensitive tests. This kind of  44grammar is then quite similar to string grammar, i.e., to a CF grammar together with CS conditions on the rules. Unfortunately, none of the grammars based on the ideas of Bobrow and Woods has been worked out in sufficient detail to make a linguistic comparison with string grammar possible. null *I should like to thank M. Gross for many helpful comments, and myself for an excellent typing job.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML