File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/90/c90-3003_metho.xml

Size: 26,179 bytes

Last Modified: 2025-10-06 14:12:29

<?xml version="1.0" standalone="yes"?>
<Paper uid="C90-3003">
  <Title>Backwards Phonology</Title>
  <Section position="3" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2 Reversibility of Unilevel
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
Rule Systems
</SectionTitle>
      <Paragraph position="0"> The question of grammar reversibility involves two interrelated but separate issues. The first is whether the notational or descriptive devices of a grammar are in general amenable to being reversed, and what is involved in the reversal. The second is whether individual accounts of the phenomena&amp;quot; of a particular language are reversible, and, again, if so, what is involved in the reversal.</Paragraph>
      <Paragraph position="1"> Tim remarks in this paper are mainly concerned with the general paradigm of generative phonology, in particular, segmental phonology as is described in elementary texts - e.g., Kenstowicz arid Kisseberth (1979), IIalle and Ciements (1983), Schane (1973), Mohanan (1986) - rather than any particular linguistic theory. The main techniques discussed are rewrite rules, orderings of rules, features, and variables for feature values (e.g., the alpha and beta of assimilation rules). The problems of suprasegmental phonology will be left for another paper.</Paragraph>
    </Section>
  </Section>
  <Section position="4" start_page="0" end_page="0" type="metho">
    <SectionTitle>
3 Backwards Rules
</SectionTitle>
    <Paragraph position="0"> I shall start by making explicit what it means to apply a phonological rule in the backwards direction. The basic idea is extremely straightforward and will be, I think, uncontroversial.</Paragraph>
    <Paragraph position="1"> a-+ b / ot_fl (1) A rule like the one in (I) transforms the string/o~afl/ into the string /abfl/. Here c~ and fl are strings of characters over some alphabet, e.g., the phonemes of a language. I take it that such a rule can also be interpreted as mapping the string/o~bfl/into the string /o~afll, when it is applied backwards.</Paragraph>
    <Paragraph position="2"> To take a more linguistically realistic rule, let us consider the simple rule in (2).</Paragraph>
    <Paragraph position="4"> From a recognition point of view, this means that if we have the sequence trig\] in a surface form of a word, then the underlying sequence could be /n g/.</Paragraph>
    <Paragraph position="5"> In slightly more general terms, we look for the segment on the right side of the arrow to see whether it appears in the context given in the rule. If so, we can transform that segment into the segment on the left side of the arrow.</Paragraph>
  </Section>
  <Section position="5" start_page="0" end_page="0" type="metho">
    <SectionTitle>
4 Obligatory Versus Optional
</SectionTitle>
    <Paragraph position="0"> The rule in (2) says nothing about whether it is optional or obligatory in the backwards direction. Optionality in the backwards direction is entirely independent of optionality in the forward direction. In English the rule in (2) seems to be obligatory in the reverse direction, i.e., every surface \[t3\] seems to come from an underlying/n/. In the forward direction, it does not always apply. This is demonstrated by the pair: co\[~l\]gress vs. co\[n\]gressional, l In a language that had phonemic/Ij/and/n/, the rule might be obligatory in the forward direction and optional in the backward direction. 9&amp;quot; That is, if \[rj\] on the surface can come from either/n/or/(I/, then the rule would necessarily be optional in the reverse direction.</Paragraph>
    <Paragraph position="1"> The point here then is that one needs to specify in the grammar not just whether a rule is obligatory or optional in the forward direction, but also whether it is obligatory or optional in the backwards direction.</Paragraph>
  </Section>
  <Section position="6" start_page="0" end_page="0" type="metho">
    <SectionTitle>
5 Reversibility and Rule Or-
</SectionTitle>
    <Paragraph position="0"> dering The previous example describes the case of a single rule and points out that attention must be paid to whether a rule is optional or obligatory in the backwards direction as well as in the forward direction. The following case of rule ordering shows that there is more to the issue of reversibility than the distinction between &amp;quot;optional&amp;quot; and &amp;quot;obligatory.&amp;quot; There is a beautiful example in the Problem Book in Phonology by lIalle and Clements (1983) of the elegance of rule ordering. In this section I will show that the device of ordered rules is not generally reversible using their example from Klamath.</Paragraph>
    <Paragraph position="1"> Tile data from Kiamath together with five rules are taken from llalle and Clements (1983), who in turn give their source as being Klamath Grammar by Barker (1964): IMohanan (1986) p. 151.</Paragraph>
    <Paragraph position="3"> Halle and Clements also say that Barker assumes that all phonological rules are unordered and that all rules apply simultaneously to underlying representations to derive surface representations. 3 They then give the following exercise: &amp;quot;Show how Barker's set of rules can be simplified by abandoning these \[Barker's\] assumptions and assuming that phonological rules apply in order, each rule applying to the output of the preceding rule in the list of ordered rules. Write the rules sufficient to describe the above data, and state the order in which they apply. ''4 ,The rules that one is supposed to arrive at are</Paragraph>
    <Paragraph position="5"> The ordering to impose is that Rule (3) applies before Rules (4) and (5), and that Rules (4) and (5) are unordered with respect to each other. The reader can verify that the rules give the correct results when applied in the forward (generative) direction. In the backwards (recognition) direction, the derivations for the five forms are as given below. The rule numbers are superscripted with a minus one to indicate that these rules are inverses of the rules listed above.</Paragraph>
    <Paragraph position="7"/>
  </Section>
  <Section position="7" start_page="0" end_page="0" type="metho">
    <SectionTitle>
4 Ibid.
</SectionTitle>
    <Paragraph position="0"> holfa:l'a ---+ holi'a:l'a ~ honl'a:l'a Rule 5 -1 Rule 3 -1 pa:lha -4 pa:il.a ---+ *pa:nla Rule d -t Rule 3-t yalgal?i ---* yalyali'i --4 *yalyani'i Rule 5 -1 Rule 3 -1 What we~see here is that in order to recognize the  form holli:na correctly, Rule (3) must be obligatory in the reverse direction. However, in order to get the correct results for the forms pa:lha and yalyalfi, Rule (3) may not apply at all; i.e., it is not correct to say that the results can be obtained by correctly stipulating whether a rule is optional or obligatory. Rule (3) works well in the forward direction, but gives incorrect results when applied in the backwards direction. In short, the elegant set of ordered rules makes incorrect predictions about recognition. In contrast, Barker's original unordered set of rules correctly describes the data regardless of direction of application (i.e., generation vs. recognition).</Paragraph>
    <Paragraph position="1"> This is a result about ordering of rules. I have not shown that a set of ordered rules is never reversible, only that such a set is not necessarily reversible.</Paragraph>
  </Section>
  <Section position="8" start_page="0" end_page="0" type="metho">
    <SectionTitle>
6 Variables and Deletion
</SectionTitle>
    <Paragraph position="0"> The previous example used extremely plain rules: no features, no alphas or betas, and no deletion. The next example I shall present involves some of these commonly used devices. I shall try to make clear when they can be used in a reversible way (though they need not be), and when they just do not seem amenable to reversal. Before discussing reversal further, I will present the data and the set of rules for describing the data ill the generative framework. The data and analysis were taken from Kenstowicz and Kisseberth (1979). 6 Their data come from the language Tunica.</Paragraph>
    <Paragraph position="1"> The rules and data deal with two phenomena: vowel assimilation and syncope. The rules, given below, are ordered, with (6) occurring before (7). \[Note on transcription: the question mark represents glottal stop.\] SThls is correct modulo the change of i back into y which Halle and Clements assure us is not, part of the issue at hand. For purposes of discussing reversibility it merely provides more support for the argument that unilcvel rules are not easily reversed.</Paragraph>
    <Paragraph position="2"> 6p. 292. They cite their source as IIaas (1940).</Paragraph>
    <Paragraph position="4"> Rule (7) says (or was meant to say) that unstressed vowels are deleted before glottal stops. Rule (6) was intended to mean that /a/ assimilates to \[el or \[hi when it is separated by a glottal stop from a preceding /i/ or /u/ respectively.</Paragraph>
    <Paragraph position="5"> In addition to the two rules just given, Kenstowicz and Kisseberth mention but do not formulate a rule of Right Destressing that follows both rules. The rules are in accord with the following data, also taken from Kenstowicz and Kisseberth. The following forms show assimilation.</Paragraph>
    <Paragraph position="6"> To verb He verbs She verbs She is v-ing Gloss pd pdfuhki p6C.aki p6hk~, aki look p~ pf?uhki p#eki pfhkfaki emerge yd yd ?uhki yd ?aki ydhk ?aki do ~d 6d?uhki ~d?aki ~dhk?aki take These forms show syncope and assimilation.</Paragraph>
    <Paragraph position="7"> To verb He verbs She verbs She is v-ing Gloss hdra hdr?uhki hdr?aki hdrahk?dki sing h(pu h(p?uhki h~paki h\[pnhkfdki dance ndgi ndgfuhki ndg?eki ndgihkfdki lead As a sample derivation, Kenstowicz and Kisseberth give the following:  For the purpose of going through a backwards derivation, 1 will make explicit a few assumptions. First, 1 assume that the Vowel Assimilation rule is really as in (8) below.</Paragraph>
    <Paragraph position="8">  It is a matter of style that the features \[ + syll, + low\] were left out of the feature bundle to the right of tile arrow in Kenstowicz and Kisseberth's formulation of tile rule. Although it is considered good style to do so, the omission of such information makes it unclear how the rule should be applied for recognition. Hence I have included this information in Rule (8). v Another assumption I will make is that the unformulated rule of Right Destressing lends nothing to my argument herd. I assume that the rule when applied in the reverse direction puts stress on the appropriate syllable and nowhere else. s Finally, I will spell out what I consider to be a reasonable interpretation of how to use the rules for recognition. When interpreted backwards, Rule (8) says that a low vowel that is separated by a glottal stop from another vowel with which it agrees in backheSS and rounding might have come from some other low vowel. The syncope rule in (7), when interpreted backwards, says to insert an unstressed vowel before glottal stops. As was pointed out. before, there is no way to deduce whether these rules are obligatory or optional in the reverse direction. Indeed, it is not at all obvious what &amp;quot;obligatory&amp;quot; even means in terms of the assimilation rule t~ken backwards.</Paragraph>
    <Paragraph position="9"> Given these assumptions, we can now produce a reverse derivation for \[na's?ekq.</Paragraph>
    <Paragraph position="10">  as being \[+ low\] to keep the rule simple and still contrast \[el with \[i\]. If they treat \[e\] as \[- low\] and \[a\] as \[+ low\], the assimilation rule becomes messier. This assumption about \[el becomes important later.</Paragraph>
    <Paragraph position="11"> sit seems clear that segmental accounts will fall short when dealing with suprasegmental issues like stress. The goal of this paper is to contrast two different ways of doing segmental phonology. Both would presumably benefit from autosegmental extensions.</Paragraph>
    <Paragraph position="12"> 16 4 applying the reverse of the Assimilation rule to the preceding forms. A box is drawn around the correct underlying form.</Paragraph>
    <Paragraph position="13"> What we end up with are 14 or 15 possible forms - clearly too many. One problem is that the assimilation rule in (6) and (8) was formulated with only generation in mind. If we change it slightly, adding the features \[+back, -round\] to the bundle to the left of the arrow as in (9), +syll +syll --* c~back ? +back c~back ~round --round flr ound (9) we have a better rnle. Now it says that \[e\] and \[~\], when they result from assimilation, come specifically from/a/. This makes the results better. The previous version of the rule just mentions low vowels, of which there are three that we know about: s,a, ~.s When we specify that of these three we always want /a/, we have a more accurate grammar. Now instead of recognizing 14 or 15 possible underlying forms for the word ndg?eki, the grammar only recognizes ten.</Paragraph>
    <Paragraph position="14"> There iis a very simple but subtle point at issue here, havihlg to do with writing reversible rules. The grammar writers knew when they were formulating the assimilation rule that \[e\] and \[3\] were never going to come up as input to the rule because these two vowels do not exist in the underlying representations. They also knew that there were no other rules applying before the assimilation rule which would introduce \[C/\] or \[~\]. Hence they did not need to distinguish between tim various possibilities for low vowels. In short, the grammar writers made use of fairly subtle information to write a rule which was as pared down as possible. Leaving out the features in (9), as Kenstowicz and Kisseberth do, looks elegant, but turns the two-way rule into a one-way rule that works only for generation. This is a case where leaving out some features obscures the content of the rule and prevents one from correctly applying the rule for recognition. In short, this is a case where the rule could have been written in a way that was reversible, or at least more reversible, but in the name of &amp;quot;brevity&amp;quot; or &amp;quot;elegance&amp;quot; it was not.</Paragraph>
    <Paragraph position="15"> The vowels \[e\] and \[~\] also provide complications for the revcrqal of the vowel deletion rule. We have no reason to believe from the data given that the deleted vowel is ever \[~\] or N. IIowever there is not a good way of saying, using standard rule writing techniques, that any vowel that is introduced in the recognition 9As mentioned in an earlier footnote, Kenstowicz and Kisseberth seem t,o treat \[e,\] as \[+ low\].</Paragraph>
    <Paragraph position="16"> must be one of the underlying ones. In ordered sets of rules, there is not lypically a distinction made between the segments that can occur as input to a rule and segments that can only occur as output. One of the unhappy consequences is that \[e\] and \[~\] have the same status with respect to the rules of Tunica as the other, underlying, vowels in the language.</Paragraph>
    <Paragraph position="17"> An even more serious problem revealed by this Tunica example is the inability of the standard generatire rule-writing mechanism to specify the interrelationship between rules. The rules apply based only on strings of characters they get as input, not oll what rules came before. In the case at hand, however, we would like to be able to relate the two rules to one another. What we would really like to be able to say is that when in the course of recognition it becomes necessary to reintroduce the deleted vowel, if there iu an \[e\] on the surface the reintroduced vowel must be \[i\], and if there is an \[~\] the reintroduced vowel must be \[u\] or \[o\]. This is a problem with alpha (assimilation) rvdes. There is no way to say that if there is an Is\] or \[~1 on the surface, then the reverse of the syncope rule must apply, when doing recognition, and, furthermore, that it must apply in such a way that the assimilation rule can then apply (again in reverse) and, lastly, that the reverse of the assimilation rule must then apply. In simpler terms, there is no way to say that if there is an \[~\] (respectively \[~\]) on the surface, then it must be preceded by an underlying/i/(respectively/u/or/o/).</Paragraph>
    <Paragraph position="18"> When dealing with cases of deletion, and mergers in general, it is not generally possible to write a set of rules that maps surface forms unambiguously to a single underlying form. In the ease of the ~hmica vowel deletion, there are occurrences of surface forms in which the phonological rules cannot tell which vowel to reintroduce when doing recognition. There are, however, cases where it is clear which vowel should be reintroduced, e.g., the case above, and in these cases, both the grammar formalism and the individual analysis should be able to express this information. The mechanism of using alphas and betas, for instance in assimilation rules, does not appear to have this expressive capacity.</Paragraph>
    <Paragraph position="19"> The problem could be ameliorated by writing less elegant rules. For instance, the syncope rule in (7) could be written as in (1O).</Paragraph>
    <Paragraph position="21"> This would ensure that the nommderlying vowels \[~\] and \[.~\] would not be introduced when applying the rules in the reverse direction. It still would not be as</Paragraph>
    <Paragraph position="23"> restrictive as one could be using two-level rules.</Paragraph>
    <Paragraph position="24"> One could argue that all one needs to do is use the lexicon to weed out the forms that are wrong. Yet one would not consider suggesting the same thing if a grammar generated too many surface forms, although one could imagine using a surface lexicon as a filter.</Paragraph>
    <Paragraph position="25"> The technique of using the lexicon to weed out the forms that are wrong is a perfectly good efficiency measure, but has no bearing on the question of how well a formalism maps underlying forms to surface forms and vice versa.</Paragraph>
    <Paragraph position="26"> In the rest of this paper I will present and discuss two-level accounts of phonological phenomena described earlier, and show the merits of such an approach. null</Paragraph>
  </Section>
  <Section position="9" start_page="0" end_page="0" type="metho">
    <SectionTitle>
7 Two-level Rules
</SectionTitle>
    <Paragraph position="0"> In the two-level accounts that have been proposed \[Koskenniemi 1983, Karttunen and Wittenburg 1983, Bear 1986, etc.\], there are two alphabets of segments, underlying and surface. There are constraint-rules about which underlying segments may b'e realized as which surface segments, and vice versa, based on context. The rules' contexts are strings of pairs of segments, each underlying segment paired with a surface segment. Deletions and insertions are handled by pairing a segment with a null segment. What is crucial about the rules is that each element of a context is actually a pair of segments, an underlying and a surface segment. The ability to refer to both surface and underlying contexts in a rule allows the rule writer to describe phenomena that are handled with ordered rules in the unilevel approach.</Paragraph>
    <Paragraph position="1"> The other powerful device in two-level phonology is an explicit listing of the two alphabets and the feasible mappings between them. These mappings are simply pairs of segments, one surface segment paired with one underlying segment. This list of feasible pairs typically contains many pairs of identical segments such as (a,a) or (b,b), representing that there are segnmnts that are the same underlyingly as on the surface. The list also contains pairs representing change. For the Tunica example, (a,C/) and (ao) would be in the list, but (a,u) and (i,u) for example would not be.</Paragraph>
    <Paragraph position="2"> The feasible pairs can be thought of as machinery for generating strings of pairs of segments that the rules either accept or reject. An accepted string of segment pairs constitutes a mapping from an underlying form to a surface form and from surface to underlying form.</Paragraph>
  </Section>
  <Section position="10" start_page="0" end_page="0" type="metho">
    <SectionTitle>
8 Rule Ordering
</SectionTitle>
    <Paragraph position="0"> In a paper presented at the 1986 annual meeting of the Linguistic Society of America, Lauri Karttunen proposed this solution for the Klamath data above: 1deg</Paragraph>
    <Paragraph position="2"> The contexts of tile rules should be read as follows.</Paragraph>
    <Paragraph position="3"> Each pair separated by a colon is a lexical segment followed by a surface segment. The equals sign is a place holder used when the rule writer does not want to make any commitment about what some segment must be. So, for instance, 1':= is an underlying /1'/paired with some surface segment, and the rule doesn't care which. Similarly, =:1 is a way of stil~ulating that there is a surface \[I\] in the context, and we don't care, for the purposes of this rule, which underlying segment it corresponds to. The right arrow, ---~, is being used in the way described in Bear \[1986, 1988 a,b\]. For example, Rule (11) should be construed as allowing the pair of segments n:! (underlying n corresponding to surface l) to occur in the rule's environment, while disallowing the pair n:n. Although the right arrow rule is reminiscent of' the arrow in unilevei rules, this interpretation is nondirectional.</Paragraph>
    <Paragraph position="4"> There are two other kinds of constraints to allow one to deal effectively with the asymmetries involved in pairing underlying forms with surface forms. In Bear \[1986, 1988\] the two other kinds of constraints are (1) to allow a pair of segments to occur in a certain context without disallowing the default pair (e.g. n:n in the previous example is a default pair), and (2) to disallow a pair in some context without allowing some other pair. For example, the rule types in (14) and (15) are allowed.</Paragraph>
    <Paragraph position="5"> a:b allowed here: a _ fl (14) a:b disallowed here: a _ fl (15) In Koskenniemi \[1983, 1984\] tile constraints are slightly different, but have roughly the same functionality. I!1 Koskenniemi's system, one may stipulate that if a lexical segment occurs in some context, then it must correspond to some particular surface segment. One may also stipulate that a certain lexical/surface segment pair may only occur in a certain environment.</Paragraph>
    <Paragraph position="6"> 1degI'm using an amalgamation of notations from Koskenniemi, Karttunen and Wittenburg, and Bear.</Paragraph>
    <Paragraph position="7"> 18 6 Karttunen \[1986\] pointed out that the three rules in (ll), (12), and (13) work correctly to give the right resuits when generating surface forms from underlying forms, and made the point that they do so without recourse to the device of rule ordering. Another point he could have made about these rules which I will make here is that they are just as effective in producing the right underlying forms from surface forms. There is not the problem of multiple intermediate levels of representation, where one is faced with the choice of whether to continue applying \[reversed\] rules or to stop and call the form a result.</Paragraph>
    <Paragraph position="8">  Kules (16) and (17) say that/a/assimilates to the underlying vowel preceding it, with a glottal stop intervening. One other crucial element of the two-level way of doing things is that in addition to rules, a grammar contains a list of feasible segment pairs. For this Tunica case, there presumably would not be a feasible pair/e/:\[e\], nor would there be /~/:\[~\] since \[el and \[3\] do not seem to occur as underlying vowels. Itence the surface Is\] in our example word \[ndg?ekz\] would be forced unambiguously to correspond to an underlying/a/. This is exactly what we want.</Paragraph>
    <Paragraph position="9"> Rule (18) specifies that unstressed vowels are deleted when they occur before a glottal stop. The rule makes clear that only the four vowels i, a, o, and u are deleted, and also that when doing recognition, only those vowels are allowed to be inserted.</Paragraph>
    <Paragraph position="10"> These rules make it clear that the underlying form for \[ndg?ekt\] must be/ndgi?dki/modulo details of the rule of Right Destressing.</Paragraph>
    <Paragraph position="11"> 10 Analysis by Synthesis There is one system for doing computational morphology, specifically for recognizing Turkish, which uses 11 It is a common abbreviatory convention that any pair of idendical segments, e.g., a:a, can be written simply as a single segment, e.g., a. So, in these rules the glottal stop character represents the pair: ?:?.</Paragraph>
    <Paragraph position="12"> unilevel rules \[Hankamer, 1986\]. The system first invokes an ad hoc procedure to find the first heavy syllable of a Turkish word. This substring and perhaps a few carefully constructed variants of it are consid- ' ered as possible stems for the word. Next, based on the morphotactic information about the stem found in the lexicon, assuming one of the possible stems is in the lexicon, several possible suffixes are proposed as possible. A set of phonological rules is applied to the hypothesized underlying forms consisting of stem+suffix. Whichever of them results in a string that matches the input surface form is considered to be right. The process is repeated until the entire string is analyzed.</Paragraph>
    <Paragraph position="13"> Since &amp;quot;l~lrkish is exclusively suffixing and has strong phonotactic constraints on what can be a stem, it is possible to write an ad hoc routine to pick the stem out. It remains to be seen how this method of analysis can be made general enough to be applied successfully to other languages. While Hankamer's paper is interesting in its own right, it would be a mistake to construe it ms demonstrating anything very general about reversibility of unilevel rule systems.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML