File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/94/c94-1021_metho.xml

Size: 12,929 bytes

Last Modified: 2025-10-06 14:13:37

<?xml version="1.0" standalone="yes"?>
<Paper uid="C94-1021">
  <Title>INTERLANGUAGE SIGNS AND LEXICAL TRANSFER ERRORS</Title>
  <Section position="1" start_page="0" end_page="0" type="metho">
    <SectionTitle>
INTERLANGUAGE SIGNS AND LEXICAL TRANSFER ERRORS
</SectionTitle>
    <Paragraph position="0"/>
  </Section>
  <Section position="2" start_page="0" end_page="134" type="metho">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> A theory of interlanguage (IL) lexicons is outlined, with emphasis on IL lexical entries, based on the HPSG notion of lexical sign. This theory accounts for idiosyncratic or lexical transfer of syntactic subcategorisation and idioms from the first language to the IL. It also accounts for developmental stages in IL lexical grammar, and grammatical variation in the use of the same lexical item. The theory offers a tool for robust parsing of lexical transfer errors and diagnosis of such errors.</Paragraph>
    <Paragraph position="1"> Introduction ~ Computational error diagnosis of second language (L2) input should be based on L2 acquisition theory. In this paper, I outline a theory of second language lexical items which can be exploited computationally for robust parsing and error diagnosis. L2 is interpreted as interlangnage (IL) in the sense of Selinker (1972). A unification-based lexicalist view of grammar is assumed (HPSG theory). The paper has the following outline: first lexical transfer is presented, then interlanguage lexical entries are discussed, and finally some problems concerned with robust parsing and error diagnosis based on such lexical entries are pointed out.</Paragraph>
    <Paragraph position="2"> Lexleal transfer A theory of IL should account for lexical transfer from the first language (L1). By lexical transfer I mean that idiosyncratic properties of L1 lexical items are projected onto the corresponding target language (Lt) lexical items. By 'corresponding' I mean translationally related. I will consider two types of lexieal transfer; translational transfer of idiomatic expressions, and transfer of L1 subcategorisation frames. Translational transfer of L1 idiomatic expressions is exemplified in  (1), with translational transfer from French (Cf. Catt (1988)).</Paragraph>
    <Paragraph position="3"> (1) *My friend has hunger.</Paragraph>
    <Paragraph position="4">  Mon ami a faim.</Paragraph>
    <Paragraph position="5"> In unification-based grammatical frameworks, like HPSG, idiomatic expressions like avoir faim can be formalised as special cases of subcategorisation (for an LFG-style account of idioms, see Dyvik (1990)) 1, and can thus be covered by the account I give of this type of transfer, to which we will now turn. The examples in (2)-(3) illustrate negative lexical transfer from Spanish in Norwegian interlanguage.</Paragraph>
    <Paragraph position="6"> (2) *Jeg kunne ikke svare til Per.</Paragraph>
    <Paragraph position="7"> I could not answer to Per.</Paragraph>
    <Paragraph position="8"> (3) No podfa responder a Per.</Paragraph>
    <Paragraph position="9"> Not could- 1 sg answer to Per.</Paragraph>
    <Paragraph position="10"> 'I could not answer Per'.</Paragraph>
    <Paragraph position="11"> Tile Norwegian verb svare subcategorises for an object NP, while the Spanish verb with tile same meaning, responder, subcategorises for a PP headed by tile preposition a (&amp;quot;to&amp;quot;). The Spanish subcategorisation frame thus admits the erroneous VP (w.r.t. Norwegian grammar) in (2). i assume that 1L lexical items are linked to corresponding L1 items by structt~resharing. Thus subcategorisation information from L1 lexical items can be used in generating interlanguage strings, and lexical transfer of the kind illustrated in (2) can be accounted for. I further make the idealisation that LI word forms are not used as &amp;quot;terminal vocabulary&amp;quot; in interlanguage strings.</Paragraph>
    <Paragraph position="12"> Lexical entries in IL How can these ideas be formalised? We will first consider the format which will be used for representing L1 lexical entries, that of 1 Althot,gh it is not as straightforward as ordinary subcategorisation, one needs e.g. to distinguish between &amp;quot;pseudo-idioms&amp;quot; like avoirfaim and real idioms like kick the bucket, but a discussion of this topic is not possible within this limited format.</Paragraph>
    <Paragraph position="13">  Pollard and Sag (1987), and then outline interlanguage lexical entries. Example (4) illustrates an underspecified lexical entry for the transitive Norwegian verb svare (&amp;quot;answer&amp;quot;).  IIl HPSG, syntactic categories are signs and are represented as attribute-value matrices, with three features, PHON(OLOGY) 2, SYN(TAX) and SEM(ANTICS), which (with the exception of PHON) can have feature structures as values. The path SYNILOCISUBCAT takes a list of categories as value. The leftmost element in this list corresponds to the lexical bead's most oblique complement (in (4) the object NP), the rightmost to the least oblique complement, (the subject in (4)). The feature SEM in this simple example is specified for the semantic relation expressed by the verb, as well as the roles the verb selects. The categories in the subcategorisation list are signs, for which the labels in (4) are a shorthand notation. The indices in the subcategorisation list indicate that the complement signs' SEM values are to bind the variables with which they are coindexed, and which are associated with the semantic roles in the relation. The lexical entry for the Spanish verb corresponding to svare, responder, is illustrated below:  (5) -i~HON res ~onder - DA,</Paragraph>
  </Section>
  <Section position="3" start_page="134" end_page="136" type="metho">
    <SectionTitle>
SYNILOC HEAD LLEX
SUBCAT &lt;PP\[\]\], NP~&gt;
\['RELN AN~\]WERII
SEMICONT I ANSwEREP-
_ \[_ANSWERED
</SectionTitle>
    <Paragraph position="0"> The signs in (4) and (5) are rather similar, with the exception of the first elements in the</Paragraph>
    <Paragraph position="2"> The intuition is that whereas an L1 sign (in the sense of Saussurean sign 4) has two sides, a concept side (corresponds to the feature SEM in I IPSG) and an expression side (here called PIION), an IL sign has three sides: a common 2 I follow Pollard and Sag (1987) and, for convenience, represent the valuc of PllON orthographically.</Paragraph>
    <Paragraph position="3">  3 Spanish direct objects arc NPs whcn they refer to non-human entities, while objects which refer to humans must be expressed as PPs headed by the prel)osition a (&amp;quot;to&amp;quot;). It ,night appear to be a problem flint NP and PP signs arc of the same scmantic type, but I follow Dyvik (1980) and LOdrup (1989) and call prelx)Sitional phrases which have their semantic role assigned hy all external head, nominal; as opposed to modifier (adjunct) PPs, which exprcss their own semantic roles. Having the same semantic type, &amp;quot;ht,,nan&amp;quot; direct object PPs can be derived from NP objects by a lexical rule.</Paragraph>
    <Paragraph position="4"> 4 Cf. de Saussure (1915).</Paragraph>
    <Paragraph position="5">  concept side, a L1 expression side and an Lt (IL) expression side (cf. (6)).</Paragraph>
    <Paragraph position="6"> Let us now return to the HPSG format, where it is possible to represent an IL entry and its corresponding L1 entry as a bilingual sign, similar to the concept used in machine translation (cf. Tsujii and Fujita (1991)).</Paragraph>
    <Paragraph position="7"> Interlanguage lexical entries can have the form which is visualised in (7)-(8), which represents two different alternatives and two different steps in the second language acquisition process:</Paragraph>
    <Paragraph position="9"> In this first alternative the expression side (PHON) of the IL entry is connected with its corresponding L1 entry. In this way lexical transfer from the L1 (cf. example (2)) is accounted for. The assumption is that the L1 lexical entry is basic at this stage, and the IL PHON is attached to it.</Paragraph>
    <Paragraph position="10"> In a later developmental stage, where the syntactic properties of the IL entry are different from that of the corresponding L1 entry, the SYNILOCISUBCAT path of the IL entry (abbreviated below to SUBCAT) is given its own distinct value. This is illustrated in figure (8):  The lexical items are still linked because they have the same meaning.</Paragraph>
    <Paragraph position="11"> The above account of lexical entries allows us to implement different stages in the development of an interlanguage lexicon. What it does not do, is to account for linguistic variability, which in L2 acquisition research generally is considered to be a property of L2 (cf. e.g. Larsen-Freeman and Long (1991)).In the case of lexical entries like (7) and (8), variability means that L2 users sometimes will use this item in accordance with Lt grammar, and sometimes not. But if we imagine a combination of (7) and (8), with a disjunctive  Wh,'tt we have just discussed also illustrates a significant difference between first and second language acquisition. As Odlin (1989) points out: &amp;quot; there is little reason to believe that the acquisition of lexical semantics in second language is a close recapitulation of all earlier development processes&amp;quot;. Ilere we have an explication of this. Acquisition of lexical items is understood as associating Lt expressions with LI concepts, either in a one-to-one relationship when L1 and Lt concepts are synonymous, or in one-to many relatkms when LI and IA concepts partially overlap 5.</Paragraph>
    <Paragraph position="12"> Future work The prospect of the present approach to IL lexical items, lies in the possibility which a formalisation in a lexicalist, unification-based grammatical fi'amework like HPSG gives for computational implementation, for purposes like anticipatory robust parsing and error diagnosis.</Paragraph>
    <Paragraph position="13"> 5 In cases where an LI lexical item not even partially corresponds to any L1 concept (e.g Japanese bushido, &amp;quot;tile wily of the samurai&amp;quot;, if Japanese is Lt and English LI) the meaning of this item carl still be paraphrased by means of L1 concepts.</Paragraph>
    <Paragraph position="14">  The theory itself can be used for such purposes without much revision: the most important one is that in the bilingual signs LI attribute-value matrices must be replaced with L1 ones. A system for robust parsing and error diagnosis of lexical transfer errors needs Lt subcat values for determining whether sentences have the right complements or not, and L1 simple (non-disjunctive) subcat values for making hypotheses about possible erroneous complements.</Paragraph>
    <Paragraph position="15"> My idea is to exploit the relation between Lt and L1 subcat values in a chart parser, so if a parse fails because of a mismatch between a complement in the input string and the Lt complement needed by a lexical head, the chart can be modified such that a complement licensed by the L1 subcat specification is introduced into the chart as an alternative to the incompatible complement specification in the Lt subcat list.</Paragraph>
    <Paragraph position="16"> This is not unproblematic, however, because even in successful chart parses, many hypotheses fail. Lexical entries with ahernative subcategorisation frames (e.g. SUBCAT &lt;X,(Y),(Z)&gt;, where Y and Z are optional) are common. The longer tim sentence, and the more lexical heads, the larger is the number of hypotheses which will fail. In the case of erroneous strings, how can a system decide which hypotheses to modify, in order to accept such strings? Modifying all edges which fail will even in simple sentences soon lead to an explosion of new edges. Deciding which error hypotheses are the most promising is a central computational question to which future work nmst be dedicated.</Paragraph>
    <Paragraph position="17"> Conclusion A theory of IL signs, which accounts for lexical transfer, has been presented. Transfer has been a controversial subject in second language acquisition (SLA) research, and its importance as a property of L2s has been evaluated differently through the history of SLA research. Scientists have disagreed whether properties of L2 production can be attributed to transfer or other factors, such as universal developmental sequences (cf. e.g.</Paragraph>
    <Paragraph position="18"> Meisel et al. (1981)). In my opinion lexical transfer is an interesting aspect of this discussion. As it is associated with individual lexieal items it has a stronger case that more general types of transfer (e.g. transfer of word order). When faced with an error where syntactic or semantic properties of an IL lexical item diverge from the standard of Lt, but are strikingly similar to properties of the corresponding L1 lexical item, lexical transfer is a likely explanation.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML