XML Viewer - c69-1001

File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/metho/69/c69-1001_metho.xml
Size: 32,175 bytes
Last Modified: 2025-10-06 14:11:04
<?xml version="1.0" standalone="yes"?>
<Paper uid="C69-1001">
  <Title>STRUCTURAL PATTERNS OF CHINESE CHARACTERS</Title>
  <Section position="1" start_page="0" end_page="0" type="metho">
    <SectionTitle>
STRUCTURAL PATTERNS OF CHINESE CHARACTERS
</SectionTitle>
    <Paragraph position="0"/>
    <Section position="1" start_page="0" end_page="0" type="sub_section">
      <SectionTitle>
and
Ryohei Kagaya
</SectionTitle>
      <Paragraph position="0"> Institute for the Study of Languages and Cultures of Asia and Africa Tokyo University of Foreign Studies Chinese characters, as used in Chinese and Japanese for orthography,* bear inherent meanings as well as sound shapes. Apart from these aspects, the graphic patterns of the characters also vary in complex ways and they appear very different over a wide range.</Paragraph>
      <Paragraph position="1"> It is obvious to native users of these characters, however, that the graphic patterns are mostly composed of different but frequently used subunits, and regularity is observed in the structures of character patterns. Quite frequently, a character is clearly composed of more than one character (with some minor modifications in shape).</Paragraph>
      <Paragraph position="2"> We can intuitively identify some commonly used strokes such as vertical and horizontal lines as constituents of characters or their subparts. null These obvious structural regularities have not been studied thoroughly, in spite of accumulated knowledge concerning etymology and historical developments of the characters and interpretation of the sound-meaning association.** This paper describes a way of describing the regularities of the Chinese characters as graphic patterns, without any explicit reference to the sound or meaning. The description may be considered as given in a form of generative grammarl) * The authors are native Japanese, and we are primarily concerned with the Chinese characters used in the contemporary Japanese orthography. There is obvious difference between Chinese and Japanese in the collection of character patterns used, but difference in the structural regularity itself is not apparent and remains to be studied.</Paragraph>
      <Paragraph position="3"> ~:'* See e.g.B. Karlgen's classical work &amp;quot;Word Families in Chinese&amp;quot; (B. M. i~. E.S. Vol. 5, 1934) and Grammata Serica (1957), and a more recent study by A. Todo, Kanji-no Gogenkenkyu (Etymological Studies of Archaic Chinese) (Tokyo: Gakutosha, 1963, in Japanese).</Paragraph>
      <Paragraph position="4"> the patterns, with an important deviation from the concept of genative grammar. Namely, the system of rules given here generates aracter-like patterns, but it does not define the actual set of Chise characters, viz. the accepted vocabulary of either Chinese or panese, but rather the set of patterns each of which could reprent a Chinese character as far as the structural characteristics of ~, pattern are concerned.* The regularity described here is thus )re like phonological regularities of lexical items of a language in syntactic regularities of its sentences. Nevertheless, the ford similarity of this rule system with the transformational theory syntax is rather interesting.</Paragraph>
      <Paragraph position="5"> The abstract representation of a character according to the genative rules make it possible to specify the patterns of essentially Chinese characters completely in terms of elements (strokes) d operators (concatenators and compounders). In other words, can code Chinese characters by use of strokes and operators sed on this framework of graphic theory of Chinese characters.</Paragraph>
      <Paragraph position="6"> ~actical applications are of great import and interest, but these ints are not of our direct concern in this paper.</Paragraph>
      <Paragraph position="7"> This descriptive system was in essence proposed by one of the esent authors several years ago. 2) Some modifications and addi.ns have been made and are still being made, and generation of ~ual character patterns by rules are being tried with use of a digi* computer with an oscilloscope display as the output device and a yboard typewriter as the input. Some results are presented in s paper, and a demonstration is in schedule for the meeting.</Paragraph>
      <Paragraph position="8"> Formation of Units A unit is a separable subpart of a character. In our rule sysm, it is represented by a string of alternating strokes and contenators. The form generated in this way is an underlying form lled i-representation, and it is interpreted into s-representation a conversion process described below in order to obtain the graphpattern. When no element of the string in i-representation is left interpreted, and if the derived s-representation does not violate ,train restrictional criteria performing filtering functions (see fr._aa), the string of alternating strokes and concatenators represents In this analogy to generative grammar, we consider the i-repre~ntation to be formed by concatenation of any alternating strokes ~d operators. Restrictions may be treated by filtering function of ansformational rules that interprete i-representation in terms of representation.</Paragraph>
      <Paragraph position="9"> a simple unit. A unit can represent a character by itself, or it may .be compounded with some other unit(s).</Paragraph>
      <Paragraph position="10"> 1.1. Strokes and Operators A set of strokes is given in the Stroke-table (Table 1). Each stroke is identified by a two-place number called &amp;quot;stroke identifier, &amp;quot; and is defined by a stroke representation pattern. The first numeral of the stroke identifier represents the class of the stroke and the second the variation within the class. For example, the stroke &amp;quot;21&amp;quot; which is the first variational stroke of class-two, is defined by the stroke pattern as shown in Fig. 1.</Paragraph>
      <Paragraph position="12"> Fig. 1 - Stroke representation pattern For each stroke, three functional points (~, , ~,, C/,9 are defined in terms of their x-y coordinate values in the stroke pattern field covering a range of integer values 0-4 for both x and y. A stroke with its three functional points can be represented by the following format (s-representation for the stroke): \[21; 20, 22j 24\].</Paragraph>
      <Paragraph position="13"> In this representation, the first number (21) is the stroke identifier, and the following three sets of numerals represent the x (on the left) and y (on the right) coordinates of the points (~, ~ ~ , and U.J, respectively. null Concatenators are listed in Table 2. A concatenator defines a particular positional interrelation between two strokes in terms of coincidence of a pair of the functional points. The set of strokes is divided into two functionally distinct groups, one for those with odd class numbers and the other for even class numbers, and a concate-</Paragraph>
      <Paragraph position="15"> Table 2 - Concatenators defined in terms of coincidence of the functional points of the preceding and succeeding strokes.</Paragraph>
      <Paragraph position="16"> nator can combine only a pair of strokes of different groups. When a pair of strokes are qualified for concatenation, the pair of strokes are said to have &amp;quot;affinity&amp;quot; between them. For a given concatenator in i-representation, the pertinent pair of strokes with affinity is defined by a general convention of this rule system as the next stroke following the concatenator and the last preceding stroke that does not belong to the same group as the following stroke. The first member of this selected pair shall be called the &amp;quot;predecessor&amp;quot; of the concatenator and the second member the &amp;quot;successor. &amp;quot; For example, in the string in i-representation /21SIIP21C21XIIEII/, the concatenator &amp;quot;C&amp;quot; operates on the fourth stroke 21 (successor) and the second stroke ii (predecessor) skipping the more immediate stroke 21. Similarly~ the last concatenator E concatenates the last stroke ii to 21 rather than another ii.</Paragraph>
      <Paragraph position="17"> I. 2. I- Representation vs. S-Representation A string of alternating strokes and concatenators which shall be called &amp;quot;i-representation&amp;quot; (input representation) can represent an underlying form of a unit. The unit can be actualized as a character shape through executing some shape-adjusting rules and looking up the stroke table that stores the stroke representation patterns for all strokes. For example, the above mentioned string that represents a simple-unit character ~ is actualized as shown in Fig. 2.</Paragraph>
      <Paragraph position="18"> The generated pattern can be represented by giving the x and y coordinate values of the three functional points belonging to all the constituent strokes. For the example above, the stroke positions are represented as: \[21; 00,02,04 11; 00,20,40 21; 40,42,42 21; 20,22,24 11; 02,22,42 11; 04,24,44\].</Paragraph>
      <Paragraph position="19"> Fig. 2 - The pattern for the character</Paragraph>
      <Paragraph position="21"> This shall be called the &amp;quot;s-representation (stroke representation) of the unit, &amp;quot; and it completely specifies the graphic pattern in terms of (abstract) functional interrelations of the constituent strokes, i. 3. Degeneracy and Pseudoconcatenators null More than one stroke can coincide in position as specified in s-representation only when they are connected to each other through special operators, called pseudoconcatenators. The strokes thus interrelated are called degenerate strokes. There are two pseudoconcatenators, one designated by - (hyphen) and the other by ~ (zero).~'.-&amp;quot; A pseudoconcatenator always selects its nearest preceding and the next following strokes as the predecessor and the successor, respectively, and these strokes must belong to the same stroke-class. Any degenerate strokes must be concatenated (or compounded by a superconcatenator, see infr,._._.a) to a stroke of the opposite group at the latter's p, in s-representation.:',-'* A string in i-representation that does not meet these conditions is blocked in generation and thus is rejected as a representation of a unit.</Paragraph>
      <Paragraph position="22"> The pseudoconcatenators allot the same position (in terms of their },t (and often also ~ and u) consequently) coordinates in s-representation to a pair of identical or similar strokes. The degenerate strokes in s-representation are marked for the hyphen or the zero. The order of stroke occurrences is generally preserved in s-representation. Degeneracy of more than two strokes are not allowed. In the actualization process, as discussed later, degeneracy created by the hyphen is resolved and the degenerate strokes ~',-&amp;quot; In earlier reports of our study, we assumed the hyphen and a comma as pseudoconcatenators. The rule system is revised here. ** There are some further restrictions about the kind of strokes to be degenerate and those to be concatenated to degenerate strokes, and also about combination of these. Subclassifieation of strokes in this respect is still to be studied.</Paragraph>
      <Paragraph position="23"> are separated into parallel positions, their spacings being determined by rule (see Fig. 3}.</Paragraph>
      <Paragraph position="25"> acters generated by use of pseudoconcatenators.</Paragraph>
      <Paragraph position="26"> (degenerate vs. resolved) In the case of the zero, the strokes are separated in the same manner, but at the same time a special stroke of the opposite class (horizontal line for class2 degenerate strokes and vertical line for class-1 degenerate strokes) is automatically introduced in s-representation. This additional stroke has an &amp;quot;infinitesimal length, &amp;quot; and this bridges the pertinent two (degenerate) strokes. Where this bridge should be placed along the degenerate strokes is determined, after the unit has been completed in s-representation, according to a preference order that is given by convention of this rule system. The preference order for the selected point on the degenerate strokes is Oh, c~ * and ~ , but if a particular point shows coincidence with any other stroke(s), i. e. , when the point is used as a junction in the pattern, this point is avoided and the point with the next degree of preference is selected for placing the infinitesimal stroke. The infinitesimal stroke becomes &amp;quot;stretched&amp;quot; when the degenerate strokes separate, giving an actual bridging between them.</Paragraph>
      <Paragraph position="27"> The infinitesimal stroke can be placed only at a point where the functional points (of the same kind) of the degenerate strokes coincide.</Paragraph>
      <Paragraph position="28"> The zero can be used repeatedly in the same space between a pair of (degenerate) strokes in i-representation. Each symbol of zero inserts an infinitesimal stroke at the place of the highest preference that remains available. Examples for the use of degenerate strokes and the pseudoeoncatenators are given in Fig. 3. i. 4. The Dummy Concatenator &amp;quot;?&amp;quot; A eoncatenator in general concatenates a stroke with another stroke. We introduce a dummy concatenator &amp;quot;?, &amp;quot; so that we may concatenate a string with another string. The &amp;quot;?&amp;quot; in i-representation marks its immediately preceding stroke as the predecessor of a eoncatenator that remains to be specified later in the string in conjunction with the selected successor stroke. In the string following this dummy concatenator, an extraneous concatenator must be found consecutively following another eoneatenator without a stroke identifier in between, and the second concatenator in the sequence selects the stroke marked previously by &amp;quot;?&amp;quot; as its predecessor stroke. The following stroke serves as the successor for both of the concatenators in pair, thus specifying a junction of two strings. For example, in the ease of a unii represented by /21Cl 7?21SEIIP21TII/ (Fig. 4-e), the pattern /21C17/ (Fig. 4-a) is abutted to the second pattern /21S IIP21TII/ (Fig. 4-b) through the concatenator E operating on the stroke 21 of the former and II of the latter. This operator &amp;quot;?&amp;quot; is convenient to form a unit according to the stroke order in the traditional handwriting.</Paragraph>
      <Paragraph position="29"> a: /21C17/ b: /21SliP21Tll/ c: /21CI7?21SEIIP21TII/ Fig. 4 - Concatenation of substrings by use of &amp;quot;?. &amp;quot; The example above could be generated by a string /21C17EIIS 21P21TII/ if only we disregard the tradition. Sometimes, however, the use of &amp;quot;?&amp;quot; is necessary for generating existing characters. The pattern of Fig. 5, for example, can be transcribed as /21727PEIIT 27/, but there is no way to generate it without using the dummy concatena~:~L% unless we defne a new concatenator filling in the space in the concatenator table with so-to-speak a conjugate concatenator (in this case an E:'.-&amp;quot; that would select }i of the predecessor and cJ of the successor for coincidence). Introduction of these conjugate concatenators is not desirable in consideration of the generalization of the rule system, because it expands the set of i-representations considerably without resulting in any additional acceptable patterns.</Paragraph>
      <Paragraph position="31"> Fig. 5 - The use of &amp;quot;?.</Paragraph>
      <Paragraph position="32"> The particular side of the diagonal in Table 2 is used in favor of the traditional stroke order.</Paragraph>
      <Paragraph position="33"> I. 5. Restrictions on S-Representations null Some restrictions in terms of the generated s-representation have been stated in connection with the degeneracy and the infinitesimal stroke. There are some more restrictions of a general kind given in terms of the derived srepresentation. These restrictions may be interpreted as filtering functions of the transformational process of actualization (see ~ 3).</Paragraph>
      <Paragraph position="34"> One rather obvious restriction is that no strokes of the same class except degenerate ones can share the same set of coordinate values for any members of their functional points, whether they both are of the same kind ( 06 , ~, or ~) or different. The convention of concatenation with the notion of affinity eliminates the possibility of generating two such strokes as a result of immediate succession of these in i-representation. A string, for example, like /IIC21SII/, however, is permissible in i-representation but must be rejected by the criterion stated above.</Paragraph>
      <Paragraph position="35"> Another possible restriction that may be imposed on an s-representation of a unit is in terms of the ratio of the largest dimension of the generated pattern to the number of strokes utilized. A threshold may be set and a pattern with a larger value of this ratio may be rejected, by use of an appropriate definition of length across a unit. This would exclude a long zig-zag of alternating ii and 21, for example, frorn the set of acceptable characters.</Paragraph>
      <Paragraph position="36"> A restriction of a more essential kind is probably in regard to the selection of a particular variation on the basis of contextual redundancy. It may well be the case that this kind of restriction ~is so strong that we can totally omit spccit'ying the variation numbers of the strokes for the input transcription of any character. These points remain to be investigated.</Paragraph>
    </Section>
  </Section>
  <Section position="2" start_page="0" end_page="0" type="metho">
    <SectionTitle>
2. Compounding of Units
</SectionTitle>
    <Paragraph position="0"> More than one unit can be compounded to fom-n a complex unit, which in turn as a unit can be compouiided with ~nother unit. The derivation of a character by a sequence of compounding can be represented in i-representation by recursive use of pairs of parentheses, each surrounding a substring as a unit, A more illustrative representation may be given in a form of tree diagram, where the type of compounding is given by the compounder symbol at each node (see Fig. 6). For a unit to make a subpart of a character, it is</Paragraph>
    <Paragraph position="2"> Fig. 6 - Complex compounding by use of appositional compounders.</Paragraph>
    <Paragraph position="3"> in general necessary to go through a set of transformational rules that adjust the entire shape of the pattern to fit the context, as well as some special rules that makes minor changes in variation numbers of some strokes.</Paragraph>
  </Section>
  <Section position="3" start_page="0" end_page="10" type="metho">
    <SectionTitle>
2. I. Compounders
</SectionTitle>
    <Paragraph position="0"> The compounder &amp;quot;H&amp;quot; can arrange more than one unit in a horizontal row, and the &amp;quot;V&amp;quot; can arrange some vertically. These two compounders form a class and may be called &amp;quot;appositional com- null the appositional compounders. There are many cases where the left subpart (hen) of the H compounding can be regarded as an affected form of a-~ree unit&amp;quot; whose &amp;quot;last stroke&amp;quot; is reduced in shape. Thus stroke 36 in Table 1 is a variation that serves as a reduced form of 32, and the stroke ii becomes 16 in this context. ~-&amp;quot; Fig. 7 gives a typical example.</Paragraph>
    <Paragraph position="1"> As a special case, where the stroke 55 is identified as the last stroke of a unit used as the left constituent unit (he.__nn) in H-compounding, this str~oke undergoes a process of elongation, and the right constituent unit (tsukuri) is placed above the tail of this stroke (see Fig. ((i IX21EI I)V(21C17 ? 4 7CE35))R(21CI7 ) Fig. 8 - Elongation of the last stroke in ny__~o.</Paragraph>
    <Paragraph position="2"> 8). Traditionally, the subparts (radicals) of this sort are called nyo.</Paragraph>
    <Paragraph position="3"> In some cases similar to this, units serving as a subpart of a character cannot be identified'as a transform of any &amp;quot;free unit, &amp;quot; viz. , a unit that can represent a character. Typical examples are those traditionally referred to as tare (the &amp;quot;appendants&amp;quot; or two-side J- F Fig. 9 - Examples of tare Fig. I0 - Example;s of karnak.</Paragraph>
    <Paragraph position="4"> surrounding radicals, see Fig. 9), q'hes,~ units, as well as the elongated unit ny_~o, have opet~ space in which the other unit must be era~:~ This is one of the phenomena that suggest redundancy of specifying a particular variation for a stroke class.</Paragraph>
    <Paragraph position="5"> Ii bedded. Another subclass of units that can embrace other units is called kamae (see Fig. 10).</Paragraph>
    <Paragraph position="6"> These surrounding compounders are all represented by the symbol R in i-representation. The last stroke of the compounding unit (nyo, tare, or kamae) that follows the symbol R in i-representation tells where the preceding unit should be located in s-representation, The third class of compounders consisting of X, C, E, S, and P is provided for cases where a stroke is superposed onto a unit in a special manner given by definition of the particular compounder. Thus in the example given in Fig. ii, the</Paragraph>
    <Paragraph position="8"> vertical stroke 22 across the unit which itself is a V compound of two identical units, leaving the two ends of the vertical stroke sticking out.</Paragraph>
    <Paragraph position="9"> The compounder C concatenates the point of the compounding vertical stroke (typically 21) at the point of the uppermost horizontal stroke, leaving the other end of the compounding stroke sticking out of the lowest (most largevalued) y-coordinate of 's in the compounded unit. The compounder E, S, and P are de-</Paragraph>
    <Paragraph position="11"> Fig. 12 - The stroke 21 with different supePconcatenators.</Paragraph>
    <Paragraph position="12"> fined in a similar manner reflecting the properties of the concatenators~of the same names. Some examples are given in Fig. 12.</Paragraph>
    <Paragraph position="13"> In this class of compounders, which may be called &amp;quot;superconcatenat)ors, &amp;quot; the compounding unit is typically a single stroke constituting a unit by itself. In some cases the succeeding unit is composed of more than one stroke, where only one of them can be desig- null nated as the &amp;quot;major stroke&amp;quot; that determines the manner of compounding. Variations 6 and 7 of all stroke classes and also all strokes in classes 4 and 5, and stroke 13 (Table 1) cannot serve as the major stroke. The major stroke can be degenerate. The superconcatenators act like concatenators in enabling the compounding (major)  nator X with a compounding unit of more than one stroke.</Paragraph>
    <Paragraph position="14"> strokes to be degenerate in the case of C, X, and E (cf. I. 3. ). Thus the rejection of uneoncatenated degenerate strokes has to be performed beyond the minimal unit, when the unit is preceded by a superconcatenator. Examples are given in Fig. 13.</Paragraph>
    <Paragraph position="15"> In the compounding of the third class, the unit to be compounded may be collapsed in size in one dimension treated as though it were a degenerate group of strokes either horizontal or vertical. For example, in the unit (21SIICIIPIIT21) X (21), the compounded unit {21SIICIIPIIT21) could be regarded as a class-I stroke, In this interpretation, it can be said that the superconeatenator in effect acts as a concatenator of the same symbol. In a case like the unit (II) X (21C37S47), the pattern actually can be represented by a single unit IIX21C37S47, simply by removing the parentheses. We may introduce another superconcatenator D, which is defined as a combination of C and E, namely a compounder that superposes a stroke which is &amp;quot;stretched&amp;quot; in such a way that both ends coincide with the two strokes at the extreme positions in the compounded unit. This kind of .compounded patterns can be generated in the rule system stated above by a suceesion of compoundings by use of the supercon- null catenators C and E.</Paragraph>
    <Paragraph position="16"> 2. 2. The Point Unit  The &amp;quot;point&amp;quot; designated by an apo~rophe that follows a unit is an infinitesimal unit compounded to the preceding unit. It shows varied shapes in the actualized pattern, and a &amp;quot;.et of points is distributed in space in different pre~cribed maimers U~ \[~e~ding on the context. Special rules are required for taLdng calve o;L these seemingly varied phenomena, but technical details are still to be worked out. Typical examples are shown in Fig. 14. The examples a~.e transcribed from  left to right as follows: upper: (25)', (25)&amp;quot;, (25)'&amp;quot;, (25) .... , lower: (21S11P21T11)', ((21P11S11P63)X(11))', ((42X36)R(21S11P21T11)): 'l '1' &amp;quot;1&amp;quot; ;J:</Paragraph>
  </Section>
  <Section position="4" start_page="10" end_page="10" type="metho">
    <SectionTitle>
KI N
</SectionTitle>
    <Paragraph position="0"> Fig. 14 - Actualizations of points in accordance with the context and the number of the points.</Paragraph>
  </Section>
  <Section position="5" start_page="10" end_page="14" type="metho">
    <SectionTitle>
3. Actualization
</SectionTitle>
    <Paragraph position="0"> A character is transcribed as a set of units combined through compounders in any depth of complexity. Each constituent unit is transcribed as a string in i-representation placed in parentheses.</Paragraph>
    <Paragraph position="1"> The i-representations of units determine their s-representations, specifying positioning of all occurrences of strokes in a frame of the pattern field. The franle is normalized and placed together according to the specification of the compounder to form a compound unit, and this process of normalization and abutting can be recursively repeated. The set of rules for normalization and stroke reduction (see supra} is thus cyclic in the sense of the cyclicity of phonological rules. The s-representation for a unit after the normalization no longer has the quantized coordinates. In the last stage of actualization of a character. Strokes shapes are called in form the stroke representation pattern into this generalized s-representation.</Paragraph>
    <Paragraph position="2"> 3. I. Stroke Arrangement It may be obvious intuitively that in the actualized form of any  character the constituent strokes are distributed in space somehow evenly. This fact can be accounted for by designing a later part of the actualization process to form a set of stroke distribution rules. As a general principle for this distribution of strokes in space, we may assume a potential field defined in the stroke pattern of each stroke surrounding the actualized shape of the stroke. We then may hypothesize that superposition of the potentials belonging to the distributed strokes in the finally actualized pattern results in a state of equilibrium by attaining the total potential energy minimum. In short, strokes exert repulsive force against each other, and the strokes can translate and be compressed within a given unit frame as long as the topological interconnections are not changed. The  end points ~ and u9 are always rigidly related to the actualized stroke shape, but the midpoint \]_~ can shift along the line defined typically as a straight line connecting (Z and uJ .</Paragraph>
    <Paragraph position="3"> 3. 2. Practical Approximation  A practical approximation for this principle of distribution may be devised as follows. Each stroke has a two-dimensional measure of spacial occupancy for x and y directions, defined in a stroke table. A &amp;quot;size normalization factor&amp;quot; of a unit is defined as the sum of these measures of occupancy of all the constituent strokes. The area which is occupied by each constituent unit in a complex unit is determined by the proportion in terms of the &amp;quot;size factor. &amp;quot; Within a unit, the actualized distribution of constituent strokes is attained in a similar manner, by allowing typically equal spaces between similar strokes in the direction perpendicular to the stroke line. An equally weighted space is allowed at the margin between the border of the frame and the outermost stroke. There are some details of the rules which will not be discussed here.</Paragraph>
    <Paragraph position="4"> Some examples of characters are illustrated in Fig. 15. These were actually generated on an oscilloscope display of a computer by typing in the i-representations. The rules used for this practical approximation of the actualization process arc only preliminary and some character patterns suggest necessary corrections of the program which can be mostly readily done.</Paragraph>
  </Section>
  <Section position="6" start_page="14" end_page="14" type="metho">
    <SectionTitle>
4. Concluding Remarks
</SectionTitle>
    <Paragraph position="0"> Many details are still to be worked out and some are simply not described here for brevity. It is obviously true that the same character can be generated by different i-representations, partly due to different stroke orders and partly due to different selection of variational shapes of strokes. Another so~'t of amb:iguity is possible in  some special cases depending on whether a compounder or a coneatenator is used, as mentioned in 2. I. The use of degeneracy against compounding gives still another sort of ambiguity. Thus for example, the character can be generated either as a simple unit /II~IIX21/ or as a compounded form /(21SI IP21TI \] )X(21)/.</Paragraph>
    <Paragraph position="1"> These sorts of ambiguity have been to a large extent eliminated by some care taken in formulating the rule system, but some of them are interesting and seem to indicate the inherent problems concerning the nature of Chinese characters.</Paragraph>
    <Paragraph position="2"> The system we have given here is concrete and valid in fair details, but it is still subject to even major changes for impr'ovement. The essential principle, however, seems to us convincingly effective for description of the graphical structures of the characters.</Paragraph>
    <Paragraph position="3"> Fig. 15 - Oscilloscope display examples of computer-generated Chinese characters. All characters were generated by rule out of the input trpresentation type in through ordinary keyboard.</Paragraph>
  </Section>
  <Section position="7" start_page="14" end_page="14" type="metho">
    <SectionTitle>
SUMMARY
</SectionTitle>
    <Paragraph position="0"> A system is proposed for specifying any one of the accepted patterns of Chinese characters, or similar patterns that could be used as Chinese characters. The system may be considered as a generative grammar of the set of character patterns. A unit is formed by concatenating strokes by operators. A set of strokes is given in a stroke table where three abstract functional points ~ , ,,u, t~ , as well as a typical actualization form, are defined for each stroke. Concatenators and pseudoconcatenators are provided, each of them defining a particular positional interrelation between two strokes in terms of coincidence of the functional points. The set of strokes is divided into two functionally distinct groups, and a concatenator can combine only a pair of strokes of different groups, a pseudoconcatenator only those of the same group. Thus a string of alternating strokes and operators, which may be called the &amp;quot;i-representation, &amp;quot; determines an underlying form of a unit, which can be actualized as a character shape through looking up the stroke table and executing some shape-adjusting rules. On the level of i-representation, more than one unit can be combined to form a more complex character pattern, by use of one or more of compounding operators that specify &amp;quot;transformational processes&amp;quot; to be executed before the shape adjustment process. Preliminary results of an on-line computer experiment will be shown where the actualizations of characters are displayed on an oscilloscope when characters are specified by typing in the i-representations.</Paragraph>
    <Paragraph position="1"> I.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML