File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/evalu/96/c96-1017_evalu.xml

Size: 1,365 bytes

Last Modified: 2025-10-06 14:00:23

<?xml version="1.0" standalone="yes"?>
<Paper uid="C96-1017">
  <Title>Arabic Finite-State Morphological Analysis and Generation</Title>
  <Section position="6" start_page="92" end_page="92" type="evalu">
    <SectionTitle>
5 Generation
</SectionTitle>
    <Paragraph position="0"> A single underlying Arabic word may be spelled many ways on the surface, depending on how completely the writer specifies the diacritics. Because the system described above recognizes all possible written forms of a word, with varying degrees of diacritical marking, it also generates all the possible surface forms of a word, which may be less than useful in many applications. Typically, a user wants to see only the fully voweled form during generation.</Paragraph>
    <Paragraph position="1"> The Arabic rules have now been modified to work in two steps, first to generate the fully voweled form, and then to generate the various partially voweled forms and the unvoweled form.</Paragraph>
    <Paragraph position="2"> Where desired, the lexicon fst can be composed with only the upper set of rules to make a lexical transducer that generates (and recognizes) only fully-voweled surface forms. For general recognition, both sets of rules, as shown in Figure 9, are composed. The result is equivalent to the original lexical transducer described in Figure 7.</Paragraph>
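    <Paragraph position="3"> The two-step arrangement can be illustrated with a minimal Python sketch. It is not the system's rule set: the romanized form "kataba", the treatment of the letters a, i, u as diacritics, and the function names fully_voweled, relax_diacritics and generate are all hypothetical, chosen only to show how applying the first step alone yields one form while applying both steps yields every written variant.

```python
from itertools import product

# Assumption: in this toy romanization, short-vowel diacritics are written a, i, u.
DIACRITICS = set("aiu")


def fully_voweled(lexical_form):
    """Step 1 (stand-in): return the fully voweled surface form.

    The real first rule set is not reproduced here; this toy version
    assumes the lexical form is already its own fully voweled spelling.
    """
    return lexical_form


def relax_diacritics(voweled):
    """Step 2: optionally drop each diacritic, yielding every written variant."""
    # Each diacritic may be kept or omitted; every other symbol must be kept.
    choices = [(ch, "") if ch in DIACRITICS else (ch,) for ch in voweled]
    return {"".join(combo) for combo in product(*choices)}


def generate(lexical_form, fully_voweled_only=True):
    """Apply step 1 alone, or step 1 followed by step 2."""
    voweled = fully_voweled(lexical_form)
    if fully_voweled_only:
        return {voweled}
    return relax_diacritics(voweled)
```

Under these assumptions, generate("kataba") returns only the fully voweled form, while generate("kataba", fully_voweled_only=False) also returns the partially voweled spellings and the unvoweled "ktb", mirroring the choice between composing one rule set or both.</Paragraph>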
  </Section>
</Paper>