File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/evalu/04/w04-0103_evalu.xml

Size: 2,157 bytes

Last Modified: 2025-10-06 13:59:16

<?xml version="1.0" standalone="yes"?>
<Paper uid="W04-0103">
  <Title>A Diachronic Approach for Schwa Deletion in Indo Aryan Languages</Title>
  <Section position="7" start_page="3" end_page="3" type="evalu">
    <SectionTitle>
6 Experimental Results and Discussions
</SectionTitle>
    <Paragraph position="0"> The algorithm was implemented for Bengali and Hindi and tested on a set of words. Table 2 summarizes the results for Hindi (tested on the words in a pocket dictionary (Hindi-Bangla-English, 2001)). The algorithm for Bengali was tested on 1000 randomly selected words from a corpus and found to be around 85% accurate.</Paragraph>
    <Paragraph position="1"> Some of the important features of the algorithm are as follows.</Paragraph>
    <Paragraph position="2"> * Efficiency: The algorithm runs in linear time on the input word length. It scans the whole word just twice. Thus, the hidden constant is also very small.</Paragraph>
    <Paragraph position="3"> * Polymorphemic Words: The algorithm can handle polymorphemic words, if the morphological information about the word is provided. This is because schwa deletion is not carried across morpheme boundaries.</Paragraph>
    <Paragraph position="4"> Morphological analyzer for Hindi and Bengali were implemented and integrated with the algorithm. For Hindi, the results were nearly perfect (99.89%) Exceptions: For Hindi there was hardly any exception to the algorithm. For Bengali, the types of words that were incorrectly processed by the algorithm include a class of very frequently used, disyllabic modifier adjectives, certain suffixes, borrowed words from Sanskrit and compound words. In Bengali, the schwa which is retained (as opposed to the predictions by the algorithm) are pronounced as /o/ and not as / o /. Since, /o/ is not a central vowel, deletion of /o/ is marked as compared to deletion of / o / which is unmarked. Transformation of schwa to some non-neutral vowel in Hindi is unknown and therefore, the algorithm works perfectly for Hindi.</Paragraph>
    <Paragraph position="5">  deletion. The results are for individual words. MA stands for Morphological Analysis</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML