File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/00/j00-1005_concl.xml

Size: 2,411 bytes

Last Modified: 2025-10-06 13:52:50

<?xml version="1.0" standalone="yes"?>
<Paper uid="J00-1005">
  <Title>Treatment of Epsilon Moves in Subset Construction</Title>
  <Section position="6" start_page="72" end_page="74" type="concl">
    <SectionTitle>
5. Conclusion
</SectionTitle>
    <Paragraph position="0"> We have discussed a number of variants of the subset construction algorithm for determinizing finite automata containing c-moves. The experiments support the following conclusions: The integrated variants per subset and per state work much better for automata containing a large number of c-moves. The per subset variant tends to improve upon the per state algorithm if the number of E-moves increases even further.</Paragraph>
    <Paragraph position="1">  Computational Linguistics Volume 26, Number 1 Table 1 The automata generated by approximation algorithms. The table lists the number of states, transitions, and jumps of the input automaton, and the number of states of the determinized machine using the erred, efree t, and the efree t; variants, respectively.  our experiments, the per graph t is the algorithm of choice for automata containing few c-moves, because it is faster than the other algorithms, and because it produces smaller automata than the per graph s and per graph s,a variants.</Paragraph>
    <Paragraph position="2"> * The per graph t,c variant is an interesting alternative in that it produces the smallest results. This variant should be used if the input automaton is expected to contain many non-co-accessible states.</Paragraph>
    <Paragraph position="3">  van Noord Epsilon Moves in Subset Construction Automata produced by finite-state approximation techniques tend to contain many c-moves. We found that for these automata the differences in speed between the various algorithms can be enormous. The per subset and per state algorithms are good candidates for this application. We have attempted to characterize the expected efficiency of the various algorithms in terms of the number of jumps and the number of states in the input automaton. It is quite conceivable that other simple properties of the input automaton can be used even more effectively for this purpose. One reviewer suggests using the number of strongly c-connected components (the strongly connected components of the graph of all c-moves) for this purpose. We leave this and other possibilities to a future occasion.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML