File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/00/w00-1405_concl.xml
Size: 2,137 bytes
Last Modified: 2025-10-06 13:52:56
<?xml version="1.0" standalone="yes"?> <Paper uid="W00-1405"> <Title>DTD-Driven Bilingual Document Generation ...... ,Arantza Casillas ..... Departamento de Automgtica, Universidad de Alcalgt e-mail : arantza@aut, alcaia, es</Title> <Section position="7" start_page="37" end_page="37" type="concl"> <SectionTitle> 6 Conclusions </SectionTitle> <Paragraph position="0"> We have shown how7 DTDs derived from de* scriptive markup can~be&quot;employed to ease the process of generating bilingual dedicated documentation. On average, one third of the contents of thedocuments can be automatically accounted for. It must also be pointed out that the part being dealt with represents the core structure, lay-out and logical components of the text. The remaining two-thirds of untreated document can still be managed with the aid of sentence-oriented TMBS, filling in the gaps in the ore/all skeleton provided by the target template. Composers may also browse TM3 to retrieve whole blocks for those parts which are not determined by the DTD. One of the clear targets for the future is to extend the coverage of the corpus and to test structural taggers against other document types. A big challenge we face is to develop tools that automatically perform the recognition of documents from less restricted and more open text types. However, we are not sure of the extent of the practicality of such an approach. An alternative direction we are presently considering is to establish a collection of pre-defined document types, which would be validated by the institutional writers themselves. It is a process currently being implemented in the Basque administration to define docmnent models for writers and translators to follow. What we have demonstrated is that paired DTDs, complemented with rich language resources of the kind defined in this paper, allow for the design of optimal editing environments which would combine both document composition and translation as one single process. All the resourcesneeded (DTDs.and TMs) .</Paragraph> <Paragraph position="1"> can be induced from an aligned corpus.</Paragraph> </Section> class="xml-element"></Paper>