File Information

File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/03/w03-1313_concl.xml

Size: 1,464 bytes

Last Modified: 2025-10-06 13:53:47

<?xml version="1.0" standalone="yes"?>
<Paper uid="W03-1313">
  <Title>Technology Corporation</Title>
  <Section position="7" start_page="0" end_page="0" type="concl">
    <SectionTitle>
6 Conclusions
</SectionTitle>
    <Paragraph position="0"> The paper proposed an XML paramterisation of TEI P4 developed for linguistically annotated biomedical corpora, and applied it to the GENIA corpus.</Paragraph>
    <Paragraph position="1"> The conversion from the Genia Project Markup Language to this encoding has been implemented in XSLT and both the TEI-conformant parametrisation (TEI extension file and one-file DTD) and the XSLT stylesheets are, together with a report documenting them, available at http://nl.ijs.si/et/genia/, while the GENIA corpus is freely available from http://wwwtsujii.is.s.u-tokyo.ac.jp/GENIA/. null The paper gave a survey of the TEI modules that can be useful for encoding a wide variety of linguistically annotated corpora. This contribution, it is hoped, can thus serve as a blueprint for parametrising TEI for diverse corpus resources.</Paragraph>
    <Paragraph position="2"> Further work involves the inclusion of other knowledge sources into the corpus, say of Medical Subject Headings (MeSH), Unified Medical Language System (UMLS), International Classification of Disease (ICD), etc. The place of these annotations in the corpus will have to be considered, and their linking to the existing information determined.</Paragraph>
  </Section>
class="xml-element"></Paper>
Download Original XML