File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/concl/01/w01-1312_concl.xml
Size: 1,902 bytes
Last Modified: 2025-10-06 13:53:07
<?xml version="1.0" standalone="yes"?> <Paper uid="W01-1312"> <Title>A Multilingual Approach to Annotating and Extracting Temporal Information</Title> <Section position="8" start_page="4" end_page="4" type="concl"> <SectionTitle> 7 Conclusion </SectionTitle> <Paragraph position="0"> In the future, we hope to extend our English annotation guidelines into a set of multilingual annotation guidelines, which would include language-specific supplements specifying examples, tokenization rules, and rules for determining tag extents. To support development of such guidelines, we expect to develop large keyword-in-context concordances, and would like to use the time-tagger system as a tool in that effort. Our approach would be (1) to run the tagger over the desired text corpora; (2) to run the concordance creation utility over the annotated version of the same corpora, using not only TIMEX2 tags but also lexical trigger words as input criteria; and (3) to partition the output of the creation utility into entries that are tagged as temporal expressions and entries that are not so tagged. We can then review the untagged entries to discover classes of cases that are not yet covered by the tagger (and hence, possibly not yet covered by the guidelines), and we can review the tagged entries to discover any spuriously tagged cases that may correspond to guidelines that need to be tightened up.</Paragraph> <Paragraph position="1"> We also expect to create and distribute multilingual corpora annotated according to these guidelines. Initial feedback from machine translation system grammar writers (Levin 2000) indicates that the guidelines were found to be useful in extending an existing interlingua for machine translation. For the existing English annotations, we are currently carrying out inter-annotator agreement studies of the work of the 6 annotators.</Paragraph> </Section> </Paper>