<?xml version="1.0" standalone="yes"?>
<Paper uid="W03-1309">
  <Title>Protein Name Tagging for Biomedical Annotation in Text</Title>
  <Section position="1" start_page="0" end_page="0" type="abstr">
    <SectionTitle>
Abstract
</SectionTitle>
    <Paragraph position="0"> We explore the use of morphological analysis as preprocessing for protein name tagging. Our method finds protein names by chunking over morphemes, the smallest units determined by the morphological analysis. This helps to recognize the exact boundaries of protein names.</Paragraph>
    <Paragraph position="1"> Moreover, our morphological analyzer can deal with compounds. This offers a simple way to adapt name descriptions from biomedical resources for language processing. On the GENIA corpus 3.01, our method attains an F-score of 70 points for protein molecule names and 75 points for protein names including molecules, families, and domains.</Paragraph>
  </Section>
</Paper>