File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/abstr/03/w03-1309_abstr.xml
Size: 940 bytes
Last Modified: 2025-10-06 13:43:14
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-1309"> <Title>Protein Name Tagging for Biomedical Annotation in Text</Title> <Section position="1" start_page="0" end_page="0" type="abstr"> <SectionTitle> Abstract </SectionTitle> <Paragraph position="0"> We explore the use of morphological analysis as preprocessing for protein name tagging. Our method finds protein names by chunking based on a morpheme, the smallest unit determined by the morphological analysis. This helps to recognize the exact boundaries of protein names.</Paragraph> <Paragraph position="1"> Moreover, our morphological analyzer can deal with compounds. This offers a simple way to adapt name descriptions from biomedical resources for language processing. Using GENIA corpus 3.01, our method attains f-score of 70 points for protein molecule names, and 75 points for protein names including molecules, families and domains.</Paragraph> </Section> class="xml-element"></Paper>