File Information
File: 05-lr/acl_arc_1_sum/cleansed_text/xml_by_section/relat/03/w03-1306_relat.xml
Size: 1,153 bytes
Last Modified: 2025-10-06 14:15:41
<?xml version="1.0" standalone="yes"?> <Paper uid="W03-1306"> <Title>Boosting Precision and Recall of Dictionary-Based Protein Name Recognition</Title> <Section position="6" start_page="0" end_page="0" type="relat"> <SectionTitle> 6 Related Work </SectionTitle> <Paragraph position="0"> Kazama et al. (2002) reported an F-measure of 56.5% on the GENIA corpus (Version 1.1) using Support Vector Machines. Collier et al. (2001) reported an F-measure of 75.9% evaluated on 100 MEDLINE abstracts using a Hidden Markov Model.</Paragraph> <Paragraph position="1"> These research efforts are machine learning based and do not provide ID information of recognized terms.</Paragraph> <Paragraph position="2"> Krauthammer et al. (2000) proposed a dictionary-based gene/protein name recognition method. They used BLAST for approximate string matching by mapping sequences of text characters into sequences of nucleotides that can be processed by BLAST.</Paragraph> <Paragraph position="3"> They achieved a recall of 78.8% and a precision of 71.1% by a partial match criterion, which is less strict than our exact match criterion.</Paragraph> </Section> class="xml-element"></Paper>