Bootstrapping the Recognition and Anaphoric Linking of Named Entities in Drosophila Articles

Andreas Vlachos, Caroline Gasperin, Ian Lewin, Ted Briscoe

Computer Laboratory,University of Cambridge,15 JJ Thomson Avenue, CB3 0FD
E-mail: FirstName.LastName@cl.cam.ac.uk


Pac Symp Biocomput. 2006;:100-111.


Abstract

This paper demonstrates how Drosophila gene name recognition and anaphoric linking of gene names and their products can be achieved using existing information in FlyBase and the Sequence Ontology. Extending an extant approach to gene name recognition we achieved a F-score of 0.8559, and we report a preliminary experiment using a baseline anaphora resolution algorithm. We also present guidelines for annotation of gene mentions in texts and outline how the resulting system is used to aid FlyBase curation.


[Full-Text PDF] [PSB Home Page]