Evaluation of Lexical Methods for Detecting Relationships Between Concepts from Multiple Ontologies

Johnson HL, Cohen KB, Baumgartner WA Jr, Lu Z, Bada M, Kester T, Kim H, Hunter L

Center for Computational Pharmacology, University of Colorado School of Medicine

Pac Symp Biocomput. 2006;:28-39.


Abstract

We used exact term matching, stemming, and inclusion of synonyms, implemented via the Lucene information retrieval library, to discover relationships between the Gene Ontology and three other OBO ontologies: ChEBI, Cell Type, and BRENDA Tissue. Proposed relationships were evaluated by domain experts. We discovered 91,385 relationships between the ontologies. Correctness varied widely across the matching methods. Based on these results, we recommend careful evaluation of all matching strategies, including exact string matching, before use. The full set of relationships is available at compbio.uchsc.edu/dependencies.
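
The three matching strategies named above can be illustrated with a minimal sketch. It does not use Lucene and is not the authors' implementation. It assumes that a relationship is proposed whenever a term label from the other ontology appears as a contiguous token sequence inside a Gene Ontology term name. The function names, the crude plural-stripping stemmer, and the identifiers and terms in the toy data are all hypothetical and chosen only for illustration.

```python
import re

def tokenize(text):
    # Lowercase and split on anything that is not a letter or digit.
    return re.findall(r"[a-z0-9]+", text.lower())

def toy_stem(token):
    # Hypothetical stand-in for a real stemmer: strips one plural "s".
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token

def contains_phrase(haystack, needle):
    # True if the token list `needle` occurs contiguously in `haystack`.
    n = len(needle)
    return n > 0 and any(
        haystack[i:i + n] == needle for i in range(len(haystack) - n + 1)
    )

def find_matches(go_terms, other_terms, use_stemming=False, use_synonyms=False):
    # go_terms: {id: name}; other_terms: {id: {"name": str, "synonyms": [str]}}
    def normalize(text):
        tokens = tokenize(text)
        return [toy_stem(t) for t in tokens] if use_stemming else tokens

    matches = set()
    for go_id, go_name in go_terms.items():
        go_tokens = normalize(go_name)
        for other_id, entry in other_terms.items():
            labels = [entry["name"]]
            if use_synonyms:
                labels += entry.get("synonyms", [])
            if any(contains_phrase(go_tokens, normalize(lab)) for lab in labels):
                matches.add((go_id, other_id))
    return matches

# Toy data with made-up identifiers (not real ontology entries).
go_terms = {
    "GO:a": "T-cell activation",
    "GO:b": "neuron differentiation",
    "GO:c": "T lymphocyte proliferation",
}
other_terms = {
    "X:1": {"name": "T cell", "synonyms": ["T lymphocyte"]},
    "X:2": {"name": "neurons", "synonyms": []},
}

exact = find_matches(go_terms, other_terms)
stemmed = find_matches(go_terms, other_terms, use_stemming=True)
with_synonyms = find_matches(go_terms, other_terms, use_synonyms=True)
```

On this toy data, stemming and synonym inclusion each propose one pair that exact matching misses. Looser strategies propose more candidates, which is why each strategy's output needs to be checked for correctness separately.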

