Integration of Microarray and Textual Data Improves the Prognosis Prediction of Breast, Lung, and Ovarian Cancer Patients

O. Gevaert, S. Van Vooren, B. De Moor


BioI@ESAT-SCD, Dept. Electrical Engineering, Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, Leuven, B-3001, Belgium
Email: olivier.gevaert@esat.kuleuven.be


Pac Symp Biocomput. 2008;:279-290.


Abstract

Microarray data are notoriously noisy such that models predicting clinically rele- vant outcomes often contain many false positive genes. Integration of other data sources can alleviate this problem and enhance gene selection and model building. Probabilistic models provide a natural solution to integrate information by using the prior over model space. We investigated if the use of text information from PUBMED abstracts in the structure prior of a Bayesian network could improve the prediction of the prognosis in cancer. Our results show that prediction of the outcome with the text prior was signi cantly better compared to not using a prior, both on a well known microarray data set and on three independent microarray data sets.


[Full-Text PDF] [PSB Home Page]