A Deep Learning Approach for Cancer Detection and Relevant Gene Identification

Padideh Danaee1, Reza Ghaeini1, David Hendrix1,2


1School of Electrical Engineering and Computer Science, Oregon State University
2Department of Biochemistry and Biophysics, Oregon State University
Email: danaeep@oregonstate.edu, ghaeinim@oregonstate.edu, david.hendrix@oregonstate.edu

Pacific Symposium on Biocomputing 22:219-229(2017)

© 2017 World Scientific
Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution (CC BY) 4.0 License.


Abstract

Cancer detection from gene expression data continues to pose a challenge due to the high dimensionality and complexity of these data. After decades of research there is still uncertainty in the clinical diagnosis of cancer and the identification of tumor-specific markers. Here we present a deep learning approach to cancer detection, and to the identification of genes critical for the diagnosis of breast cancer. First, we used Stacked Denoising Autoencoder (SDAE) to deeply extract functional features from high dimensional gene expression profiles. Next, we evaluated the performance of the extracted representation through supervised classification models to verify the usefulness of the new features in cancer detection. Lastly, we identified a set of highly interactive genes by analyzing the SDAE connectivity matrices. Our results and analysis illustrate that these highly interactive genes could be useful cancer biomarkers for the detection of breast cancer that deserve further studies.


[Full-Text PDF] [PSB Home Page]