Recommending Pathway Genes Using a Compendium of Clustering Solutions
Ng DM, Woehrmann MH, Stuart JM
Abstract

A common approach for identifying pathways from gene expression data is to cluster the genes without using prior information about a pathway, which often identifies only the dominant coexpression groups. Recommender systems are well-suited for using the known genes of a pathway to identify the appropriate experiments for predicting new members. However, existing systems, such as the GeneRecommender, ignore how genes naturally group together within specific experiments. We present a collaborative filtering approach which uses the pattern of how genes cluster together in different experiments to recommend new genes in a pathway. Clusters are first identified within a single experiment series. Informative clusters, in which the user-supplied query genes appear together, are identified. New genes that cluster with the known genes, in a significant fraction of the informative clusters, are recommended. We implemented a prototype of our system and measured its performance on hundreds of pathways. We find that our method performs as well as an established approach while significantly increasing the speed and scalability of searching large datasets. [Supplemental material is available online at sysbio.soe.ucsc.edu/cluegene/psb07.]
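
The three steps described in the abstract (cluster each experiment series, keep the informative clusters, recommend genes that co-cluster with the query) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `recommend_genes`, the `min_query_hits` rule for calling a cluster informative, and the plain `min_fraction` cutoff standing in for a significance test are all assumptions, and the clusterings are taken as precomputed input.

```python
from collections import Counter


def recommend_genes(clusterings, query_genes, min_query_hits=2, min_fraction=0.5):
    """Rank non-query genes by the fraction of informative clusters they appear in.

    clusterings: one entry per experiment series, each a list of gene sets.
    min_query_hits and min_fraction are hypothetical thresholds chosen for
    illustration; the abstract does not specify how either is set.
    """
    query = set(query_genes)

    # Assumption: a cluster is "informative" if it holds at least
    # min_query_hits of the user-supplied query genes.
    informative = [
        set(cluster)
        for clustering in clusterings
        for cluster in clustering
        if len(query & set(cluster)) >= min_query_hits
    ]
    if not informative:
        return []

    # Count, for each non-query gene, how many informative clusters contain it.
    counts = Counter(gene for cluster in informative for gene in cluster - query)

    total = len(informative)
    scored = [
        (gene, hits / total)
        for gene, hits in counts.items()
        if hits / total >= min_fraction
    ]
    # Highest fraction first; ties broken alphabetically for stable output.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Toy data: three experiment series, each already clustered.
clusterings = [
    [{"a", "b", "x"}, {"c", "d"}],
    [{"a", "b", "x", "y"}, {"c"}],
    [{"a", "x"}, {"b", "y", "z"}],
]
print(recommend_genes(clusterings, ["a", "b"]))
```

In this toy input only the first two series contain a cluster holding both query genes, so the third series contributes nothing to the scores. Because each series is clustered once and the query only selects among stored clusters, a lookup of this kind avoids recomputing over the raw expression data per query, which is in keeping with the speed and scalability claim above.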