SAMIE: statistical algorithm for modeling interaction energies

Benos PV, Lapedes AS, Fields DS, Stormo GD

Dept. of Genetics, Campus Box 8232, Medical School, Washington University, 4566 Scott Ave., St. Louis, MO 63110, USA. benos@genetics.wustl.edu

Pac Symp Biocomput. 2001;:115-26.


Abstract

We are investigating the rules that govern protein-DNA interactions, using a statistical mechanics based formalism that is related to the Boltzmann Machine of the neural net literature. Our approach is data-driven, in which probabilistic algorithms are used to model protein-DNA interactions, given SELEX and/or phage data as input. In the current report, we trained the network using SELEX data, under the "one-to-one" model of interactions (i.e. one amino acid contacts one base). The trained network was able to successfully identify the wild-type binding sites of EGR and MIG protein families. The predictions using our method are the same or better than that of methods existing in the literature. However our methodology offers the potential to capitalise in quantitative detail, as well as to be used to explore more general model of interactions, given availability of data.


[Full-Text PDF] [PSB Home Page]