Prediction of protein interaction hot spots using rough set-based multiple criteria linear programming

Publisher:
Elsevier
Publication Type:
Journal Article
Citation:
Journal of Theoretical Biology, 2011, 269 (1), pp. 174 - 180
Issue Date:
2011-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2013005137OK.pdf376.83 kB
Adobe PDF
Protein-protein interactions are fundamentally important in many biological processes and it is in pressing need to understand the principles of proteinprotein interactions. Mutagenesis studies have found that only a small fraction of surface residues, known as hot spots, are responsible for the physical binding in protein complexes. However, revealing hot spots by mutagenesis experiments are usually time consuming and expensive. In order to complement the experimental efforts, we propose a new computational approach in this paper to predict hot spots. Our method, Rough Set-based Multiple Criteria Linear Programming (RS-MCLP), integrates rough sets theory and multiple criteria linear programming to choose dominant features and computationally predict hot spots. Our approach is benchmarked by a dataset of 904 alanine-mutated residues and the results show that our RS-MCLP method performs better than other methods, e.g., MCLP, Decision Tree, Bayes Net, and the existing HotSprint database. In addition, we reveal several biological insights based on our analysis. We find that four features (the change of accessible surface area, percentage of the change of accessible surface area, size of a residue, and atomic contacts) are critical in predicting hot spots. Furthermore, we find that three residues (Tyr, Trp, and Phe) are abundant in hot spots through analyzing the distribution of amino acids.
Please use this identifier to cite or link to this item: