Modeling protein interacting groups by quasi-bicliques: Complexity, algorithm, and application

Publication Type:
Journal Article
Citation:
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2010, 7 (2), pp. 354 - 364
Issue Date:
2010-03-19
Metrics:
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2010006982OK.pdf2.58 MB
Adobe PDF
Protein-protein interactions (PPIs) are one of the most important mechanisms in cellular processes. To model protein interaction sites, recent studies have suggested to find interacting protein group pairs from large PPI networks at the first step and then to search conserved motifs within the protein groups to form interacting motif pairs. To consider the noise effect and the incompleteness of biological data, we propose to use quasi-bicliques for finding interacting protein group pairs. We investigate two new problems that arise from finding interacting protein group pairs: the maximum vertex quasi-biclique problem and the maximum balanced quasi-biclique problem. We prove that both problems are NP-hard. This is a surprising result as the widely known maximum vertex biclique problem is polynomial time solvable [1]. We then propose a heuristic algorithm that uses the greedy method to find the quasi-bicliques from PPI networks. Our experiment results on real data show that this algorithm has a better performance than a benchmark algorithm for identifying highly matched BLOCKS and PRINTS motifs. We also report results of two case studies on interacting motif pairs that map well with two interacting domain pairs in iPfam. Availability: The software and supplementary information are available at http://www.cs.cityu.edu.hk/~lwang/software/ppi/index.html. © 2006 IEEE.
Please use this identifier to cite or link to this item: