uu.seUppsala University Publications
Change search
ReferencesLink to record
Permanent link

Direct link
Ensemble learning prediction of protein-protein interactions using proteins functional annotations
Show others and affiliations
2014 (English)In: MOL BIOSYST, ISSN 1742-206X, Vol. 10, no 4, 820-830 p.Article in journal (Refereed) Published
Abstract [en]

Protein-protein interactions are important for the majority of biological processes. A significant number of computational methods have been developed to predict protein-protein interactions using protein sequence, structural and genomic data. Vast experimental data is publicly available on the Internet, but it is scattered across numerous databases. This fact motivated us to create and evaluate new high-throughput datasets of interacting proteins. We extracted interaction data from DIP, MINT, BioGRID and IntAct databases. Then we constructed descriptive features for machine learning purposes based on data from Gene Ontology and DOMINE. Thereafter, four well-established machine learning methods: Support Vector Machine, Random Forest, Decision Tree and Naive Bayes, were used on these datasets to build an Ensemble Learning method based on majority voting. In cross-validation experiment, sensitivity exceeded 80% and classification/prediction accuracy reached 90% for the Ensemble Learning method. We extended the experiment to a bigger and more realistic dataset maintaining sensitivity over 70%. These results confirmed that our datasets are suitable for performing PPI prediction and Ensemble Learning method is well suited for this task. Both the processed PPI datasets and the software are available at http://sysbio.icm.edu.pl/indra/EL-PPI/home.html.

Place, publisher, year, edition, pages
2014. Vol. 10, no 4, 820-830 p.
National Category
Natural Sciences
URN: urn:nbn:se:uu:diva-221725DOI: 10.1039/c3mb70486fISI: 000332456200013OAI: oai:DiVA.org:uu-221725DiVA: diva2:710186
Available from: 2014-04-04 Created: 2014-04-03 Last updated: 2014-04-04Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full text
By organisation
Molecular Evolution
Natural Sciences

Search outside of DiVA

GoogleGoogle Scholar

Altmetric score

Total: 230 hits
ReferencesLink to record
Permanent link

Direct link