Old drug repositioning and new drug discovery through similarity learning from drug-target joint feature spaces.

Zheng, Y; Peng, H; Zhang, X; Zhao, Z; Gao, X; Li, J

Old drug repositioning and new drug discovery through similarity learning from drug-target joint feature spaces.

Zheng, Y Peng, H

Zhang, X

Zhao, Z

Gao, X Li, J

Permalink

Publisher:: BMC
Publication Type:: Journal Article
Citation:: BMC bioinformatics, 2019, 20, (Suppl 23), pp. 605
Issue Date:: 2019-12-27

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Accepted versionAdobe PDF (2.05 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Zheng, Y
dc.contributor.author	Peng, H https://orcid.org/0000-0002-4379-8097
dc.contributor.author	Zhang, X https://orcid.org/0000-0002-3783-6560
dc.contributor.author	Zhao, Z https://orcid.org/0000-0001-5544-4504
dc.contributor.author	Gao, X
dc.contributor.author	Li, J https://orcid.org/0000-0003-1833-7413
dc.date.accessioned	2020-05-05T22:39:39Z
dc.date.available	2019-11-14
dc.date.available	2020-05-05T22:39:39Z
dc.date.issued	2019-12-27
dc.identifier.citation	BMC bioinformatics, 2019, 20, (Suppl 23), pp. 605
dc.identifier.issn	1471-2105
dc.identifier.issn	1471-2105
dc.identifier.uri	http://hdl.handle.net/10453/140496
dc.description.abstract	BACKGROUND:Detection of new drug-target interactions by computational algorithms is of crucial value to both old drug repositioning and new drug discovery. Existing machine-learning methods rely only on experimentally validated drug-target interactions (i.e., positive samples) for the predictions. Their performance is severely impeded by the lack of reliable negative samples. RESULTS:We propose a method to construct highly-reliable negative samples for drug target prediction by a pairwise drug-target similarity measurement and OCSVM with a high-recall constraint. On one hand, we measure the pairwise similarity between every two drug-target interactions by combining the chemical similarity between their drugs and the Gene Ontology-based similarity between their targets. Then we calculate the accumulative similarity with all known drug-target interactions for each unobserved drug-target interaction. On the other hand, we obtain the signed distance from OCSVM learned from the known interactions with high recall (≥0.95) for each unobserved drug-target interaction. After normalizing all accumulative similarities and signed distances to the range [0,1], we compute the score for each unobserved drug-target interaction via averaging its accumulative similarity and signed distance. Unobserved interactions with lower scores are preferentially served as reliable negative samples for the classification algorithms. The performance of the proposed method is evaluated on the interaction data between 1094 drugs and 1556 target proteins. Extensive comparison experiments using four classical classifiers and one domain predictive method demonstrate the superior performance of the proposed method. A better decision boundary has been learned from the constructed reliable negative samples. CONCLUSIONS:Proper construction of highly-reliable negative samples can help the classification models learn a clear decision boundary which contributes to the performance improvement.
dc.format	Electronic
dc.language	eng
dc.publisher	BMC
dc.relation.ispartof	BMC bioinformatics
dc.relation.isbasedon	10.1186/s12859-019-3238-y
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	01 Mathematical Sciences, 06 Biological Sciences, 08 Information and Computing Sciences
dc.subject.classification	Bioinformatics
dc.subject.mesh	Humans
dc.subject.mesh	Area Under Curve
dc.subject.mesh	Drug Interactions
dc.subject.mesh	Algorithms
dc.subject.mesh	Drug Discovery
dc.subject.mesh	Drug Repositioning
dc.subject.mesh	Machine Learning
dc.subject.mesh	Algorithms
dc.subject.mesh	Area Under Curve
dc.subject.mesh	Drug Discovery
dc.subject.mesh	Drug Interactions
dc.subject.mesh	Drug Repositioning
dc.subject.mesh	Humans
dc.subject.mesh	Machine Learning
dc.title	Old drug repositioning and new drug discovery through similarity learning from drug-target joint feature spaces.
dc.type	Journal Article
utslib.citation.volume	20
utslib.location.activity	England
utslib.for	01 Mathematical Sciences
utslib.for	06 Biological Sciences
utslib.for	08 Information and Computing Sciences
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CHT - Health Technologies
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	open_access	*
dc.date.updated	2020-05-05T22:39:30Z
pubs.issue	Suppl 23
pubs.publication-status	Published
pubs.volume	20
utslib.start-page	605
utslib.citation.issue	Suppl 23

Abstract:

BACKGROUND:Detection of new drug-target interactions by computational algorithms is of crucial value to both old drug repositioning and new drug discovery. Existing machine-learning methods rely only on experimentally validated drug-target interactions (i.e., positive samples) for the predictions. Their performance is severely impeded by the lack of reliable negative samples. RESULTS:We propose a method to construct highly-reliable negative samples for drug target prediction by a pairwise drug-target similarity measurement and OCSVM with a high-recall constraint. On one hand, we measure the pairwise similarity between every two drug-target interactions by combining the chemical similarity between their drugs and the Gene Ontology-based similarity between their targets. Then we calculate the accumulative similarity with all known drug-target interactions for each unobserved drug-target interaction. On the other hand, we obtain the signed distance from OCSVM learned from the known interactions with high recall (≥0.95) for each unobserved drug-target interaction. After normalizing all accumulative similarities and signed distances to the range [0,1], we compute the score for each unobserved drug-target interaction via averaging its accumulative similarity and signed distance. Unobserved interactions with lower scores are preferentially served as reliable negative samples for the classification algorithms. The performance of the proposed method is evaluated on the interaction data between 1094 drugs and 1556 target proteins. Extensive comparison experiments using four classical classifiers and one domain predictive method demonstrate the superior performance of the proposed method. A better decision boundary has been learned from the constructed reliable negative samples. CONCLUSIONS:Proper construction of highly-reliable negative samples can help the classification models learn a clear decision boundary which contributes to the performance improvement.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/140496