Use of extreme patient samples for outcome prediction from gene expression data

Liu, H; Li, J; Wong, L

Use of extreme patient samples for outcome prediction from gene expression data

Liu, H Li, J

Wong, L

Permalink

Publication Type:: Journal Article
Citation:: Bioinformatics, 2005, 21 (16), pp. 3377 - 3384
Issue Date:: 2005-08-15

Closed Access

	Filename	Description	Size
	2010006947OK.pdf		399.09 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Liu, H	en_US
dc.contributor.author	Li, J https://orcid.org/0000-0003-1833-7413	en_US
dc.contributor.author	Wong, L	en_US
dc.date.issued	2005-08-15	en_US
dc.identifier.citation	Bioinformatics, 2005, 21 (16), pp. 3377 - 3384	en_US
dc.identifier.issn	1367-4803	en_US
dc.identifier.uri	http://hdl.handle.net/10453/14846
dc.description.abstract	Motivation: Patient outcome prediction using microarray technologies is an important application in bioinformatics. Based on patients' genotypic microarray data, predictions are made to estimate patients' survival time and their risk of tumor metastasis or recurrence. So, accurate prediction can potentially help to provide better treatment for patients. Results: We present a new computational method for patient outcome prediction. In the training phase of this method, we make use of two types of extreme patient samples: short-term survivors who got an unfavorable outcome within a short period and long-term survivors who were maintaining a favorable outcome after a long follow-up time. These extreme training samples yield a clear platform for us to identify relevant genes whose expression is closely related to the outcome. The selected extreme samples and the relevant genes are then integrated by a support vector machine to build a prediction model, by which each validation sample is assigned a risk score that falls into one of the special pre-defined risk groups. We apply this method to several public datasets. In most cases, patients in high and low risk groups stratified by our method have clearly distinguishable outcome status as seen in their Kaplan-Meier curves. We also show that the idea of selecting only extreme patient samples for training is effective for improving the prediction accuracy when different gene selection methods are used. © The Author 2005. Published by Oxford University Press. All rights reserved.	en_US
dc.relation.ispartof	Bioinformatics	en_US
dc.relation.isbasedon	10.1093/bioinformatics/bti544	en_US
dc.subject.classification	Bioinformatics	en_US
dc.subject.mesh	Humans	en_US
dc.subject.mesh	Neoplasms	en_US
dc.subject.mesh	Neoplasm Proteins	en_US
dc.subject.mesh	Tumor Markers, Biological	en_US
dc.subject.mesh	Diagnosis, Computer-Assisted	en_US
dc.subject.mesh	Prognosis	en_US
dc.subject.mesh	Oligonucleotide Array Sequence Analysis	en_US
dc.subject.mesh	Survival Rate	en_US
dc.subject.mesh	Risk Assessment	en_US
dc.subject.mesh	Risk Factors	en_US
dc.subject.mesh	Survival Analysis	en_US
dc.subject.mesh	Gene Expression Profiling	en_US
dc.subject.mesh	Outcome Assessment (Health Care)	en_US
dc.subject.mesh	Biomarkers, Tumor	en_US
dc.subject.mesh	Outcome Assessment, Health Care	en_US
dc.title	Use of extreme patient samples for outcome prediction from gene expression data	en_US
dc.type	Journal Article
utslib.citation.volume	16	en_US
utslib.citation.volume	21	en_US
utslib.for	080301 Bioinformatics Software	en_US
utslib.for	01 Mathematical Sciences	en_US
utslib.for	06 Biological Sciences	en_US
utslib.for	08 Information and Computing Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
pubs.organisational-group	/University of Technology Sydney/Strength - CHT - Health Technologies
utslib.copyright.status	closed_access
pubs.issue	16	en_US
pubs.publication-status	Published	en_US
pubs.volume	21	en_US

Abstract:

Motivation: Patient outcome prediction using microarray technologies is an important application in bioinformatics. Based on patients' genotypic microarray data, predictions are made to estimate patients' survival time and their risk of tumor metastasis or recurrence. So, accurate prediction can potentially help to provide better treatment for patients. Results: We present a new computational method for patient outcome prediction. In the training phase of this method, we make use of two types of extreme patient samples: short-term survivors who got an unfavorable outcome within a short period and long-term survivors who were maintaining a favorable outcome after a long follow-up time. These extreme training samples yield a clear platform for us to identify relevant genes whose expression is closely related to the outcome. The selected extreme samples and the relevant genes are then integrated by a support vector machine to build a prediction model, by which each validation sample is assigned a risk score that falls into one of the special pre-defined risk groups. We apply this method to several public datasets. In most cases, patients in high and low risk groups stratified by our method have clearly distinguishable outcome status as seen in their Kaplan-Meier curves. We also show that the idea of selecting only extreme patient samples for training is effective for improving the prediction accuracy when different gene selection methods are used. © The Author 2005. Published by Oxford University Press. All rights reserved.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/14846