GScheduler: Optimizing resource provision by using GPU usage pattern extraction in cloud environments

Xu, Z; Dong, F; Jin, J; Luo, J; Shen, J

Publication Type: Conference Proceeding
Citation: 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017, 2017, 2017-January, pp. 3225-3230
Issue Date: 2017-11-27

Closed Access

	Filename	Description	Size	Format
	smc2017.pdf	Published version	436.8 kB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Xu, Z	en_US
dc.contributor.author	Dong, F	en_US
dc.contributor.author	Jin, J	en_US
dc.contributor.author	Luo, J	en_US
dc.contributor.author	Shen, J https://orcid.org/0000-0002-9403-7140	en_US
dc.date.issued	2017-11-27	en_US
dc.identifier.citation	2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017, 2017, 2017-January pp. 3225 - 3230	en_US
dc.identifier.isbn	9781538616451	en_US
dc.identifier.uri	http://hdl.handle.net/10453/131253
dc.relation.ispartof	2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017	en_US
dc.relation.isbasedon	10.1109/SMC.2017.8123125	en_US
dc.title	GScheduler: Optimizing resource provision by using GPU usage pattern extraction in cloud environments	en_US
dc.type	Conference Proceeding
utslib.citation.volume	2017-January	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Information, Systems and Modelling
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	2017-January	en_US

Abstract:

© 2017 IEEE. GPU-based clusters are widely used to accelerate a variety of scientific applications in high-end cloud environments. As they grow in popularity, it becomes necessary to improve system throughput and reduce turnaround time for applications co-executing on the same GPU device. However, resource contention among multiple applications on a multi-tasked GPU degrades application performance. Previous approaches either cannot learn the characteristics of a GPU application accurately enough before execution or cannot obtain such information in a timely manner, which may lead to misguided scheduling decisions. In this paper, we present GScheduler, a framework to detect and reduce interference among co-executing applications in GPU-based clouds. The most important feature of GScheduler is a GPU usage pattern extractor for detecting interference between applications. It is composed of a key function-call graph extractor and a key GPU resource usage vector extractor: the former is used to detect the similarity of GPU usage modes between applications, while the latter is used to calculate the similarity of their GPU resource requirements. In addition, an interference-aware scheduler is proposed to minimize the interference. We evaluated our framework with 26 diverse, real-world CUDA applications. Compared with state-of-the-art interference-oblivious schedulers, our framework improves system throughput by 36% on average and reduces turnaround time by 30.5% on average.
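
The abstract does not give the similarity measure or the scheduling algorithm, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes that each application is summarized by a hypothetical resource usage vector (here: compute, memory bandwidth, and cache demand, each normalized to 0..1), that cosine similarity stands in for the similarity measure, and that applications with similar demands interfere more when co-located. Under those assumptions, a greedy scheduler can pair the least similar applications on the same GPU. All names and values below are illustrative.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pair_for_colocation(usage):
    """Greedily pair applications, least similar pairs first.

    ASSUMPTION: higher similarity of resource usage vectors means more
    interference, so dissimilar applications are preferred co-runners.
    """
    scored = sorted(
        (cosine_similarity(usage[a], usage[b]), a, b)
        for a, b in combinations(sorted(usage), 2)
    )
    paired, pairs = set(), []
    for _, a, b in scored:
        if a not in paired and b not in paired:
            pairs.append((a, b))
            paired.update((a, b))
    return pairs


# Hypothetical vectors: [compute, memory bandwidth, cache] demand.
usage = {
    "app_a": [0.9, 0.1, 0.2],  # compute-heavy
    "app_b": [0.8, 0.2, 0.1],  # compute-heavy
    "app_c": [0.1, 0.9, 0.3],  # bandwidth-heavy
    "app_d": [0.2, 0.8, 0.4],  # bandwidth-heavy
}

pairs = pair_for_colocation(usage)
print(pairs)
```

With these example vectors, each compute-heavy application is paired with a bandwidth-heavy one. A production scheduler would also need the function-call graph comparison described above, which this sketch omits.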

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/131253