VNE-TD: A virtual network embedding algorithm based on temporal-difference learning

Publication Type:
Journal Article
Citation:
Computer Networks, 2019, 161 pp. 251 - 263
Issue Date:
2019-10-09
Abstract:
© 2019. Network virtualization is considered a promising solution for the future Internet, helping to overcome the current Internet's resistance to fundamental change. The problem of embedding Virtual Networks (VNs) in a Substrate Network (SN) is the main resource allocation challenge in network virtualization. The central difficulty of the Virtual Network Embedding (VNE) problem lies in the contradiction between making online embedding decisions and pursuing a long-term objective. Most previous work deals with this contradiction by balancing the SN workload in various ways. Rather than balancing passively, we try to overcome the contradiction by learning actively and making online decisions based on previous experience. In this article, we model the VNE problem as a Markov Decision Process (MDP) and develop a neural network to approximate the value function of VNE states. Further, a VNE algorithm based on Temporal-Difference (TD) Learning, a kind of Reinforcement Learning, named VNE-TD, is proposed. In VNE-TD, multiple node-mapping embedding candidates are generated probabilistically, and TD learning is used to evaluate the long-run potential of each candidate. Extensive simulation results show that VNE-TD significantly outperforms previous algorithms in terms of both block ratio and revenue.
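The core idea the abstract describes, scoring embedding candidates by the learned long-run value of the state they lead to and updating that value with temporal-difference errors, can be sketched as follows. This is a minimal illustration, not the paper's implementation: it uses a tabular value function in place of the paper's neural-network approximator, and the state labels, reward, and step sizes are hypothetical.

```python
# Minimal TD(0) sketch of value-based candidate selection (hypothetical
# states and parameters; the paper uses a neural network, not a table).

def td0_update(V, state, reward, next_state, alpha=0.1, gamma=0.9):
    """One TD(0) step: V(s) <- V(s) + alpha * (r + gamma * V(s') - V(s))."""
    V.setdefault(state, 0.0)
    V.setdefault(next_state, 0.0)
    V[state] += alpha * (reward + gamma * V[next_state] - V[state])
    return V[state]

def pick_candidate(V, candidates):
    """Choose the embedding candidate whose resulting SN state has the
    highest estimated long-run value (ties broken by dict order via max)."""
    return max(candidates, key=lambda s: V.get(s, 0.0))

if __name__ == "__main__":
    V = {}
    # Embedding a VN request moved the SN from state "s0" to "s1" and
    # earned revenue 1.0; propagate that experience into V("s0").
    td0_update(V, "s0", 1.0, "s1")
    # Among two candidate next states, prefer the one with higher value.
    best = pick_candidate(V, ["s0", "s1"])
    print(V["s0"], best)
```

In VNE-TD the same loop runs over probabilistically generated node-mapping candidates, so the online decision at each arrival is informed by values learned from earlier embeddings rather than by static load balancing.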