Online Learning of Optimal Proactive Schedule Based on Outdated Knowledge for Energy Harvesting Powered Internet-of-Things

Lyu, X; Ren, C; Ni, W; Tian, H; Cui, Q; Liu, RP

Online Learning of Optimal Proactive Schedule Based on Outdated Knowledge for Energy Harvesting Powered Internet-of-Things

Lyu, X Ren, C Ni, W

Tian, H Cui, Q Liu, RP

Permalink

Publisher:: Institute of Electrical and Electronics Engineers
Publication Type:: Journal Article
Citation:: IEEE Transactions on Wireless Communications, 2021, 20, (2), pp. 1248-1262
Issue Date:: 2021-02-01

Closed Access

	Filename	Description	Size
	Online_Learning_of_Optimal_Proactive_Schedule_Based_on_Outdated_Knowledge_for_Energy_Harvesting_Powered_Internet-of-Things.pdf		1.66 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Lyu, X
dc.contributor.author	Ren, C
dc.contributor.author	Ni, W https://orcid.org/0000-0002-4933-594X
dc.contributor.author	Tian, H
dc.contributor.author	Cui, Q
dc.contributor.author	Liu, RP
dc.date.accessioned	2022-03-21T04:17:15Z
dc.date.available	2022-03-21T04:17:15Z
dc.date.issued	2021-02-01
dc.identifier.citation	IEEE Transactions on Wireless Communications, 2021, 20, (2), pp. 1248-1262
dc.identifier.issn	1536-1276
dc.identifier.issn	1558-2248
dc.identifier.uri	http://hdl.handle.net/10453/155412
dc.description.abstract	This paper aims to produce an effective online scheduling technique, where a base station (BS) schedules the transmissions of energy harvesting-powered Internet-of-Things (IoT) devices only based on the (differently outdated) in-band reports of the devices on their states. We establish a new primal-dual learning framework, which learns online the optimal proactive schedules to maximize the time-average throughput of all the devices. Batch gradient descent is designed to enable stochastic gradient descent (SGD)-based dual learning to learn the network dynamics from the outdated reports. Replay memory is deployed to allow online convex optimization (OCO)-based primal learning to predict channel conditions and prevent over-fitting. We also decentralize the online learning between the BS and devices, and speed up learning by leveraging the instantaneous knowledge of the devices on their states. We prove that the proposed framework asymptotically converges to the global optimum, and the impact of the outdated knowledge of the BS diminishes. Simulation results confirm that the proposed approach can increasingly outperform state of the art, as the number of devices grows.
dc.language	English
dc.publisher	Institute of Electrical and Electronics Engineers
dc.relation.ispartof	IEEE Transactions on Wireless Communications
dc.relation.isbasedon	10.1109/TWC.2020.3031927
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	0805 Distributed Computing, 0906 Electrical and Electronic Engineering, 1005 Communications Technologies
dc.subject.classification	Networking & Telecommunications
dc.title	Online Learning of Optimal Proactive Schedule Based on Outdated Knowledge for Energy Harvesting Powered Internet-of-Things
dc.type	Journal Article
utslib.citation.volume	20
utslib.for	0805 Distributed Computing
utslib.for	0906 Electrical and Electronic Engineering
utslib.for	1005 Communications Technologies
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2022-03-21T04:17:14Z
pubs.issue	2
pubs.publication-status	Published
pubs.volume	20
utslib.citation.issue	2

Abstract:

This paper aims to produce an effective online scheduling technique, where a base station (BS) schedules the transmissions of energy harvesting-powered Internet-of-Things (IoT) devices only based on the (differently outdated) in-band reports of the devices on their states. We establish a new primal-dual learning framework, which learns online the optimal proactive schedules to maximize the time-average throughput of all the devices. Batch gradient descent is designed to enable stochastic gradient descent (SGD)-based dual learning to learn the network dynamics from the outdated reports. Replay memory is deployed to allow online convex optimization (OCO)-based primal learning to predict channel conditions and prevent over-fitting. We also decentralize the online learning between the BS and devices, and speed up learning by leveraging the instantaneous knowledge of the devices on their states. We prove that the proposed framework asymptotically converges to the global optimum, and the impact of the outdated knowledge of the BS diminishes. Simulation results confirm that the proposed approach can increasingly outperform state of the art, as the number of devices grows.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/155412