Deep Reinforcement Learning for Smart City Communication Networks

Publisher:
Institute of Electrical and Electronics Engineers (IEEE)
Publication Type:
Journal Article
Citation:
IEEE Transactions on Industrial Informatics, 2021, 17, (6), pp. 4188-4196
Issue Date:
2021-06-01
Filename Description Size
Deep_Reinforcement_Learning_for_Smart_City_Communication_Networks.pdfPublished version1.84 MB
Adobe PDF
Full metadata record
Smart city communication networks hold the promise of harnessing the Internet of Things, smart devices, intelligent energy grids, autonomous cars, and many other data architectures to improve quality of life for every citizen. As time and progress have gradually shifted this concept from an idea into a reality, the possibilities for what services a smart city might provide have exploded. Along with that expansion, the number of devices that may need to be supported across a communications network has multiplied enormously. However, existing congestion control protocols are not equipped to support this higher requirement for network performance. New protocols are needed to provide higher data throughputs, reduce queuing delays and packet loss, and maintain stable and reliable communications pathways. The end-to-end congestion control protocol presented in this article does all these things. Called high throughput congestion control (HTCC), the framework takes full advantage of reinforcement learning and a fast growth algorithm to adaptively control the bytes in flight across the network's links to suit the prevailing network conditions. The result is a protocol that not only finds the optimal tradeoff between throughput, latency, and packet loss, but also significantly speeds up the learning process to avoid wasting network resources, use all free bandwidth and maximize network utilization. A comprehensive set of experiments with HTCC and six state-of-the-art protocols - Copa, PCC Vivace, PCC-Allegro, TCP Cubic, TaoVA-100x, and FillP-Sheep - demonstrate that, in most conditions, HTCC is able to make the best tradeoff between throughput, delay, and loss rate.
Please use this identifier to cite or link to this item: