Optimal Beam Association in mmWave Vehicular Networks with Parallel Reinforcement Learning

Van Huynh, N; Nguyen, DN; Hoang, DT; Dutkiewicz, E

Publisher: IEEE
Publication Type: Conference Proceeding
Citation: 2020 IEEE Global Communications Conference, GLOBECOM 2020 - Proceedings, 2021, 2020-January, pp. 1-6
Issue Date: 2021-12-01

This item is open access.

The embargo period expires on 1 Dec 2023

Full metadata record

Field	Value	Language
dc.contributor.author	Van Huynh, N
dc.contributor.author	Nguyen, DN
dc.contributor.author	Hoang, DT
dc.contributor.author	Dutkiewicz, E https://orcid.org/0000-0002-4268-9286
dc.date	2020-12-07
dc.date.accessioned	2021-06-21T06:55:23Z
dc.date.available	2021-06-21T06:55:23Z
dc.date.issued	2021-12-01
dc.identifier.citation	2020 IEEE Global Communications Conference, GLOBECOM 2020 - Proceedings, 2021, 2020-January, pp. 1-6
dc.identifier.isbn	9781728182988
dc.identifier.uri	http://hdl.handle.net/10453/149675
dc.description.abstract	This paper develops a beam association framework for mmWave vehicular networks to improve system performance in terms of handover, disconnection time, and data rate under high vehicle mobility. In particular, we employ a semi-Markov decision process to capture the uncertainty and dynamics of the environment, such as beam locations, received signal strength indicator profiles, velocities, and blockages. Instead of adopting complex deep learning structures such as deep dueling and double deep Q-learning, we develop a lightweight yet very effective parallel Q-learning algorithm that quickly derives the optimal beam association policy by learning simultaneously from various vehicles on the road. Through extensive simulation results, we demonstrate that the proposed framework can reduce the average disconnection time by 33% and increase the data rate by 60% compared to other solutions. We also observe that the proposed parallel Q-learning algorithm converges much faster to the optimal solution than state-of-the-art deep-learning-based algorithms.
dc.language	en
dc.publisher	IEEE
dc.relation.ispartof	2020 IEEE Global Communications Conference, GLOBECOM 2020 - Proceedings
dc.relation.ispartof	IEEE Global Communications Conference
dc.relation.isbasedon	10.1109/GLOBECOM42002.2020.9348240
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.title	Optimal Beam Association in mmWave Vehicular Networks with Parallel Reinforcement Learning
dc.type	Conference Proceeding
utslib.citation.volume	2020-January
utslib.location.activity	Taipei, Taiwan
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	open_access	*
pubs.consider-herdc	false
utslib.copyright.embargo	2023-12-01T00:00:00+1000Z
dc.date.updated	2021-06-21T06:55:22Z
pubs.finish-date	2020-12-11
pubs.place-of-publication	Piscataway, USA
pubs.publication-status	Published
pubs.start-date	2020-12-07
pubs.volume	2020-January
dc.location	Piscataway, USA

Abstract:

This paper develops a beam association framework for mmWave vehicular networks to improve system performance in terms of handover, disconnection time, and data rate under high vehicle mobility. In particular, we employ a semi-Markov decision process to capture the uncertainty and dynamics of the environment, such as beam locations, received signal strength indicator profiles, velocities, and blockages. Instead of adopting complex deep learning structures such as deep dueling and double deep Q-learning, we develop a lightweight yet very effective parallel Q-learning algorithm that quickly derives the optimal beam association policy by learning simultaneously from various vehicles on the road. Through extensive simulation results, we demonstrate that the proposed framework can reduce the average disconnection time by 33% and increase the data rate by 60% compared to other solutions. We also observe that the proposed parallel Q-learning algorithm converges much faster to the optimal solution than state-of-the-art deep-learning-based algorithms.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/149675