Optimal Beam Association in mmWave Vehicular Networks with Parallel Reinforcement Learning

Publisher:
IEEE
Publication Type:
Conference Proceeding
Citation:
2020 IEEE Global Communications Conference, GLOBECOM 2020 - Proceedings, 2021, 2020-January, pp. 1-6
Issue Date:
2021-12-01
Full metadata record
This paper develops a beam association framework for mm Wave vehicular networks to improve the system performance in terms of handover, disconnection time, and data rate under the high mobility of vehicles. In particular, we recruit the semi Markov decision process to capture the uncertainty and dynamic of the environment such as locations of beams, received signal strength indicator profiles, velocities, and blockages. Instead of adopting complex deep learning structures such as deep dueling and double deep Q-learning, we develop a lightweight yet very effective parallel Q-learning algorithm to quickly derive the optimal beam association policy by simultaneously learning from various vehicles on the road. Through extensive simulation results, we demonstrate that the proposed framework can reduce the average disconnection time by 33% and increase the data rate by 60% compared to other solutions. We also observed that the proposed parallel Q-learning algorithm converges much faster to the optimal solution than state-of-the-art deep-learning based algorithms.
Please use this identifier to cite or link to this item: