Intelligent Resource Management with Deep Reinforcement Learning in Device-to-Device Communication

Cotton, David F.

Intelligent Resource Management with Deep Reinforcement Learning in Device-to-Device Communication

Cotton, David F.

Permalink

Publication Type:: Thesis
Issue Date:: 2022

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download contents and abstractAdobe PDF (303.16 kB)

Adobe PDF

Download thesisAdobe PDF (1.66 MB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Cotton, David F.
dc.date.accessioned	2022-08-22T03:25:35Z
dc.date.available	2022-08-22T03:25:35Z
dc.date.issued	2022
dc.identifier.uri	http://hdl.handle.net/10453/160667
dc.description	University of Technology Sydney. Faculty of Engineering and Information Technology.	en_US.UTF-8
dc.description.abstract	Radio resource management in device-to-device cellular offload can be optimised to increase network capacity, quality of service, energy efficiency, lower latency and provide more resilient networks. However, this resource optimisation problem is both NP-Hard and required to operate at a millisecond timescale, limiting feasible solutions. In this thesis, we investigate how deep reinforcement learning can be applied to improve resource allocation. To empirically demonstrate our approach, we develop a network simulator for device-to-device cellular offload research. We also introduce an improved self-play algorithm for training reinforcement learning without expert guidance. We apply our self-play training algorithm to the game Connect Four. Leveraging the competitive pressures of coevolution, we improve the performance of agents trained with our method, achieving a 15% higher win rate. Furthermore, agents exhibit more stable training dynamics and suffer fewer performance regressions. We evaluate our network simulator and demonstrate deep reinforcement learning can significantly increase network capacity. Our network simulator reduces research friction and provides an evaluation platform to compare, share and build upon results. Our toolkit is provided to other researchers as open-source software.	en_US.UTF-8
dc.format	Thesis (MAnalytics)
dc.language.iso	en_US	en_US.UTF-8
dc.relation	https://opus.lib.uts.edu.au/bitstream/10453/160667/2/02whole.pdf
dc.rights	The author owns the copyright in this thesis including all reproduction and reuse rights for the work. The work may not be altered without the permission of the copyright owner. Attribution is essential when quoting or paraphrasing from this thesis.
dc.rights	au.edu.uts.lib/ppc
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Intelligent Resource Management with Deep Reinforcement Learning in Device-to-Device Communication	en_US.UTF-8
dc.type	Thesis
utslib.copyright.status	open_access	*

Abstract:

Radio resource management in device-to-device cellular offload can be optimised to increase network capacity, quality of service, energy efficiency, lower latency and provide more resilient networks. However, this resource optimisation problem is both NP-Hard and required to operate at a millisecond timescale, limiting feasible solutions. In this thesis, we investigate how deep reinforcement learning can be applied to improve resource allocation. To empirically demonstrate our approach, we develop a network simulator for device-to-device cellular offload research. We also introduce an improved self-play algorithm for training reinforcement learning without expert guidance. We apply our self-play training algorithm to the game Connect Four. Leveraging the competitive pressures of coevolution, we improve the performance of agents trained with our method, achieving a 15% higher win rate. Furthermore, agents exhibit more stable training dynamics and suffer fewer performance regressions. We evaluate our network simulator and demonstrate deep reinforcement learning can significantly increase network capacity. Our network simulator reduces research friction and provides an evaluation platform to compare, share and build upon results. Our toolkit is provided to other researchers as open-source software.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/160667