A Meta-Reinforcement Learning Approach to Optimize Parameters and Hyper-parameters Simultaneously

Ali, AR; Budka, M; Gabrys, B

A Meta-Reinforcement Learning Approach to Optimize Parameters and Hyper-parameters Simultaneously

Ali, AR Budka, M Gabrys, B

Permalink

Publication Type:: Chapter
Citation:: 2019, 11671 LNAI pp. 93 - 106
Issue Date:: 2019-01-01

Closed Access

	Filename	Description	Size
	A Meta-Reinforcement Learning Approach to Optimize Parameters and Hyper-parameters Simultaneously.pdf	Published version	5.43 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Ali, AR	en_US
dc.contributor.author	Budka, M	en_US
dc.contributor.author	Gabrys, B https://orcid.org/0000-0002-0790-2846	en_US
dc.date.issued	2019-01-01	en_US
dc.identifier.citation	2019, 11671 LNAI pp. 93 - 106	en_US
dc.identifier.isbn	9783030299101	en_US
dc.identifier.uri	http://hdl.handle.net/10453/137527
dc.description.abstract	© 2019, Springer Nature Switzerland AG. In the last few years, we have witnessed a resurgence of interest in neural networks. The state-of-the-art deep neural network architectures are however challenging to design from scratch and requiring computationally costly empirical evaluations. Hence, there has been a lot of research effort dedicated to effective utilisation and adaptation of previously proposed architectures either by using transfer learning or by modifying the original architecture. The ultimate goal of designing a network architecture is to achieve the best possible accuracy for a given task or group of related tasks. Although there have been some efforts to automate network architecture design process, most of the existing solutions are still very computationally intensive. This work presents a framework to automatically find a good set of hyper-parameters resulting in reasonably good accuracy, which at the same time is less computationally expensive than the existing approaches. The idea presented here is to frame the hyper-parameter selection and tuning within the reinforcement learning regime. Thus, the parameters of a meta-learner, RNN, and hyper-parameters of the target network are tuned simultaneously. Our meta-learner is being updated using policy network and simultaneously generates a tuple of hyper-parameters which are utilized by another network. The network is trained on a given task for a number of steps and produces validation accuracy whose delta is used as reward. The reward along with the state of the network, comprising statistics of network’s final layer outcome and training loss, are fed back to the meta-learner which in turn generates a tuned tuple of hyper-parameters for the next time-step. Therefore, the effectiveness of a recommended tuple can be tested very quickly rather than waiting for the network to converge. This approach produces accuracy close to the state-of-the-art approach and is found to be comparatively less computationally intensive.	en_US
dc.relation.isbasedon	10.1007/978-3-030-29911-8_8	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	A Meta-Reinforcement Learning Approach to Optimize Parameters and Hyper-parameters Simultaneously	en_US
dc.type	Chapter
utslib.citation.volume	11671 LNAI	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
pubs.organisational-group	/University of Technology Sydney/Strength - CHT - Health Technologies
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	11671 LNAI	en_US

Abstract:

© 2019, Springer Nature Switzerland AG. In the last few years, we have witnessed a resurgence of interest in neural networks. The state-of-the-art deep neural network architectures are however challenging to design from scratch and requiring computationally costly empirical evaluations. Hence, there has been a lot of research effort dedicated to effective utilisation and adaptation of previously proposed architectures either by using transfer learning or by modifying the original architecture. The ultimate goal of designing a network architecture is to achieve the best possible accuracy for a given task or group of related tasks. Although there have been some efforts to automate network architecture design process, most of the existing solutions are still very computationally intensive. This work presents a framework to automatically find a good set of hyper-parameters resulting in reasonably good accuracy, which at the same time is less computationally expensive than the existing approaches. The idea presented here is to frame the hyper-parameter selection and tuning within the reinforcement learning regime. Thus, the parameters of a meta-learner, RNN, and hyper-parameters of the target network are tuned simultaneously. Our meta-learner is being updated using policy network and simultaneously generates a tuple of hyper-parameters which are utilized by another network. The network is trained on a given task for a number of steps and produces validation accuracy whose delta is used as reward. The reward along with the state of the network, comprising statistics of network’s final layer outcome and training loss, are fed back to the meta-learner which in turn generates a tuned tuple of hyper-parameters for the next time-step. Therefore, the effectiveness of a recommended tuple can be tested very quickly rather than waiting for the network to converge. This approach produces accuracy close to the state-of-the-art approach and is found to be comparatively less computationally intensive.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/137527