Automated Deep Learning: A Study on Neural Architecture Search

Zhang, Miao

Publication Type: Thesis
Issue Date: 2021

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhang, Miao
dc.date.accessioned	2022-06-15T00:35:56Z
dc.date.available	2022-06-15T00:35:56Z
dc.date.issued	2021
dc.identifier.uri	http://hdl.handle.net/10453/158177
dc.description	University of Technology Sydney. Faculty of Engineering and Information Technology.	en_US.UTF-8
dc.format	Thesis (PhD)
dc.language.iso	en_US	en_US.UTF-8
dc.relation	https://opus.lib.uts.edu.au/bitstream/10453/158177/2/02whole.pdf
dc.rights	info:eu-repo/semantics/openAccess
dc.rights	The author owns the copyright in this thesis including all reproduction and reuse rights for the work. The work may not be altered without the permission of the copyright owner. Attribution is essential when quoting or paraphrasing from this thesis.
dc.rights	au.edu.uts.lib/ppc
dc.title	Automated Deep Learning: A Study on Neural Architecture Search	en_US.UTF-8
dc.type	Thesis
utslib.copyright.status	open_access	*

Abstract:

Automated Deep Learning (AutoDL) aims to build better deep learning models in a data-driven and automated manner, so that most deep learning practitioners can build high-performance models while being relieved of the labor-intensive and time-consuming process of neural network design. AutoDL can bring new research ideas to deep neural networks and lower the barrier to applying deep learning in various research areas through automated neural network design. This thesis focuses on two specific research problems of neural architecture search (NAS) in automated deep learning: one-shot NAS and differentiable NAS. For one-shot NAS, the thesis proposes a novelty-driven sampling method and formulates supernet training as a constrained continual learning optimization problem, addressing the "rich-get-richer" problem and the multi-model forgetting issue. For differentiable NAS, it leverages a variational graph autoencoder to relieve the non-negligible incongruence, formulates neural architecture search as a distribution learning problem to enhance exploration, and proposes differentiable architecture search with stochastic implicit gradients to enable multi-step inner optimization.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/158177