Training memristor-based multilayer neuromorphic networks with SGD, momentum and adaptive learning rates.

Yan, Z; Chen, J; Hu, R; Huang, T; Chen, Y; Wen, S

Training memristor-based multilayer neuromorphic networks with SGD, momentum and adaptive learning rates.

Yan, Z Chen, J Hu, R Huang, T Chen, Y Wen, S

Permalink

Publisher:: Elsevier BV
Publication Type:: Journal Article
Citation:: Neural networks : the official journal of the International Neural Network Society, 2020, 128, pp. 142-149
Issue Date:: 2020-08

Closed Access

	Filename	Description	Size
	1-s2.0-S0893608020301532-main.pdf	Published version	1.44 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Yan, Z
dc.contributor.author	Chen, J
dc.contributor.author	Hu, R
dc.contributor.author	Huang, T
dc.contributor.author	Chen, Y
dc.contributor.author	Wen, S https://orcid.org/0000-0002-5048-0319
dc.date.accessioned	2021-02-09T05:49:22Z
dc.date.available	2020-04-27
dc.date.available	2021-02-09T05:49:22Z
dc.date.issued	2020-08
dc.identifier.citation	Neural networks : the official journal of the International Neural Network Society, 2020, 128, pp. 142-149
dc.identifier.issn	0893-6080
dc.identifier.issn	1879-2782
dc.identifier.uri	http://hdl.handle.net/10453/145990
dc.description.abstract	Neural networks implemented with traditional hardware face inherent limitation of memory latency. Specifically, the processing units like GPUs, FPGAs, and customized ASICs, must wait for inputs to read from memory and outputs to write back. This motivates memristor-based neuromorphic computing in which the memory units (i.e., memristors) have computing capabilities. However, training a memristor-based neural network is difficult since memristors work differently from CMOS hardware. This paper proposes a new training approach that enables prevailing neural network training techniques to be applied for memristor-based neuromorphic networks. Particularly, we introduce momentum and adaptive learning rate to the circuit training, both of which are proven methods that significantly accelerate the convergence of neural network parameters. Furthermore, we show that this circuit can be used for neural networks with arbitrary numbers of layers, neurons, and parameters. Simulation results on four classification tasks demonstrate that the proposed circuit achieves both high accuracy and fast speed. Compared with the SGD-based training circuit, on the WBC data set, the training speed of our circuit is increased by 37.2% while the accuracy is only reduced by 0.77%. On the MNIST data set, the new circuit even leads to improved accuracy.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	Elsevier BV
dc.relation.ispartof	Neural networks : the official journal of the International Neural Network Society
dc.relation.isbasedon	10.1016/j.neunet.2020.04.025
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.subject.classification	Artificial Intelligence & Image Processing
dc.subject.mesh	Neurons
dc.subject.mesh	Humans
dc.subject.mesh	Learning
dc.subject.mesh	Motion
dc.subject.mesh	Neural Networks, Computer
dc.subject.mesh	Neurons
dc.subject.mesh	Humans
dc.subject.mesh	Learning
dc.subject.mesh	Motion
dc.subject.mesh	Neural Networks, Computer
dc.subject.mesh	Humans
dc.subject.mesh	Learning
dc.subject.mesh	Motion
dc.subject.mesh	Neural Networks, Computer
dc.subject.mesh	Neurons
dc.title	Training memristor-based multilayer neuromorphic networks with SGD, momentum and adaptive learning rates.
dc.type	Journal Article
utslib.citation.volume	128
utslib.location.activity	United States
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2021-02-09T05:49:16Z
pubs.publication-status	Published
pubs.volume	128

Abstract:

Neural networks implemented with traditional hardware face inherent limitation of memory latency. Specifically, the processing units like GPUs, FPGAs, and customized ASICs, must wait for inputs to read from memory and outputs to write back. This motivates memristor-based neuromorphic computing in which the memory units (i.e., memristors) have computing capabilities. However, training a memristor-based neural network is difficult since memristors work differently from CMOS hardware. This paper proposes a new training approach that enables prevailing neural network training techniques to be applied for memristor-based neuromorphic networks. Particularly, we introduce momentum and adaptive learning rate to the circuit training, both of which are proven methods that significantly accelerate the convergence of neural network parameters. Furthermore, we show that this circuit can be used for neural networks with arbitrary numbers of layers, neurons, and parameters. Simulation results on four classification tasks demonstrate that the proposed circuit achieves both high accuracy and fast speed. Compared with the SGD-based training circuit, on the WBC data set, the training speed of our circuit is increased by 37.2% while the accuracy is only reduced by 0.77%. On the MNIST data set, the new circuit even leads to improved accuracy.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/145990