Sc2Net: Sparse LSTMs for sparse coding <sup>∗</sup>

Zhou, JT; Di, K; Du, J; Peng, X; Yang, H; Pan, SJ; Tsang, IW; Liu, Y; Qin, Z; Goh, RSM

Sc2Net: Sparse LSTMs for sparse coding <sup>∗</sup>

Zhou, JT Di, K Du, J Peng, X Yang, H Pan, SJ Tsang, IW

Liu, Y Qin, Z Goh, RSM

Permalink

Publication Type:: Conference Proceeding
Citation:: 32nd AAAI Conference on Artificial Intelligence, AAAI 2018, 2018, pp. 4588 - 4595
Issue Date:: 2018-01-01

Closed Access

	Filename	Description	Size
	16822-77539-1-PB.pdf	Published version	1.08 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhou, JT	en_US
dc.contributor.author	Di, K	en_US
dc.contributor.author	Du, J	en_US
dc.contributor.author	Peng, X	en_US
dc.contributor.author	Yang, H	en_US
dc.contributor.author	Pan, SJ	en_US
dc.contributor.author	Tsang, IW https://orcid.org/0000-0001-8095-4637	en_US
dc.contributor.author	Liu, Y	en_US
dc.contributor.author	Qin, Z	en_US
dc.contributor.author	Goh, RSM	en_US
dc.date.issued	2018-01-01	en_US
dc.identifier.citation	32nd AAAI Conference on Artificial Intelligence, AAAI 2018, 2018, pp. 4588 - 4595	en_US
dc.identifier.isbn	9781577358008	en_US
dc.identifier.uri	http://hdl.handle.net/10453/133213
dc.description.abstract	Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. The iterative hard-thresholding algorithm (ISTA) is one of the most popular optimization solvers to achieve sparse codes. However, ISTA suffers from following problems: 1) ISTA employs non-adaptive updating strategy to learn the parameters on each dimension with a fixed learning rate. Such a strategy may lead to inferior performance due to the scarcity of diversity; 2) ISTA does not incorporate the historical information into the updating rules, and the historical information has been proven helpful to speed up the convergence. To address these challenging issues, we propose a novel formulation of ISTA (named as adaptive ISTA) by introducing a novel adaptive momentum vector. To efficiently solve the proposed adaptive ISTA, we recast it as a recurrent neural network unit and show its connection with the well-known long short term memory (LSTM) model. With a new proposed unit, we present a neural network (termed SC2Net) to achieve sparse codes in an end-to-end manner. To the best of our knowledge, this is one of the first works to bridge the 1-solver and LSTM, and may provide novel insights in understanding model-based optimization and LSTM. Extensive experiments show the effectiveness of our method on both unsupervised and supervised tasks.	en_US
dc.relation	http://purl.org/au-research/grants/arc/FT130100746
dc.relation	http://purl.org/au-research/grants/arc/LP150100671
dc.relation	http://purl.org/au-research/grants/arc/DP180100106
dc.relation.ispartof	32nd AAAI Conference on Artificial Intelligence, AAAI 2018	en_US
dc.title	Sc2Net: Sparse LSTMs for sparse coding <sup>∗</sup>	en_US
dc.type	Conference Proceeding
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US

Abstract:

Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. The iterative hard-thresholding algorithm (ISTA) is one of the most popular optimization solvers to achieve sparse codes. However, ISTA suffers from following problems: 1) ISTA employs non-adaptive updating strategy to learn the parameters on each dimension with a fixed learning rate. Such a strategy may lead to inferior performance due to the scarcity of diversity; 2) ISTA does not incorporate the historical information into the updating rules, and the historical information has been proven helpful to speed up the convergence. To address these challenging issues, we propose a novel formulation of ISTA (named as adaptive ISTA) by introducing a novel adaptive momentum vector. To efficiently solve the proposed adaptive ISTA, we recast it as a recurrent neural network unit and show its connection with the well-known long short term memory (LSTM) model. With a new proposed unit, we present a neural network (termed SC2Net) to achieve sparse codes in an end-to-end manner. To the best of our knowledge, this is one of the first works to bridge the 1-solver and LSTM, and may provide novel insights in understanding model-based optimization and LSTM. Extensive experiments show the effectiveness of our method on both unsupervised and supervised tasks.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/133213