Parameter-Efficient Deep Neural Networks With Bilinear Projections.

Yu, L; Gao, Y; Zhou, J; Zhang, J

Publisher: Institute of Electrical and Electronics Engineers
Publication Type: Journal Article
Citation: IEEE Transactions on Neural Networks and Learning Systems, 2021, PP, (99), pp. 1-11
Issue Date: 2021

Closed Access

Filename	Description	Size	Format
Parameter-Efficient_Deep_Neural_Networks_With_Bilinear_Projections.pdf	Published version	1.9 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Yu, L https://orcid.org/0000-0001-5260-885X
dc.contributor.author	Gao, Y
dc.contributor.author	Zhou, J
dc.contributor.author	Zhang, J https://orcid.org/0000-0002-7240-3541
dc.date.accessioned	2021-08-19T01:24:34Z
dc.date.available	2021-08-19T01:24:34Z
dc.date.issued	2021
dc.identifier.citation	IEEE Transactions on Neural Networks and Learning Systems, 2021, PP, (99), pp. 1-11
dc.identifier.issn	1045-9227
dc.identifier.issn	2162-2388
dc.identifier.uri	http://hdl.handle.net/10453/150163
dc.description.abstract	Recent research on deep neural networks (DNNs) has primarily focused on improving model accuracy. Given a proper deep learning framework, it is generally possible to increase the depth or layer width to achieve a higher level of accuracy. However, the huge number of model parameters imposes more computational and memory overhead and leads to parameter redundancy. In this article, we address the parameter redundancy problem in DNNs by replacing conventional full projections with bilinear projections (BPs). For a fully connected layer with D input nodes and D output nodes, applying BP can reduce the model space complexity from O(D²) to O(2D), achieving a deep model with a sublinear layer size. However, the structured projection has fewer degrees of freedom than the full projection, which causes underfitting. Therefore, we simply scale up the mapping size by increasing the number of output channels, which can maintain and even boost the model accuracy. This makes such deep models very parameter-efficient and convenient to deploy on mobile systems with memory limitations. Experiments on four benchmark data sets show that applying the proposed BP to DNNs can achieve even higher accuracies than conventional full DNNs while significantly reducing the model size.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	Institute of Electrical and Electronics Engineers
dc.relation	http://purl.org/au-research/grants/arc/DP140101075
dc.relation.ispartof	IEEE Transactions on Neural Networks and Learning Systems
dc.relation.isbasedon	10.1109/tnnls.2020.3016688
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject.classification	Artificial Intelligence & Image Processing
dc.title	Parameter-Efficient Deep Neural Networks With Bilinear Projections.
dc.type	Journal Article
utslib.citation.volume	PP
utslib.location.activity	United States
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2021-08-19T01:24:33Z
pubs.issue	99
pubs.publication-status	Published
pubs.volume	PP
utslib.citation.issue	99

Abstract:

Recent research on deep neural networks (DNNs) has primarily focused on improving model accuracy. Given a proper deep learning framework, it is generally possible to increase the depth or layer width to achieve a higher level of accuracy. However, the huge number of model parameters imposes more computational and memory overhead and leads to parameter redundancy. In this article, we address the parameter redundancy problem in DNNs by replacing conventional full projections with bilinear projections (BPs). For a fully connected layer with D input nodes and D output nodes, applying BP can reduce the model space complexity from O(D²) to O(2D), achieving a deep model with a sublinear layer size. However, the structured projection has fewer degrees of freedom than the full projection, which causes underfitting. Therefore, we simply scale up the mapping size by increasing the number of output channels, which can maintain and even boost the model accuracy. This makes such deep models very parameter-efficient and convenient to deploy on mobile systems with memory limitations. Experiments on four benchmark data sets show that applying the proposed BP to DNNs can achieve even higher accuracies than conventional full DNNs while significantly reducing the model size.
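
The parameter saving described in the abstract can be illustrated with a small NumPy sketch. The abstract does not give the exact form of the projection, so the code assumes one common bilinear form: the D-dimensional input is reshaped into a d1 x d2 matrix X (with d1 * d2 = D) and mapped as Y = W1^T X W2. The function names, `W1`, `W2`, and the sizes used are illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def full_projection(x, W):
    """Conventional fully connected map: D inputs -> D outputs, D * D weights."""
    return W.T @ x


def bilinear_projection(x, W1, W2):
    """Assumed bilinear form: reshape x to (d1, d2), then compute W1^T X W2."""
    d1, d2 = W1.shape[0], W2.shape[0]
    X = x.reshape(d1, d2)          # row-major reshape of the input vector
    Y = W1.T @ X @ W2              # two small projections instead of one large one
    return Y.reshape(-1)           # flatten back to a D-dimensional output


def param_counts(d1, d2):
    """Return (full, bilinear) weight counts for D = d1 * d2 nodes."""
    D = d1 * d2
    return D * D, d1 * d1 + d2 * d2


rng = np.random.default_rng(0)
d1 = d2 = 8                        # hypothetical sizes, so D = 64
D = d1 * d2
x = rng.standard_normal(D)
W1 = rng.standard_normal((d1, d1))
W2 = rng.standard_normal((d2, d2))

y_bp = bilinear_projection(x, W1, W2)

# The bilinear map is a structured special case of a full map: with row-major
# flattening, the equivalent D x D weight matrix is the Kronecker product.
W_full = np.kron(W1, W2)
y_full = full_projection(x, W_full)

n_full, n_bp = param_counts(d1, d2)
print(n_full, n_bp)                # 4096 128, i.e. D^2 versus 2D
print(np.allclose(y_bp, y_full))   # True
```

With d1 = d2 = sqrt(D), the two small matrices hold 2D weights in total, which matches the O(D²) to O(2D) reduction stated in the abstract. The Kronecker check also shows why the structured map has fewer degrees of freedom than a full projection: only D x D matrices that factor in this way can be represented. The sketch does not model the increase in output channels that the abstract uses to counter underfitting.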

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/150163