Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks.

He, Y; Dong, X; Kang, G; Fu, Y; Yan, C; Yang, Y

Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks.

He, Y

Dong, X

Kang, G Fu, Y Yan, C Yang, Y

Permalink

Publisher:: Institute of Electrical and Electronics Engineers
Publication Type:: Journal Article
Citation:: IEEE Transactions on Cybernetics, 2020, 50, (8), pp. 3594-3604
Issue Date:: 2020

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

The embargo period expires on 27 Aug 2022

Adobe PDF

Download Accepted ManuscriptAdobe PDF (2.24 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	He, Y https://orcid.org/0000-0002-2257-6073
dc.contributor.author	Dong, X https://orcid.org/0000-0001-9272-1590
dc.contributor.author	Kang, G
dc.contributor.author	Fu, Y
dc.contributor.author	Yan, C
dc.contributor.author	Yang, Y https://orcid.org/0000-0001-5528-0546
dc.date.accessioned	2020-11-01T19:46:33Z
dc.date.available	2020-11-01T19:46:33Z
dc.date.issued	2020
dc.identifier.citation	IEEE Transactions on Cybernetics, 2020, 50, (8), pp. 3594-3604
dc.identifier.issn	1083-4419
dc.identifier.issn	2168-2275
dc.identifier.uri	http://hdl.handle.net/10453/143668
dc.description.abstract	Deeper and wider convolutional neural networks (CNNs) achieve superior performance but bring expensive computation cost. Accelerating such overparameterized neural network has received increased attention. A typical pruning algorithm is a three-stage pipeline, i.e., training, pruning, and retraining. Prevailing approaches fix the pruned filters to zero during retraining and, thus, significantly reduce the optimization space. Besides, they directly prune a large number of filters at first, which would cause unrecoverable information loss. To solve these problems, we propose an asymptotic soft filter pruning (ASFP) method to accelerate the inference procedure of the deep neural networks. First, we update the pruned filters during the retraining stage. As a result, the optimization space of the pruned model would not be reduced but be the same as that of the original model. In this way, the model has enough capacity to learn from the training data. Second, we prune the network asymptotically. We prune few filters at first and asymptotically prune more filters during the training procedure. With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable. The experiments show the effectiveness of our ASFP on image classification benchmarks. Notably, on ILSVRC-2012, our ASFP reduces more than 40% FLOPs on ResNet-50 with only 0.14% top-5 accuracy degradation, which is higher than the soft filter pruning by 8%.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	Institute of Electrical and Electronics Engineers
dc.relation.ispartof	IEEE Transactions on Cybernetics
dc.relation.isbasedon	10.1109/tcyb.2019.2933477
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.subject	0102 Applied Mathematics, 0801 Artificial Intelligence and Image Processing, 0906 Electrical and Electronic Engineering
dc.subject.classification	Artificial Intelligence & Image Processing
dc.title	Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks.
dc.type	Journal Article
utslib.citation.volume	50
utslib.location.activity	United States
utslib.for	0801 Artificial Intelligence and Image Processing
utslib.for	0102 Applied Mathematics
utslib.for	0906 Electrical and Electronic Engineering
utslib.for	0102 Applied Mathematics
utslib.for	0801 Artificial Intelligence and Image Processing
utslib.for	0906 Electrical and Electronic Engineering
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	open_access	*
pubs.consider-herdc	true
utslib.copyright.embargo	2022-08-27T00:00:00+1000Z
dc.date.updated	2020-11-01T19:46:28Z
pubs.issue	8
pubs.publication-status	Published
pubs.volume	50
utslib.citation.issue	8

Abstract:

Deeper and wider convolutional neural networks (CNNs) achieve superior performance but bring expensive computation cost. Accelerating such overparameterized neural network has received increased attention. A typical pruning algorithm is a three-stage pipeline, i.e., training, pruning, and retraining. Prevailing approaches fix the pruned filters to zero during retraining and, thus, significantly reduce the optimization space. Besides, they directly prune a large number of filters at first, which would cause unrecoverable information loss. To solve these problems, we propose an asymptotic soft filter pruning (ASFP) method to accelerate the inference procedure of the deep neural networks. First, we update the pruned filters during the retraining stage. As a result, the optimization space of the pruned model would not be reduced but be the same as that of the original model. In this way, the model has enough capacity to learn from the training data. Second, we prune the network asymptotically. We prune few filters at first and asymptotically prune more filters during the training procedure. With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable. The experiments show the effectiveness of our ASFP on image classification benchmarks. Notably, on ILSVRC-2012, our ASFP reduces more than 40% FLOPs on ResNet-50 with only 0.14% top-5 accuracy degradation, which is higher than the soft filter pruning by 8%.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/143668