Multidimensional Dynamic Pruning: Exploring Spatial and Channel Fuzzy Sparsity

Publisher:
Institute of Electrical and Electronics Engineers (IEEE)
Publication Type:
Journal Article
Citation:
IEEE Transactions on Fuzzy Systems, 2024, 32, (9), pp. 4890-4901
Issue Date:
2024-01-01
File:
1719661.pdf (Published version, 1.92 MB)
Dynamic pruning is an effective model compression method for reducing the computational cost of neural networks. However, existing dynamic pruning methods are limited to pruning along a single dimension (channel, spatial, or depth) and therefore cannot fully exploit the redundancy of the network. Moreover, most current state-of-the-art methods implement dynamic pruning by masking out partial channels and pixels during training, and thus fail to accelerate inference. To address these limitations, we propose a novel fuzzy-based multidimensional dynamic pruning paradigm that dynamically compresses neural networks along both the channel and spatial dimensions. Specifically, we design a multidimensional fuzzy-mask block that simultaneously learns which spatial positions and channels are redundant and should be pruned. The Gumbel-Softmax trick, combined with a sparsity loss, is then introduced to train these mask modules in an end-to-end manner. During the testing stage, we convert the features and convolution kernels into two matrices and implement sparse convolution as a matrix multiplication to accelerate network inference. Extensive experiments demonstrate that our method outperforms existing methods in terms of both accuracy and computational cost. For instance, on the CIFAR-10 dataset, our method prunes 68% of the FLOPs of ResNet-56 with only a 0.07% Top-1 accuracy drop.
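The two mechanisms named in the abstract can be illustrated with a minimal NumPy sketch, under assumptions of our own (this is not the authors' code): a Gumbel-Softmax relaxation that produces a differentiable keep/prune mask during training, and a test-time sparse 1x1 convolution expressed as a dense matrix multiplication restricted to the kept channels and spatial positions. All shapes, names, and the temperature value are illustrative.

```python
# Illustrative sketch only: Gumbel-Softmax mask sampling and matmul-based
# sparse convolution. Shapes, names, and hyperparameters are assumptions.
import numpy as np

rng = np.random.default_rng(0)

def gumbel_softmax_mask(logits, tau=1.0):
    """Soft keep-probability per unit; logits has shape (n, 2) = (prune, keep)."""
    g = -np.log(-np.log(rng.uniform(1e-9, 1.0, logits.shape)))  # Gumbel noise
    y = logits + g
    y = np.exp((y - y.max(axis=1, keepdims=True)) / tau)        # tempered softmax
    y = y / y.sum(axis=1, keepdims=True)
    return y[:, 1]

def sparse_conv1x1(x, w, ch_mask, sp_mask):
    """1x1 convolution as a matmul over only the kept channels/positions.
    x: (C, H, W) features; w: (C_out, C) kernel;
    ch_mask: (C,) bool channel mask; sp_mask: (H*W,) bool position mask."""
    C, H, W = x.shape
    xm = x.reshape(C, H * W)                 # features flattened to a matrix
    cols = xm[np.ix_(ch_mask, sp_mask)]      # keep rows (channels), cols (pixels)
    out = np.zeros((w.shape[0], H * W))
    out[:, sp_mask] = w[:, ch_mask] @ cols   # dense matmul on the kept subset
    return out.reshape(-1, H, W)

# Training time: sample soft, differentiable masks.
logits = rng.normal(size=(8, 2))
soft = gumbel_softmax_mask(logits)

# Test time: hard masks (argmax, no sampling), then accelerated inference.
ch_mask = logits[:, 1] > logits[:, 0]
sp_mask = rng.uniform(size=16) > 0.5
x = rng.normal(size=(8, 4, 4))
w = rng.normal(size=(6, 8))
y = sparse_conv1x1(x, w, ch_mask, sp_mask)
```

In this sketch the pruned spatial positions stay zero in the output, and the matmul only touches the surviving rows and columns, which is the source of the inference-time speedup the abstract claims; a real k x k convolution would use an im2col unfolding in place of the simple reshape.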