Salience-Guided Cascaded Suppression Network for Person Re-Identification

Chen, X; Fu, C; Zhao, Y; Zheng, F; Song, J; Ji, R; Yang, Y

Salience-Guided Cascaded Suppression Network for Person Re-Identification

Chen, X Fu, C Zhao, Y Zheng, F Song, J Ji, R Yang, Y

Permalink

Publisher:: IEEE
Publication Type:: Conference Proceeding
Citation:: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 00, pp. 3297-3307
Issue Date:: 2020-06

Closed Access

	Filename	Description	Size
	09156982.pdf	Published version	1.12 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Chen, X
dc.contributor.author	Fu, C
dc.contributor.author	Zhao, Y
dc.contributor.author	Zheng, F
dc.contributor.author	Song, J
dc.contributor.author	Ji, R
dc.contributor.author	Yang, Y https://orcid.org/0000-0002-0512-880X
dc.date	2020-06-13
dc.date.accessioned	2021-05-27T07:35:18Z
dc.date.available	2021-05-27T07:35:18Z
dc.date.issued	2020-06
dc.identifier.citation	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 00, pp. 3297-3307
dc.identifier.uri	http://hdl.handle.net/10453/149272
dc.description.abstract	Employing attention mechanisms to model both global and local features as a final pedestrian representation has become a trend for person re-identification (Re-ID) algorithms. A potential limitation of these methods is that they focus on the most salient features, but the re-identification of a person may rely on diverse clues masked by the most salient features in different situations, e.g., body, clothes or even shoes. To handle this limitation, we propose a novel Salience-guided Cascaded Suppression Network (SCSN) which enables the model to mine diverse salient features and integrate these features into the final representation by a cascaded manner. Our work makes the following contributions: (i) We observe that the previously learned salient features may hinder the network from learning other important information. To tackle this limitation, we introduce a cascaded suppression strategy, which enables the network to mine diverse potential useful features that be masked by the other salient features stage-by-stage and each stage integrates different feature embedding for the last discriminative pedestrian representation. (ii) We propose a Salient Feature Extraction (SFE) unit, which can suppress the salient features learned in the previous cascaded stage and then adaptively extracts other potential salient feature to obtain different clues of pedestrians. (iii) We develop an efficient feature aggregation strategy that fully increases the network's capacity for all potential salience features. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on four large-scale datasets. Especially, our approach exceeds the current best method by over 7% on the CUHK03 dataset.
dc.language	en
dc.publisher	IEEE
dc.relation.ispartof	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
dc.relation.ispartof	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition
dc.relation.isbasedon	10.1109/cvpr42600.2020.00336
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	Salience-Guided Cascaded Suppression Network for Person Re-Identification
dc.type	Conference Proceeding
utslib.citation.volume	00
utslib.location.activity	Seattle, WA, USA
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	closed_access	*
dc.date.updated	2021-05-27T07:35:17Z
pubs.finish-date	2020-06-19
pubs.publication-status	Published
pubs.start-date	2020-06-13
pubs.volume	00

Abstract:

Employing attention mechanisms to model both global and local features as a final pedestrian representation has become a trend for person re-identification (Re-ID) algorithms. A potential limitation of these methods is that they focus on the most salient features, but the re-identification of a person may rely on diverse clues masked by the most salient features in different situations, e.g., body, clothes or even shoes. To handle this limitation, we propose a novel Salience-guided Cascaded Suppression Network (SCSN) which enables the model to mine diverse salient features and integrate these features into the final representation by a cascaded manner. Our work makes the following contributions: (i) We observe that the previously learned salient features may hinder the network from learning other important information. To tackle this limitation, we introduce a cascaded suppression strategy, which enables the network to mine diverse potential useful features that be masked by the other salient features stage-by-stage and each stage integrates different feature embedding for the last discriminative pedestrian representation. (ii) We propose a Salient Feature Extraction (SFE) unit, which can suppress the salient features learned in the previous cascaded stage and then adaptively extracts other potential salient feature to obtain different clues of pedestrians. (iii) We develop an efficient feature aggregation strategy that fully increases the network's capacity for all potential salience features. Finally, experimental results demonstrate that our proposed method outperforms the state-of-the-art methods on four large-scale datasets. Especially, our approach exceeds the current best method by over 7% on the CUHK03 dataset.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/149272