Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation

Cao, X; Wei, X; Han, Y; Yang, Y; Sebe, N; Hauptmann, A

Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation

Cao, X Wei, X Han, Y Yang, Y

Sebe, N Hauptmann, A

Permalink

Publication Type:: Journal Article
Citation:: Computer Vision and Image Understanding, 2013, 117 (8), pp. 934 - 946
Issue Date:: 2013-05-06

Closed Access

	Filename	Description	Size
	1-s2.0-S1077314213000623-main.pdf	Published Version	3.06 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Cao, X	en_US
dc.contributor.author	Wei, X	en_US
dc.contributor.author	Han, Y	en_US
dc.contributor.author	Yang, Y https://orcid.org/0000-0001-5528-0546	en_US
dc.contributor.author	Sebe, N	en_US
dc.contributor.author	Hauptmann, A	en_US
dc.date.issued	2013-05-06	en_US
dc.identifier.citation	Computer Vision and Image Understanding, 2013, 117 (8), pp. 934 - 946	en_US
dc.identifier.issn	1077-3142	en_US
dc.identifier.uri	http://hdl.handle.net/10453/116354
dc.description.abstract	Image patterns at different spatial levels are well organized, such as regions within one image and feature points within one region. These classes of spatial structures are hierarchical in nature. The appropriate integration and utilization of such relationship are important to improve the performance of region tagging. Inspired by the recent advances of sparse coding methods, we propose an approach, called Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation. This approach consists of two steps: region representation and region reconstruction. In the first step, rather than using the ℓ1-norm as it is commonly done in sparse coding, we add a hierarchical structure to the process of sparse coding and form a framework of tree-guided dictionary learning. In this framework, the hierarchical structures among feature points, regions, and images are encoded by forming a tree-guided multi-task learning process. With the learned dictionary, we obtain a better representation of training and testing regions. In the second step, we propose to use a sub-hierarchical structure to guide the sparse reconstruction for testing regions, i.e., the structure between regions and images. Thanks to this hierarchy, the obtained reconstruction coefficients are more discriminate. Finally, tags are propagated to testing regions by the learned reconstruction coefficients. Extensive experiments on three public benchmark image data sets demonstrate that the proposed approach has better performance of region tagging than the current state of the art methods. © 2013 Elsevier Inc. All rights reserved.	en_US
dc.relation.ispartof	Computer Vision and Image Understanding	en_US
dc.relation.isbasedon	10.1016/j.cviu.2013.03.004	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation	en_US
dc.type	Journal Article
utslib.citation.volume	8	en_US
utslib.citation.volume	117	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	1702 Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	closed_access
pubs.issue	8	en_US
pubs.publication-status	Published	en_US
pubs.volume	117	en_US

Abstract:

Image patterns at different spatial levels are well organized, such as regions within one image and feature points within one region. These classes of spatial structures are hierarchical in nature. The appropriate integration and utilization of such relationship are important to improve the performance of region tagging. Inspired by the recent advances of sparse coding methods, we propose an approach, called Unified Dictionary Learning and Region Tagging with Hierarchical Sparse Representation. This approach consists of two steps: region representation and region reconstruction. In the first step, rather than using the ℓ1-norm as it is commonly done in sparse coding, we add a hierarchical structure to the process of sparse coding and form a framework of tree-guided dictionary learning. In this framework, the hierarchical structures among feature points, regions, and images are encoded by forming a tree-guided multi-task learning process. With the learned dictionary, we obtain a better representation of training and testing regions. In the second step, we propose to use a sub-hierarchical structure to guide the sparse reconstruction for testing regions, i.e., the structure between regions and images. Thanks to this hierarchy, the obtained reconstruction coefficients are more discriminate. Finally, tags are propagated to testing regions by the learned reconstruction coefficients. Extensive experiments on three public benchmark image data sets demonstrate that the proposed approach has better performance of region tagging than the current state of the art methods. © 2013 Elsevier Inc. All rights reserved.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/116354