Sparse Transfer Learning for Interactive Video Search Reranking

Tian, X; Tao, D; Rui, Y

Sparse Transfer Learning for Interactive Video Search Reranking

Tian, X Tao, D

Rui, Y

Permalink

Publication Type:: Journal Article
Citation:: ACM Transactions on Multimedia Computing, Communications and Applications, 2012, 8 (3), pp. 1 - 19
Issue Date:: 2012-07-01

Closed Access

	Filename	Description	Size
	2012001417OK.pdf		8.92 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Tian, X	en_US
dc.contributor.author	Tao, D https://orcid.org/0000-0001-7225-5449	en_US
dc.contributor.author	Rui, Y	en_US
dc.date.issued	2012-07-01	en_US
dc.identifier.citation	ACM Transactions on Multimedia Computing, Communications and Applications, 2012, 8 (3), pp. 1 - 19	en_US
dc.identifier.issn	1551-6857	en_US
dc.identifier.uri	http://hdl.handle.net/10453/22851
dc.description.abstract	Visual reranking is effective to improve the performance of the text-based video search. However, existing reranking algorithms can only achieve limited improvement because of the well-known semantic gap between low-level visual features and highlevel semantic concepts. In this article, we adopt interactive video search reranking to bridge the semantic gap by introducing user's labeling effort. We propose a novel dimension reduction tool, termed sparse transfer learning (STL), to effectively and efficiently encode user's labeling information. STL is particularly designed for interactive video search reranking. Technically, it (a) considers the pair-wise discriminative information to maximally separate labeled query relevant samples from labeled query irrelevant ones, (b) achieves a sparse representation for the subspace to encodes user's intention by applying the elastic net penalty, and (c) propagates user's labeling information from labeled samples to unlabeled samples by using the data distribution knowledge. We conducted extensive experiments on the TRECVID 2005, 2006 and 2007 benchmark datasets and compared STL with popular dimension reduction algorithms. We report superior performance by using the proposed STL-based interactive video search reranking. © 2012, ACM. All rights reserved.	en_US
dc.relation	http://purl.org/au-research/grants/arc/DP120103730
dc.relation.ispartof	ACM Transactions on Multimedia Computing, Communications and Applications	en_US
dc.relation.isbasedon	10.1145/2240136.2240139	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Sparse Transfer Learning for Interactive Video Search Reranking	en_US
dc.type	Journal Article
utslib.citation.volume	3	en_US
utslib.citation.volume	8	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0803 Computer Software	en_US
utslib.for	0805 Distributed Computing	en_US
utslib.for	0806 Information Systems	en_US
dc.location.activity	Nagoya, Japan
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.issue	3	en_US
pubs.publication-status	Published	en_US
pubs.volume	8	en_US

Abstract:

Visual reranking is effective to improve the performance of the text-based video search. However, existing reranking algorithms can only achieve limited improvement because of the well-known semantic gap between low-level visual features and highlevel semantic concepts. In this article, we adopt interactive video search reranking to bridge the semantic gap by introducing user's labeling effort. We propose a novel dimension reduction tool, termed sparse transfer learning (STL), to effectively and efficiently encode user's labeling information. STL is particularly designed for interactive video search reranking. Technically, it (a) considers the pair-wise discriminative information to maximally separate labeled query relevant samples from labeled query irrelevant ones, (b) achieves a sparse representation for the subspace to encodes user's intention by applying the elastic net penalty, and (c) propagates user's labeling information from labeled samples to unlabeled samples by using the data distribution knowledge. We conducted extensive experiments on the TRECVID 2005, 2006 and 2007 benchmark datasets and compared STL with popular dimension reduction algorithms. We report superior performance by using the proposed STL-based interactive video search reranking. © 2012, ACM. All rights reserved.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/22851