A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization

Wang, J; Xu, M; He, X; Lu, H; Hoang, D

Publication Type: Journal Article
Citation: Signal Processing, 2014, 94 (1), pp. 33-47
Issue Date: 2014-01-01

Closed Access

Filename	Description	Size	Format
a hybrid domain enhanced framework for video retargeting.pdf	Published Version	9.85 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wang, J	en_US
dc.contributor.author	Xu, M https://orcid.org/0000-0001-9581-8849	en_US
dc.contributor.author	He, X https://orcid.org/0000-0001-8962-540X	en_US
dc.contributor.author	Lu, H	en_US
dc.contributor.author	Hoang, D https://orcid.org/0000-0003-1798-4926	en_US
dc.date.issued	2014-01-01	en_US
dc.identifier.citation	Signal Processing, 2014, 94 (1), pp. 33 - 47	en_US
dc.identifier.issn	0165-1684	en_US
dc.identifier.uri	http://hdl.handle.net/10453/34105
dc.relation.ispartof	Signal Processing	en_US
dc.relation.isbasedon	10.1016/j.sigpro.2013.06.007	en_US
dc.subject.classification	Networking & Telecommunications	en_US
dc.title	A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization	en_US
dc.type	Journal Article
utslib.citation.volume	1	en_US
utslib.citation.volume	94	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	09 Engineering	en_US
utslib.for	10 Technology	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - CRIN - Realtime Information Networks
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Strength - INEXT - Innovation in IT Services and Applications
utslib.copyright.status	closed_access
pubs.issue	1	en_US
pubs.publication-status	Published	en_US
pubs.volume	94	en_US

Abstract:

Ubiquitous video access is now in high demand for online video applications. One major challenge is that video services must adapt to different device capabilities. Pervasive multimedia devices require video retargeting that is both accurate and comfortable for users: letting users see their preferred content accurately directly affects their viewing comfort. User preferences for video content differ across video domains. In this paper, we present a hybrid framework for video retargeting with a domain-enhanced spatial-temporal grid optimization. First, we parse videos from low-level features to high-level visual concepts and combine them with visual attention to obtain an accurate importance description. Second, we build a semantic importance map representing spatial importance and temporal continuity, and incorporate it with a 3D rectilinear grid scaleplate to map frames to a target display, thereby preserving the aspect ratio of semantically salient objects as well as perceptual coherency. Extensive evaluations are conducted on five typical video genres: sports, advertisements, lectures, news, and surveillance. Comparisons with state-of-the-art approaches on both images and videos demonstrate the advantages of the proposed approach. © 2013 Elsevier B.V.
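
To make the idea of importance-driven grid retargeting concrete, here is a minimal sketch in Python. It is not the authors' algorithm: the paper's semantic parsing and 3D grid optimization are not described in this record, so the sketch substitutes a much simpler closed-form rule. The function names (column_importance, retarget_column_widths), the use of unit-width columns, and the choice of a maximum over frames as a stand-in for temporal coherence are all assumptions made for illustration.

```python
def column_importance(frames):
    """Collapse importance maps (frames x rows x cols) to one value per column.

    Assumption for this sketch: taking the maximum over time and rows keeps a
    column protected if it is important in any frame, so every frame shares
    the same grid. This is only a crude stand-in for temporal coherence.
    """
    n_cols = len(frames[0][0])
    return [
        max(row[c] for frame in frames for row in frame)
        for c in range(n_cols)
    ]


def retarget_column_widths(importance, target_width):
    """Return new widths for unit-width grid columns that sum to target_width.

    Each importance value is expected in [0, 1]. Columns with importance 1
    keep their width; the required shrinkage is spread over the remaining
    columns in proportion to (1 - importance).
    """
    n = len(importance)
    deficit = n - target_width            # total width that must be removed
    flexibility = [1.0 - w for w in importance]
    total_flex = sum(flexibility)
    if deficit <= 0 or deficit > total_flex:
        # Enlarging, or not enough slack in unimportant columns:
        # fall back to plain uniform scaling.
        return [target_width / n] * n
    return [1.0 - deficit * f / total_flex for f in flexibility]


# Two one-row frames; the salient object moves from column 1 to column 2.
frames = [
    [[0.0, 1.0, 0.0, 0.0]],
    [[0.0, 0.0, 1.0, 0.0]],
]
widths = retarget_column_widths(column_importance(frames), target_width=3)
print(widths)
```

In this toy case the two columns that hold the salient object in either frame keep their full width, and the two outer columns absorb all of the shrinkage. A real system would solve for row and column sizes jointly and would constrain the aspect ratio of salient regions, which this sketch does not attempt.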

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/34105