A hybrid domain enhanced framework for video retargeting with spatial-temporal importance and 3D grid optimization

Publication Type:
Journal Article
Signal Processing, 2014, 94 (1), pp. 33 - 47
Issue Date:
Filename Description Size
Thumbnaila hybrid domain enhanced framework for video retargeting.pdfPublished Version9.85 MB
Adobe PDF
Full metadata record
Recently, a ubiquitous video access is highly demanded for online video applications. One big challenge is that video service needs to adapt different device capabilities. Pervasive multimedia devices require an accurate and user comfort video retargeting. Letting users see their preferred content accurately directly affects their comforts. User preferences on video contents are different in various video domains. In this paper, we present a hybrid framework of video retargeting with a domain enhanced spatial-temporal grid optimization. First, we parse videos from low-level features to high-level visual concepts, combining with visual attention for an accurate importance description. Second, a semantic importance map is built up representing the spatial importance and temporal continuity, which is incorporated with a 3D rectilinear grid scaleplate to map frames to a target display, thereby keeping the aspect ratio of semantically salient objects as well as the perceptual coherency. Extensive evaluations are made on five typical video genres, i.e. sports, advertisements, lecture, news and surveillance. The comparison with the state-of-the-art approaches on both images and videos have demonstrated the advantages of the proposed approach. © 2013 Elsevier B.V.
Please use this identifier to cite or link to this item: