Discriminative optical flow tensor for video semantic analysis

Publication Type:
Journal Article
Citation:
Computer Vision and Image Understanding, 2009, 113 (3), pp. 372 - 383
Issue Date:
2009-03-01
Filename Description Size
Thumbnail2011000245OK.pdf1.27 MB
Adobe PDF
Full metadata record
This paper presents a novel framework for effective video semantic analysis. This framework has two major components, namely, optical flow tensor (OFT) and hidden Markov models (HMMs). OFT and HMMs are employed because: (1) motion is one of the fundamental characteristics reflecting the semantic information in video, so an OFT-based feature extraction method is developed to make full use of the motion information. Thereafter, to preserve the structure and discriminative information presented by OFT, general tensor discriminant analysis (GTDA) is used for dimensionality reduction. Finally, linear discriminant analysis (LDA) is utilized to further reduce the feature dimension for discriminative motion information representation; and (2) video is a sort of information intensive sequential media characterized by its context-sensitive nature, so the video sequences can be more effectively analyzed by some temporal modeling tools. In this framework, we use HMMs to well model different levels of semantic units (SU), e.g., shot and event. Experimental results are reported to demonstrate the advantages of the proposed framework upon semantic analysis of basketball video sequences, and the cross validations illustrate its feasibility and effectiveness. © 2008 Elsevier Inc. All rights reserved.
Please use this identifier to cite or link to this item: