Discriminative optical flow tensor for video semantic analysis

Gao, X; Yang, Y; Tao, D; Li, X

Publication Type: Journal Article
Citation: Computer Vision and Image Understanding, 2009, 113 (3), pp. 372 - 383
Issue Date: 2009-03-01

Closed Access

Filename	Size
2011000245OK.pdf	1.27 MB

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Gao, X	en_US
dc.contributor.author	Yang, Y	en_US
dc.contributor.author	Tao, D https://orcid.org/0000-0001-7225-5449	en_US
dc.contributor.author	Li, X	en_US
dc.date.issued	2009-03-01	en_US
dc.identifier.citation	Computer Vision and Image Understanding, 2009, 113 (3), pp. 372 - 383	en_US
dc.identifier.issn	1077-3142	en_US
dc.identifier.uri	http://hdl.handle.net/10453/15127
dc.relation.ispartof	Computer Vision and Image Understanding	en_US
dc.relation.isbasedon	10.1016/j.cviu.2008.08.007	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Discriminative optical flow tensor for video semantic analysis	en_US
dc.type	Journal Article
utslib.citation.volume	3	en_US
utslib.citation.volume	113	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	1702 Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.issue	3	en_US
pubs.publication-status	Published	en_US
pubs.volume	113	en_US

Abstract:

This paper presents a novel framework for effective video semantic analysis. The framework has two major components: the optical flow tensor (OFT) and hidden Markov models (HMMs). They are employed for two reasons. (1) Motion is one of the fundamental characteristics reflecting the semantic information in video, so an OFT-based feature extraction method is developed to make full use of the motion information. To preserve the structure and discriminative information presented by the OFT, general tensor discriminant analysis (GTDA) is then used for dimensionality reduction, and linear discriminant analysis (LDA) further reduces the feature dimension to yield a discriminative representation of the motion information. (2) Video is an information-intensive sequential medium characterized by its context-sensitive nature, so video sequences can be analyzed more effectively with temporal modeling tools. In this framework, HMMs model semantic units (SUs) at different levels, e.g., shot and event. Experimental results on basketball video sequences demonstrate the advantages of the proposed framework for semantic analysis, and cross-validation illustrates its feasibility and effectiveness. © 2008 Elsevier Inc. All rights reserved.
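
The temporal-modeling step can be illustrated with a minimal sketch. It assumes one discrete-output HMM per semantic class and labels a sequence with the class whose model gives the highest likelihood. This is not the paper's implementation: the class names (`static`, `dynamic`), the two-symbol motion alphabet, and every model parameter are hypothetical, and the OFT, GTDA, and LDA feature pipeline is replaced by an already-quantized symbol sequence.

```python
import numpy as np

def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete-output HMM."""
    alpha = start * emit[:, obs[0]]          # joint prob. of first symbol and each state
    log_p = 0.0
    for symbol in obs[1:]:
        scale = alpha.sum()                  # rescale to avoid numerical underflow
        log_p += np.log(scale)
        alpha = (alpha / scale) @ trans * emit[:, symbol]
    return log_p + np.log(alpha.sum())

# Hypothetical two-state models over symbols 0 = low motion, 1 = high motion.
start = np.array([0.5, 0.5])
trans = np.array([[0.9, 0.1],
                  [0.1, 0.9]])
models = {
    "static":  (start, trans, np.array([[0.9, 0.1], [0.8, 0.2]])),
    "dynamic": (start, trans, np.array([[0.2, 0.8], [0.1, 0.9]])),
}

def classify(obs, models):
    """Return the name of the model with the highest log-likelihood for obs."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

print(classify([1, 1, 1, 0, 1], models))  # prints "dynamic"
```

In practice the parameters would be estimated from labeled training sequences (for example with Baum-Welch) rather than written by hand, and the observations would be the reduced feature vectors rather than two symbols.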

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/15127