Piece-wise linearity based method for text frame classification in video

Sharma, N; Shivakumara, P; Pal, U; Blumenstein, M; Tan, CL

Publication Type: Journal Article
Citation: Pattern Recognition, 2015, 48 (3), pp. 862-881
Issue Date: 2015-01-01

Closed Access

	Filename	Description	Size	Format
	p.pdf	Published Version	1.91 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Sharma, N https://orcid.org/0000-0003-0841-1245	en_US
dc.contributor.author	Shivakumara, P	en_US
dc.contributor.author	Pal, U	en_US
dc.contributor.author	Blumenstein, M https://orcid.org/0000-0002-9908-3744	en_US
dc.contributor.author	Tan, CL	en_US
dc.date.issued	2015-01-01	en_US
dc.identifier.citation	Pattern Recognition, 2015, 48 (3), pp. 862 - 881	en_US
dc.identifier.issn	0031-3203	en_US
dc.identifier.uri	http://hdl.handle.net/10453/121765
dc.relation.ispartof	Pattern Recognition	en_US
dc.relation.isbasedon	10.1016/j.patcog.2014.09.012	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Piece-wise linearity based method for text frame classification in video	en_US
dc.type	Journal Article
utslib.citation.volume	3	en_US
utslib.citation.volume	48	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0806 Information Systems	en_US
utslib.for	0906 Electrical and Electronic Engineering	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
pubs.organisational-group	/University of Technology Sydney/Strength - QSI - Centre for Quantum Software and Information
utslib.copyright.status	closed_access
pubs.issue	3	en_US
pubs.publication-status	Published	en_US
pubs.volume	48	en_US

Abstract:

© 2014 Elsevier Ltd. All rights reserved. The aim of a text frame classification technique is to label a video frame as text or non-text before text detection and recognition. It is an essential step prior to text detection because text detection methods assume the input to be a text frame. Consequently, when a non-text frame is subjected to text detection, the precision of the text detection method decreases because of false positives. In this paper, a new text frame classification approach based on component linearity is proposed. The method first obtains probable text clusters from the gradient values of the RGB images of an input video frame. The Sobel edges corresponding to the text cluster are then extracted and used for further processing. Next, the method eliminates false text components before performing a linearity check, in which the linearity of the text components is determined from their centroids in a piece-wise manner. If the components in a frame satisfy the defined linearity condition, the frame is considered a text frame; otherwise, it is considered a non-text frame. The proposed method has been tested on standard text and non-text datasets of different orientations to demonstrate that it is independent of orientation. A comparative study with the existing method shows that the proposed method is superior in terms of classification rate and processing time.
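
The abstract does not state the exact linearity condition, so the following is only a minimal sketch of what a piece-wise linearity check on component centroids could look like. It assumes the centroids have already been extracted and ordered into chains of neighbouring components. Each "piece" is taken to be three consecutive centroids, and a piece counts as linear when the middle centroid lies within a small fraction of the piece length from the line through the outer two. The function names, the tolerance `tol`, and the run length `min_run` are hypothetical choices made for illustration and are not taken from the paper.

```python
import math


def is_locally_linear(p, q, r, tol=0.15):
    """Return True if centroid q lies close to the straight line through p and r.

    tol is a hypothetical tolerance: the allowed perpendicular distance of q
    from line pr, expressed as a fraction of the distance between p and r.
    """
    (px, py), (qx, qy), (rx, ry) = p, q, r
    base = math.hypot(rx - px, ry - py)
    if base == 0.0:
        return False  # degenerate piece: the outer centroids coincide
    # |cross product| / base is the perpendicular distance of q from line pr.
    cross = abs((rx - px) * (qy - py) - (ry - py) * (qx - px))
    return cross / base <= tol * base


def longest_linear_run(centroids, tol=0.15):
    """Number of centroids in the longest run whose consecutive triples are all linear."""
    best = run = 0
    for i in range(len(centroids) - 2):
        if is_locally_linear(centroids[i], centroids[i + 1], centroids[i + 2], tol):
            run = run + 1 if run else 3  # the first passing triple covers 3 centroids
            best = max(best, run)
        else:
            run = 0
    return best


def classify_frame(chains, min_run=4, tol=0.15):
    """Label a frame "text" if any centroid chain has a long enough linear run.

    chains is a list of ordered centroid lists; min_run is an assumed threshold.
    """
    if any(longest_linear_run(chain, tol) >= min_run for chain in chains):
        return "text"
    return "non-text"


if __name__ == "__main__":
    diagonal_caption = [(0, 0), (10, 10), (20, 20), (30, 30)]
    scattered_edges = [(0, 0), (10, 30), (20, -5), (5, 40)]
    print(classify_frame([diagonal_caption]))
    print(classify_frame([scattered_edges]))
```

Because the test uses only distances relative to each piece, it gives the same answer when the centroids are rotated, which is in the spirit of the orientation independence claimed in the abstract. Checking short pieces rather than fitting one global line also lets a gently curved text line pass.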

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/121765