A framework for application-driven classification of data streams

Zhang, P; Gao, BJ; Liu, P; Shi, Y; Guo, L

A framework for application-driven classification of data streams

Zhang, P

Gao, BJ Liu, P Shi, Y Guo, L

Permalink

Publisher:: Elsevier
Publication Type:: Journal Article
Citation:: Neurocomputing, 2012, 92 (1), pp. 170 - 182
Issue Date:: 2012-01

Closed Access

	Filename	Description	Size
	2013005143OK.pdf		882.07 kB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhang, P https://orcid.org/0000-0001-7973-2746	en_US
dc.contributor.author	Gao, BJ	en_US
dc.contributor.author	Liu, P	en_US
dc.contributor.author	Shi, Y	en_US
dc.contributor.author	Guo, L	en_US
dc.date.issued	2012-01	en_US
dc.identifier.citation	Neurocomputing, 2012, 92 (1), pp. 170 - 182	en_US
dc.identifier.issn	0925-2312	en_US
dc.identifier.uri	http://hdl.handle.net/10453/28900
dc.description.abstract	Data stream classification has drawn increasing attention from the data mining community in recent years. Relevant applications include network traffic monitoring, sensor network data analysis, Web click stream mining, power consumption measurement, dynamic tracing of stock fluctuations, to name a few. Data stream classification in such real-world applications is typically subject to three major challenges: concept drifting, large volumes, and partial labeling. As a result, training examples in data streams can be very diverse and it is very hard to learn accurate models with efficiency. In this paper, we propose a novel framework that first categorizes diverse training examples into four types and assign learning priorities to them. Then, we derive four learning cases based on the proportion and priority of the different types of training examples. Finally, for each learning case, we employ one of the four SVM-based training models: classical SVM, semi-supervised SVM, transfer semi-supervised SVM, and relational k-means transfer semi-supervised SVM. We perform comprehensive experiments on real-world data streams that validate the utility of our approach	en_US
dc.publisher	Elsevier	en_US
dc.relation.ispartof	Neurocomputing	en_US
dc.relation.isbasedon	10.1016/j.neucom.2011.11.026	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	A framework for application-driven classification of data streams	en_US
dc.type	Journal Article
utslib.citation.volume	1	en_US
utslib.citation.volume	92	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	09 Engineering	en_US
utslib.for	17 Psychology and Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.consider-herdc	false	en_US
pubs.issue	1	en_US
pubs.volume	92	en_US

Abstract:

Data stream classification has drawn increasing attention from the data mining community in recent years. Relevant applications include network traffic monitoring, sensor network data analysis, Web click stream mining, power consumption measurement, dynamic tracing of stock fluctuations, to name a few. Data stream classification in such real-world applications is typically subject to three major challenges: concept drifting, large volumes, and partial labeling. As a result, training examples in data streams can be very diverse and it is very hard to learn accurate models with efficiency. In this paper, we propose a novel framework that first categorizes diverse training examples into four types and assign learning priorities to them. Then, we derive four learning cases based on the proportion and priority of the different types of training examples. Finally, for each learning case, we employ one of the four SVM-based training models: classical SVM, semi-supervised SVM, transfer semi-supervised SVM, and relational k-means transfer semi-supervised SVM. We perform comprehensive experiments on real-world data streams that validate the utility of our approach

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/28900