Robust ensemble learning for mining noisy data streams

Zhang, P; Zhu, X; Shi, Y; Guo, L; Wu, X

Robust ensemble learning for mining noisy data streams

Zhang, P

Zhu, X Shi, Y Guo, L Wu, X

Permalink

Publication Type:: Journal Article
Citation:: Decision Support Systems, 2011, 50 (2), pp. 469 - 479
Issue Date:: 2011-01-01

Closed Access

	Filename	Description	Size
	2011000602OK.pdf		777.01 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhang, P https://orcid.org/0000-0001-7973-2746	en_US
dc.contributor.author	Zhu, X	en_US
dc.contributor.author	Shi, Y	en_US
dc.contributor.author	Guo, L	en_US
dc.contributor.author	Wu, X	en_US
dc.date.issued	2011-01-01	en_US
dc.identifier.citation	Decision Support Systems, 2011, 50 (2), pp. 469 - 479	en_US
dc.identifier.issn	0167-9236	en_US
dc.identifier.uri	http://hdl.handle.net/10453/18236
dc.description.abstract	In this paper, we study the problem of learning from concept drifting data streams with noise, where samples in a data stream may be mislabeled or contain erroneous values. Our essential goal is to build a robust prediction model from noisy stream data to accurately predict future samples. For noisy data sources, most existing works rely on data preprocessing techniques to cleanse noisy samples before the training of decision models. In data stream environments, these data preprocessing techniques are, unfortunately, hard to apply, mainly because the concept drifting in a data stream may make it very difficult to differentiate noise from samples of changing concepts. Accordingly, we propose an aggregate ensemble (AE) learning framework. The aim of AE is to build a robust ensemble model that can tolerate data errors. Theoretical and empirical studies on both synthetic and real-world data streams demonstrate that the proposed AE learning framework is capable of building accurate classification models from noisy data streams. © 2010 Elsevier B.V. All rights reserved.	en_US
dc.relation	http://purl.org/au-research/grants/arc/DP1093762
dc.relation.ispartof	Decision Support Systems	en_US
dc.relation.isbasedon	10.1016/j.dss.2010.11.004	en_US
dc.subject.classification	Information Systems	en_US
dc.title	Robust ensemble learning for mining noisy data streams	en_US
dc.type	Journal Article
utslib.citation.volume	2	en_US
utslib.citation.volume	50	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	01 Mathematical Sciences	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	15 Commerce, Management, Tourism and Services	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
utslib.copyright.status	closed_access
pubs.issue	2	en_US
pubs.publication-status	Published	en_US
pubs.volume	50	en_US

Abstract:

In this paper, we study the problem of learning from concept drifting data streams with noise, where samples in a data stream may be mislabeled or contain erroneous values. Our essential goal is to build a robust prediction model from noisy stream data to accurately predict future samples. For noisy data sources, most existing works rely on data preprocessing techniques to cleanse noisy samples before the training of decision models. In data stream environments, these data preprocessing techniques are, unfortunately, hard to apply, mainly because the concept drifting in a data stream may make it very difficult to differentiate noise from samples of changing concepts. Accordingly, we propose an aggregate ensemble (AE) learning framework. The aim of AE is to build a robust ensemble model that can tolerate data errors. Theoretical and empirical studies on both synthetic and real-world data streams demonstrate that the proposed AE learning framework is capable of building accurate classification models from noisy data streams. © 2010 Elsevier B.V. All rights reserved.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/18236