Elastic gradient boosting decision tree with adaptive iterations for concept drift adaptation

Publisher:
Elsevier
Publication Type:
Journal Article
Citation:
Neurocomputing, 2022, 491, pp. 288-304
Issue Date:
2022-06-28
As an excellent ensemble algorithm, Gradient Boosting Decision Tree (GBDT) has been tested extensively with static data. However, real-world applications often involve dynamic data streams, which suffer from concept drift problems where the data distribution changes over time. The performance of a GBDT model degrades when it is applied to predict data streams with concept drift. Although incremental learning can help to alleviate such degradation, finding a perfect learning rate (i.e., the number of iterations in GBDT) that suits all time periods with all their different drift severity levels can be difficult. In this paper, we convert the issue of determining an optimal learning rate into the issue of choosing the best adaptive iterations when tuning GBDT. We theoretically prove that drift severity is closely related to the convergence rate of the model. Accordingly, we propose a novel drift adaptation method, called adaptive iterations (AdIter), that automatically chooses the number of iterations for different drift severities to improve the prediction accuracy for data streams under concept drift. In a series of comprehensive tests against seven state-of-the-art drift adaptation methods on both synthetic and real-world data, AdIter yielded superior accuracy.
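The core idea sketched in the abstract — growing a GBDT incrementally and letting the number of added boosting iterations depend on how severe the current drift is — can be illustrated with scikit-learn's warm-start mechanism. This is a minimal sketch under assumptions of ours: the drift-severity proxy (recent prediction error) and the severity-to-iterations mapping below are illustrative placeholders, not the actual AdIter rule derived in the paper.

```python
# Hypothetical sketch: incremental GBDT updates where the number of extra
# boosting iterations grows with a drift-severity proxy. The mapping from
# severity to iterations is illustrative, not the AdIter method itself.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(0)

def make_batch(n, slope):
    """Synthetic stream batch; changing `slope` simulates concept drift."""
    X = rng.uniform(-1.0, 1.0, size=(n, 2))
    y = slope * X[:, 0] + X[:, 1] ** 2
    return X, y

# warm_start=True lets later fit() calls append trees instead of refitting.
model = GradientBoostingRegressor(n_estimators=50, warm_start=True,
                                  random_state=0)
X0, y0 = make_batch(500, slope=1.0)
model.fit(X0, y0)

for slope in (1.0, 1.5, 3.0):  # batches with increasing drift severity
    X, y = make_batch(500, slope=slope)
    err = float(np.mean((model.predict(X) - y) ** 2))  # severity proxy
    extra = int(min(100, 10 + 50 * err))  # more iterations for harder drift
    model.n_estimators += extra           # existing trees are kept
    model.fit(X, y)                       # only the new iterations are fitted
```

With `warm_start=True`, raising `n_estimators` before calling `fit` boosts only the additional iterations on the new batch, which is why the adaptive choice of that increment governs how quickly the ensemble tracks the drifted distribution.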