Detecting group concept drift from multiple data streams

Publisher:
Elsevier
Publication Type:
Journal Article
Citation:
Pattern Recognition, 2023, 134, pp. 109113
Issue Date:
2023-02-01
Concept drift, caused by unforeseeable changes in the underlying distribution of data, may lead to a sharp downturn in the performance of streaming data-based algorithms. In this paper, we are mainly concerned with concept drift across multiple data streams, in situations where the drift in each individual stream cannot be detected in time because its underlying distribution changes only slightly. We call this group concept drift. Compared with detecting concept drift in a single data stream, the challenges of detecting group concept drift arise from three aspects: first, the training data become more complex; second, the underlying distribution becomes more complex; and third, the correlations between data streams become more complex. To address these challenges, the key idea of our method is to construct a distribution-free test statistic, that is, one that does not depend on the underlying distributions of the multiple data streams. Then, for streaming data, we design an online learning algorithm to obtain this test statistic and use it in a hypothesis test to determine whether concept drift has occurred. Experimental evaluations on both synthetic and real-world datasets show that our method can accurately detect concept drift from multiple data streams.
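The idea of testing a group of streams jointly, without assuming a distribution, can be illustrated with a small sketch. This is not the test statistic or the online algorithm from the paper, which the abstract does not specify. It swaps in a generic permutation test, which is distribution-free under the assumption that rows are exchangeable when no drift has occurred. The function names, window sizes, and the mean-difference statistic are all hypothetical choices made for illustration.

```python
import random

def group_statistic(ref, cur):
    """Sum over streams of the squared difference between window means.

    ref and cur are lists of rows; each row holds one value per stream.
    (Illustrative statistic only, not the one used in the paper.)
    """
    n_streams = len(ref[0])
    total = 0.0
    for j in range(n_streams):
        mean_ref = sum(row[j] for row in ref) / len(ref)
        mean_cur = sum(row[j] for row in cur) / len(cur)
        total += (mean_cur - mean_ref) ** 2
    return total

def permutation_p_value(ref, cur, n_perm=200, seed=0):
    """Permutation p-value for 'no drift between ref and cur'.

    Whole rows are shuffled, so the correlations between streams
    are preserved in every permuted sample.
    """
    rng = random.Random(seed)
    observed = group_statistic(ref, cur)
    pooled = list(ref) + list(cur)
    n_ref = len(ref)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if group_statistic(pooled[:n_ref], pooled[n_ref:]) >= observed:
            exceed += 1
    # The +1 correction keeps the p-value strictly positive.
    return (exceed + 1) / (n_perm + 1)

def make_window(rng, n_rows, n_streams, shift):
    """Synthetic window: independent Gaussian streams with a common mean shift."""
    return [[rng.gauss(shift, 1.0) for _ in range(n_streams)]
            for _ in range(n_rows)]

if __name__ == "__main__":
    rng = random.Random(42)
    reference = make_window(rng, 100, 20, shift=0.0)
    current = make_window(rng, 100, 20, shift=0.3)  # slight drift in every stream
    print("group p-value:", permutation_p_value(reference, current))
```

In this toy setup the shift in any single stream is small relative to the noise in a 100-row window, but summing the evidence over 20 streams makes the group-level change much easier to detect, which mirrors the motivation given for group concept drift.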