Alizadeh Mansouri, Ali (2024) Models and Algorithms for Concept Drift Detection, Adaptation, and Resolution in Streaming Data. PhD thesis, Concordia University.
Preview |
Text (application/pdf)
8MBAlizadehMansouri_PhD_S2025.pdf - Accepted Version Available under License Spectrum Terms of Access. |
Abstract
The evolution of streaming data during long periods of time presents significant challenges for maintaining the accuracy and efficiency of predictive models due to concept drift — where changes in data distribution can lead to performance degradation. In this research, we study the problems of concept drift detection (CDD) and adaptation (CDA). Unlike traditional approaches that treat CDD and CDA independently and in isolation, often under non-streaming, static conditions, we propose a novel methodology based on multivariate vector error-correction analysis of feature importance measures (FIMs). The FIMs provided a solid foundation that allowed us to reformulate concept drift detection and adaptation in streaming data.
We additionally introduce, formalize, and develop the notion of concept drift resolution (CDR) as an innovative model preference technique. This solution further enhances the overall performance by effectively using multiple models undergoing concept drift, including the main learner and the proposed CDA model. The results of our numerous experiments and analyses indicate that the proposed CDD method significantly reduces computation time, particularly in applications experiencing abrupt drifts, while our CDA model delivers notable improvements in prediction accuracy and F1 score on both gradual drift and abrupt drift datasets, outperforming existing methods on varying drift rates and characteristics of concept drift.
By utilizing FIMs as a common basis, we develop a unified framework that integrates CDD, CDA, and CDR tasks, thus bridging the gap between detection and adaptation. Extensive experiments validate the effectiveness of our proposed methods, demonstrating their applicability in various real-world and synthetic benchmark datasets.
This work not only advances the understanding of concept drift in streaming data but also provides a general solution framework that balances performance with interpretability, thus paving the way for development of more reliable and explainable data-driven applications and systems.
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering |
---|---|
Item Type: | Thesis (PhD) |
Authors: | Alizadeh Mansouri, Ali |
Institution: | Concordia University |
Degree Name: | Ph. D. |
Program: | Computer Science |
Date: | 7 November 2024 |
Thesis Supervisor(s): | Shiri, Nematollaah |
Keywords: | concept drift, stream processing, data stream, concept drift detection, concept drift adaptation, concept drift resolution |
ID Code: | 994935 |
Deposited By: | Ali Alizadeh Mansouri |
Deposited On: | 17 Jun 2025 13:59 |
Last Modified: | 17 Jun 2025 13:59 |
Repository Staff Only: item control page