Understanding Data Drift and Concept Drift in Machine Learning Systems

Authors

  • Sandeep Bharadwaj Mannapur Jawaharlal Nehru Technological University, Hyderabad, India Author

DOI:

https://doi.org/10.32628/CSEIT25111239

Keywords:

Machine Learning Drift Detection, Concept Drift Analysis, Model Performance Degradation, Real-time Monitoring Systems, Adaptive Model Maintenance

Abstract

This comprehensive article examines the critical challenges of data drift and concept drift in machine learning systems deployed across various industries. The article explores how these phenomena affect model performance in production environments, with a particular focus on healthcare, manufacturing, and autonomous systems. The article analyzes different types of drift, including covariate shifts and prior probability shifts, while exploring their manifestations and impacts. Through findings of real-world implementations, the article presents advanced detection methodologies and mitigation strategies, ranging from statistical approaches to sophisticated monitoring frameworks. The investigation extends to emerging technologies in sustainable manufacturing and edge computing environments, offering insights into future developments in drift management. The findings emphasize the importance of proactive drift detection and adaptive model maintenance for ensuring continued system reliability and performance.

Downloads

Download data is not yet available.

References

Ali Kore, Elyar Abbasi Bavil, et al., "Empirical data drift detection experiments on real-world medical imaging data," PMC Clinical Informatics, vol. 15, no. 3, pp. 245-267, 2024. [Online]. Available: https://pmc.ncbi.nlm.nih.gov/articles/PMC10904813/

Surya Gangadhar Patchipala, et al., "Corresponding author: Surya Gangadhar Patchipala Tackling data and model drift in AI: Strategies for maintaining accuracy during ML model inference," International Journal of Science and Research Archive, 2024. [Online]. Available: https://www.researchgate.net/publication/386282249

Amy B.Z. Zhang, et al., "Quantifying Exploration Preference for E-Commerce Recommendation," CEUR Workshop Proceedings, vol. 3549, pp. 45-56, 2023. [Online]. Available: https://ceur-ws.org/Vol-3549/paper5.pdf

Rashmi Benni, Shashikumar Totad, "Impact analysis of real and virtual concept drifts on the predictive performance of classifiers," Procedia Computer Science, Volume 235, 2024. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S1877050924007373

Jan Zenisek, et al., "Machine learning based concept drift detection for predictive maintenance," Computers & Industrial Engineering, Volume 137, November 2019, 106031. [Online]. Available: https://www.sciencedirect.com/science/article/abs/pii/S0360835219304905

Supriya Agrahari, et al., "Concept Drift Detection in Data Stream Mining : A literature review," Journal of King Saud University - Computer and Information Sciences, Volume 34, Issue 10, Part B, November 2022, Pages 9523-9540. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S1319157821003062

Abdul Razak M. S, et al., "A survey on detecting healthcare concept drift in AI/ML models from a finance perspective," Frontiers in Artificial Intelligence, vol. 5, pp. 955314, 2022. [Online]. Available: https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2022.955314/full

Hanli Qiao, Boris Novikov, et al., "Concept Drift Analysis by Dynamic Residual Projection for Effectively Detecting Botnet Cyber-Attacks in IoT Scenarios," IEEE Transactions on Industrial Informatics ( Volume: 18, Issue: 6, June 2022). [Online]. Available: https://ieeexplore.ieee.org/abstract/document/9525207

Maria Casimiro, Paolo Romano, et al., "Towards a Framework for Adapting Machine Learning Components," IEEE International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS), 2022. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/9935009

S. Surendran, "Numerical simulation of ship stability for dynamic environment," Ocean Engineering Volume 30, Issue 10, July 2003, Pages 1305-1317, 2021. [Online]. Available: https://www.sciencedirect.com/science/article/abs/pii/S0029801802001099

Javier Jose Diaz Rivera, et al., "An ML Based Anomaly Detection System in real-time data streams," 2021 International Conference on Computational Science and Computational Intelligence (CSCI), 2022. [Online]. Available: https://ieeexplore.ieee.org/document/9799063

Engin Zeydan, et al., "Managing Distributed Machine Learning Lifecycle for Healthcare Data in the Cloud," IEEE Transactions on Industrial Informatics, vol. 19, no. 12, pp. 15213-15227, 2023. [Online]. Available: https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10636156

Deyslen Mariano-Hernández, et al., "Analysis of the Integration of Drift Detection Methods in Learning Algorithms for Electrical Consumption Forecasting in Smart Buildings," Sustainability, vol. 14, no. 10, pp. 5857, 2022. [Online]. Available: https://www.mdpi.com/2071-1050/14/10/5857

Downloads

Published

08-01-2025

Issue

Section

Research Articles