A Smart Air Pollution Detector Using Machine Learning
Keywords:
Air Quality Prediction, Machine Learning (ML), Classification, Synthetic Minority Over-sampling Technique (SMOTE), Air Quality Index (AQI)Abstract
The rapid growth of urbanization and industrial activities in cities has resulted in a significant deterioration in air quality, which poses an increasing threat to both public health and the environment. This study focuses on predicting air quality using ML algorithms, aiming to classify air quality into three distinct categories: Good, Satisfactory, and Poor. The dataset utilized for this research comprises key environmental factors such as PM2.5, PM10, nitrogen oxides, and carbon monoxide, which are considered critical indicators of air pollution. To enhance the accuracy of predictions, several ML models were employed, including Logistic Regression, MLP, Random Forest, Decision Tree, The data preprocessing phase involved several essential steps to prepare the dataset for model training. These steps included the handling of missing values, selection of relevant features, and addressing class imbalance through the use of the SMOTE, which was employed to balance the distribution of target labels. The models were then trained and evaluated based on their performance in predicting air quality categories, with accuracy being the primary evaluation metric. Moreover, it can help inform public health decisions by identifying regions with poor air quality and ensuring better management of air pollution levels.
Downloads
References
Aamer, H., Ba-Alawi, A. H., Kang, S., Lee, T., & Jo, Y. M. (2025). Prediction of school PM2.5 by an attention-based deep learning approach informed with data from nearby air quality monitoring stations. Chemosphere, 375, 144241. https://doi.org/10.1016/J.CHEMOSPHERE.2025.144241
Kant, S. (2024). From data to decision-making: utilizing decision tree for air quality monitoring in smart urban areas. International Journal of Information Technology (Singapore), 17(1), 665–672. https://doi.org/10.1007/S41870-024-02208-Y/METRICS
Lee, M. J., & Zhang, R. (2024). Multimodal Data Fusion and Deep Learning for Occupant-Centric Indoor Environmental Quality Classification. Journal of Computing in Civil Engineering, 39(2), 04024061. https://doi.org/10.1061/JCCEE5.CPENG-6249
Njaime, M., Abdallah, F., Snoussi, H., Akl, J., Chaaban, K., & Omrani, H. (2024). Transfer learning based solution for air quality prediction in smart cities using multimodal data. International Journal of Environmental Science and Technology, 22(3), 1297–1312. https://doi.org/10.1007/S13762-024-05722-5/METRICS
Pabitha, C., Pal Pandian, P., Saravanan, S., Rajendiran, M., Sneha Chaturya, A., & Natrayan, L. (2025). Development of robotic sensor nodes in wireless sensor networks using embedded systems and machine learning. Hybrid and Advanced Technologies, 513–518. https://doi.org/10.1201/9781003559139-76
Potharaju, S., Tirandasu, R. K., Tambe, S. N., Jadhav, D. B., Kumar, D. A., & Amiripalli, S. S. (2025). A two-step machine learning approach for predictive maintenance and anomaly detection in environmental sensor systems. MethodsX, 14, 103181. https://doi.org/10.1016/J.MEX.2025.103181
Rajendran, R. K., Mohana Priya, T., Musa, A. I. A., Mahalakshmi, S. B., & Anand, T. R. (1 C.E.). Smart Solutions for Climate Resilience Harnessing Machine Learning and Sustainable WSNs. Https://Services.Igi-Global.Com/Resolvedoi/Resolve.Aspx?Doi=10.4018/979-8-3693-3940-4.Ch010, 213–232. https://doi.org/10.4018/979-8-3693-3940-4.CH010
Rathnayaka, R. M. N., & Sujah, A. M. A. (2025). IoT-Driven Environmental Intelligence for Sustainable Tomorrow Through Advanced Machine Learning: A Systematic Literature Review. Journal of Information and Communication Technology (JICT), 02.
Sankaran, K. S. (2025). Design and development of sensor based crop monitoring system using deep learning approach for smart agriculture. Multimedia Tools and Applications, 1–22. https://doi.org/10.1007/S11042-024-20525-Z/METRICS
Varun, E., Rajesh, L., Lokeshwari, M., Ashwini, S., Bhat, D., & S., C. K. (1 C.E.). Machine Learning-Enhanced Water and Air Quality Monitoring Technologies and Applications. Https://Services.Igi-Global.Com/Resolvedoi/Resolve.Aspx?Doi=10.4018/979-8-3693-4759-1.Ch004, 79–110. https://doi.org/10.4018/979-8-3693-4759-1.CH004
Zeng, T., Ma, X., Luo, Y., Yin, J., Ji, Y., & Lu, S. (2025). Improving outdoor thermal environmental quality through kinetic canopy empowered by machine learning and control algorithms. Building Simulation 2025, 1–22. https://doi.org/10.1007/S12273-025-1246-6
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Journal of Scientific Research in Computer Science, Engineering and Information Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.