A Smart Air Pollution Detector Using Machine Learning

Authors

  • K. Madhusudhan Reddy Assistant Professor, Department of MCA, Annamacharya Institute of Technology and Sciences, Karakambadi, Tirupati, Andhra Pradesh, India Author
  • Kanala Prakash Student, Department of MCA, Annamacharya Institute of Technology and Sciences, Karakambadi, Tirupati, Andhra Pradesh, India Author

Keywords:

Air Quality Prediction, Machine Learning (ML), Classification, Synthetic Minority Over-sampling Technique (SMOTE), Air Quality Index (AQI)

Abstract

The rapid growth of urbanization and industrial activities in cities has resulted in a significant deterioration in air quality, which poses an increasing threat to both public health and the environment. This study focuses on predicting air quality using ML algorithms, aiming to classify air quality into three distinct categories: Good, Satisfactory, and Poor. The dataset utilized for this research comprises key environmental factors such as PM2.5, PM10, nitrogen oxides, and carbon monoxide, which are considered critical indicators of air pollution. To enhance the accuracy of predictions, several ML models were employed, including Logistic Regression, MLP, Random Forest, Decision Tree, The data preprocessing phase involved several essential steps to prepare the dataset for model training. These steps included the handling of missing values, selection of relevant features, and addressing class imbalance through the use of the SMOTE, which was employed to balance the distribution of target labels. The models were then trained and evaluated based on their performance in predicting air quality categories, with accuracy being the primary evaluation metric. Moreover, it can help inform public health decisions by identifying regions with poor air quality and ensuring better management of air pollution levels.

Downloads

Download data is not yet available.

References

Aamer, H., Ba-Alawi, A. H., Kang, S., Lee, T., & Jo, Y. M. (2025). Prediction of school PM2.5 by an attention-based deep learning approach informed with data from nearby air quality monitoring stations. Chemosphere, 375, 144241. https://doi.org/10.1016/J.CHEMOSPHERE.2025.144241

Kant, S. (2024). From data to decision-making: utilizing decision tree for air quality monitoring in smart urban areas. International Journal of Information Technology (Singapore), 17(1), 665–672. https://doi.org/10.1007/S41870-024-02208-Y/METRICS

Lee, M. J., & Zhang, R. (2024). Multimodal Data Fusion and Deep Learning for Occupant-Centric Indoor Environmental Quality Classification. Journal of Computing in Civil Engineering, 39(2), 04024061. https://doi.org/10.1061/JCCEE5.CPENG-6249

Njaime, M., Abdallah, F., Snoussi, H., Akl, J., Chaaban, K., & Omrani, H. (2024). Transfer learning based solution for air quality prediction in smart cities using multimodal data. International Journal of Environmental Science and Technology, 22(3), 1297–1312. https://doi.org/10.1007/S13762-024-05722-5/METRICS

Pabitha, C., Pal Pandian, P., Saravanan, S., Rajendiran, M., Sneha Chaturya, A., & Natrayan, L. (2025). Development of robotic sensor nodes in wireless sensor networks using embedded systems and machine learning. Hybrid and Advanced Technologies, 513–518. https://doi.org/10.1201/9781003559139-76

Potharaju, S., Tirandasu, R. K., Tambe, S. N., Jadhav, D. B., Kumar, D. A., & Amiripalli, S. S. (2025). A two-step machine learning approach for predictive maintenance and anomaly detection in environmental sensor systems. MethodsX, 14, 103181. https://doi.org/10.1016/J.MEX.2025.103181

Rajendran, R. K., Mohana Priya, T., Musa, A. I. A., Mahalakshmi, S. B., & Anand, T. R. (1 C.E.). Smart Solutions for Climate Resilience Harnessing Machine Learning and Sustainable WSNs. Https://Services.Igi-Global.Com/Resolvedoi/Resolve.Aspx?Doi=10.4018/979-8-3693-3940-4.Ch010, 213–232. https://doi.org/10.4018/979-8-3693-3940-4.CH010

Rathnayaka, R. M. N., & Sujah, A. M. A. (2025). IoT-Driven Environmental Intelligence for Sustainable Tomorrow Through Advanced Machine Learning: A Systematic Literature Review. Journal of Information and Communication Technology (JICT), 02.

Sankaran, K. S. (2025). Design and development of sensor based crop monitoring system using deep learning approach for smart agriculture. Multimedia Tools and Applications, 1–22. https://doi.org/10.1007/S11042-024-20525-Z/METRICS

Varun, E., Rajesh, L., Lokeshwari, M., Ashwini, S., Bhat, D., & S., C. K. (1 C.E.). Machine Learning-Enhanced Water and Air Quality Monitoring Technologies and Applications. Https://Services.Igi-Global.Com/Resolvedoi/Resolve.Aspx?Doi=10.4018/979-8-3693-4759-1.Ch004, 79–110. https://doi.org/10.4018/979-8-3693-4759-1.CH004

Zeng, T., Ma, X., Luo, Y., Yin, J., Ji, Y., & Lu, S. (2025). Improving outdoor thermal environmental quality through kinetic canopy empowered by machine learning and control algorithms. Building Simulation 2025, 1–22. https://doi.org/10.1007/S12273-025-1246-6

Downloads

Published

05-05-2025

Issue

Section

Research Articles