Document Type : Research Paper

Authors

1 Faculty of civil engineering and transportation, University of Isfahan, Isfahan, Iran

2 Geomatics department, Faculty of Civil Engineering and Transportation, University of Isfahan, Isfahan, Iran

Abstract

Extended Abstract

Introduction

The concentration of particulate matters has recently increased in the metropolitan area of Tehran resulting in many severe hazards for both the environment and citizens. Particulate matters (PM) with a diameter less than 2.5 microns (PM2.5) are considered to be one of the most dangerous types of pollution. Estimating the concentration of these particles in Tehran is challenging due to the existence of various sources of pollution and the lack of sufficient ground stations. Aerosol optical depth (AOD) data retrieved from satellite imagery can be an alternative. However, AOD are not easily convertible into surface pollution and requires the development of appropriate models such as those based on data-driven approaches and machine learning techniques. Thus, the present study seeks to create a model to estimate the concentration of PM2.5 in Tehran employing deep generative models and in-situ measurements, meteorological data, and AOD data extracted from MODIS satellite imagery. Reviewed literature has proved the ability of deep learning techniques to solve regression and classification problems. Deep learning techniques are divided into various categories, one of which is based on the generative models seeking to reconstruct the input features. In this way, high-level and efficient features can be employed to explore the relationship between PM2.5 and AOD. Thus, the present study has investigated the potential of deep generative models for estimating PM2.5 concentration from high resolution AOD data retrieved from satellite imagery. 

 

Materials and Study Area

As a metropolitan area suffering from air pollution particularly in winters, the capital city of Iran, Tehran was selected as the study area. PM2.5, the main source of pollution in Tehran, is mainly emitted from vehicles and especially old urban public transport fleet.

Aerosol data collected by Aqua and Terra sensors of MODIS and retrieved by Multi-angle Implementation of Atmospheric Correction (MAIAC) algorithm were used in the present study. Meteorological data were obtained from the global ECMWF climate model, and the concentration of PM2.5 was measured at air quality monitoring stations. Data were collected for a time interval of January 2013 to January 2020.

 

Methods

The present study has investigated the potential of deep generative models used to provide an estimate of PM2.5 concentration based on satellite AOD data. To reach such an aim, three types of deep generative neural networks, deep autoencoder (DAE), deep belief network (DBN) and conditional generative adversarial network (CGAN) were developed. Moreover, the performance of deep generative modes was compared with linear regression techniques as typical models used to explore the relation between PM2.5 and AOD data. Finally, the most accurate model for the generation of high resolution (1km) PM2.5 maps from AOD data was selected based on the performance of models.

 

Results and Discussion  

The accuracy of each developed model was evaluated using the test data and the obtained results were compared with results obtained from other basic linear regression models. Accuracy evaluation indicated that the developed deep autoencoder (DAE) combined with support vector regression led to the highest correlation (R2 = 0.69) and lowest RMSE (10.34) and MAE (7.95) and thus, can be potentially used for high resolution estimation of PM2.5 concentration. Next was the developed deep belief network which with a performance close to DAE demonstrated its potential capability to estimate PM2.5 concentration from satellite AOD data. The CGAN network acted less accurately in the estimation of PM2.5 concentration as compared to other deep generative models, but outperformed the linear regression algorithms on the test data. To sum up, findings indicated that deep generative models have outperformed classical linear regression techniques used for high resolution estimation of PM2.5 from satellite AOD data. Among the linear methods, the highest accuracy was achieved by the Lasso algorithm with an RSME of 12.14 and MAE of 9.46 on the test data which showed the significance of regularization for the improvement of performance in linear regression algorithms. Nevertheless, the accuracy of linear regression techniques was much lower than deep generative models.

 

Conclusion

Finally, DAE was selected as the best model for the estimation of PM2.5 concentration across the study area and high resolution maps of PM2.5 concentration were generated using the developed model. Investigating the daily PM2.5 maps generated for two days with different air quality conditions (clean and polluted) demonstrated the efficiency of the developed DAE for PM2.5 modeling.

Keywords

1- Arciszewska, C., & McClatchey, J. (2001). The importance of meteorological data for modelling air pollution using ADMS-Urban. Meteorological Applications, 8(3), 345-350.
2- Bagheri, H., Sadeghian, S., & Sadjadi, S. Y. (2014). The Assessment of using an Intelligent Algorithm for the Interpolation of Elevation in the DTM Generation. PFG Photogrammetrie, Fernerkundung, Geoinformation, 197-208.
3- Beckerman, B. S., Jerrett, M., Martin, R. V., van Donkelaar, A., Ross, Z., & Burnett, R. T. (2013). Application of the deletion/substitution/addition algorithm to selecting land use regression models for interpolating air pollution measurements in California. Atmospheric environment, 77, 172-177.
4- Chen, B., You, S., Ye, Y., Fu, Y., Ye, Z., Deng, J., Hong, Y. (2021). An interpretable self-adaptive deep neural network for estimating daily spatially-continuous PM2. 5 concentrations across China. Science of The Total Environment, 768, 144724.
5- Dominici, F., Peng, R. D., Bell, M. L., Pham, L., McDermott, A., Zeger, S. L., & Samet, J. M. (2006). Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. Jama, 295(10), 1127-1134.
6- Ghotbi, S., Sotoudeheian, S., & Arhami, M. (2016). Estimating urban groundlevel PM10 using MODIS 3km AOD product and meteorological parameters from WRF model. Atmospheric Environment, 141, 333–346.
7- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S. Bengio, Y. (2014). Generative adversarial nets. Advances in neural information processing systems, 27.
8- Gupta, P., & Christopher, S. A. (2009). Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: Multiple regression approach. Journal of Geophysical Research: Atmospheres, 114(D14).
9- Habibi, R., Alesheikh, A. A., Mohammadinia, A., Sharif, M.. (2017). An Assessment of Spatial Pattern Characterization of Air Pollution: A Case Study of CO and PM2.5 in Tehran, Iran. ISPRS International Journal of Geo-Information , 6(9).
10- Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz‐Sabater, J., Schepers, D. (2020). The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 14(730), 1999-2049.
11- Hinton, G. E. (2012). A practical guide to training restricted Boltzmann machines. In Neural networks: Tricks of the trade (pp. 599-619): Springer.
12- Hoek, G., Beelen, R., De Hoogh, K., Vienneau, D., Gulliver, J., Fischer, P., & Briggs, D. (2008). A review of land-use regression models to assess spatial variation of outdoor air pollution. Atmospheric environment, 42(33), 7561-7578.
13- Hsu, N., Jeong, M. J., Bettenhausen, C., Sayer, A., Hansell, R., Seftor, C., Tsay, S. C. (2013). Enhanced Deep Blue aerosol retrieval algorithm: The second generation. Journal of Geophysical Research: Atmospheres, 118(16), 9296-9315.
14- Jafarian, H., & Behzadi, S. (2020). Evaluation of PM2. 5 emissions in Tehran by means of remote sensing and regression models. Pollution, 6(3), 521-529.
15- Klemm, R. J., Mason Jr, R. M., Heilig, C. M., Neas, L. M., & Dockery, D. W. (2000). Is daily mortality associated specifically with fine particles? Data reconstruction and replication of analyses. Journal of the Air & Waste Management Association, 50(7), 1215-1222.
16- Levy, R., Mattoo, S., Munchak, L., Remer, L., Sayer, A., Patadia, F., & Hsu, N. (2013). The Collection 6 MODIS aerosol products over land and ocean. Atmospheric Measurement Techniques, 6(11), 2989-3034.
17- Li, L. (2020). A robust deep learning approach for spatiotemporal estimation of satellite AOD and PM2. 5. Remote Sensing, 12(2), 264.
18- Li, T., Shen, H., Yuan, Q., Zhang, X., & Zhang, L. (2017). Estimating ground‐level PM2. 5 by fusing satellite and station observations: a geo‐intelligent deep learning approach. Geophysical Research Letters, 44(23), 11,985-911,993.
19- Lippmann, M., Ito, K., Nadas, A., & Burnett, R. T. (2000). Association of particulate matter components with daily mortality and morbidity in urban populations. Research report (Health Effects Institute)(95), 5-72, discussion 73.
20- Mirza, M., & Osindero, S. (2014). Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784.
21- Nabavi, S. O., Haimberger, L., & Abbasi, E. (2019). Assessing PM2.5 concentrations in Tehran, Iran, from space using MAIAC, deep blue, and dark target AOD and machine learning algorithms. Atmospheric Pollution Research, 10(3), 889–903.
22- Ni, X., Cao, C., Zhou, Y., Cui, X., & P Singh, R. (2018). Spatio-temporal pattern estimation of PM2. 5 in Beijing-Tianjin-Hebei Region based on MODIS AOD and meteorological data using the back propagation neural network. Atmosphere, 9(3), 105.
23- Peng, R. D., Bell, M. L., Geyh, A. S., McDermott, A., Zeger, S. L., Samet, J. M., & Dominici, F. (2009). Emergency admissions for cardiovascular and respiratory diseases and the chemical composition of fine particle air pollution. Environmental health perspectives, 117(6), 957-963.
24- Peters, A., Dockery, D. W., Muller, J. E., & Mittleman, M. A. (2001). Increased particulate air pollution and the triggering of myocardial infarction. Circulation, 103(23), 2810-2815.
25- Ruthotto, L., & Haber, E. (2021). An introduction to deep generative modeling. GAMM‐Mitteilungen, e202100008.
26- Shahbazi, H., Taghvaee, S.,  Hosseini, V., Afshin, H. (2016)  A GIS based emission inventory development for Tehran. Urban Climate, 17, 216-229
27- Sayer, A., Hsu, N., Bettenhausen, C., Jeong, M. J., & Meister, G. (2015). Effect of MODIS Terra radiometric calibration improvements on Collection 6 Deep Blue aerosol products: Validation and Terra/Aqua consistency. Journal of Geophysical Research: Atmospheres, 120(23), 12,157-112,174.
28- Sotoudeheian, S., & Arhami, M. (2014). Estimating ground-level PM 10 using satellite remote sensing and ground-based meteorological measurements over Tehran. Journal of Environmental Health Science and Engineering, 12(1), 1-13.
29- Vienneau, D., De Hoogh, K., Beelen, R., Fischer, P., Hoek, G., & Briggs, D. (2010). Comparison of land-use regression models between Great Britain and the Netherlands. Atmospheric environment, 44(5), 688-696.