EGU23-7796
https://doi.org/10.5194/egusphere-egu23-7796
EGU General Assembly 2023
© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Evaluation of TROPOMI observations for estimating surface NO2 concentrations over Europe using XGBoost Model 

Shobitha Shetty1,2, Philipp Schneider1, Kerstin Stebel1, Arve Kylling1, Terje Koren Berntsen2, and Paul Hamer1
Shobitha Shetty et al.
  • 1Norwegian Institute for Air Research, Kjeller, Norway (sshe@nilu.no)
  • 2University of Oslo, Oslo, Norway

Nitrogen dioxide (NO2) is among the major air pollutants in Europe posing severe hazard to environmental and human health. The concentrations of surface NO2 are measured by ground monitoring stations which are fairly limited in representation and distribution. While NO2 estimates from chemical transport models are realistic, their complexity makes them computationally intensive. Satellite observations from instruments such as TROPOMI provide high spatiotemporal distribution of NO2. However, these instruments capture NO2 density only along the tropospheric column and not on the surface. Exploiting the availability of ground station measurements and spatially continuous information from TROPOMI, this study estimates surface NO2 concentrations over Europe at 1km spatial resolution for 2019-2021 using XGBoost machine learning model. While ground measurements are used as target reference features, satellite observations such as tropospheric column density of NO2 (from TROPOMI), night light radiance (from VIIRS), NDVI (from MODIS) and modelled meteorological parameters such as planetary boundary layer height, wind velocity, temperature are used as input features to the model. We find an overall mean absolute error of 7.87µg/m3, mean bias of -3.13µg/m3 and spearman correlation of 0.61 during model validation. We found that the performance of the model is influenced by NO2 concentration levels and is most reliable for predictions at concentration levels <40µg/m3 with a relative bias of <40%. The spatial error analysis also indicates the spatial robustness of the model across the study area. The importance of input features is evaluated using SHapley Additive exPlanations (SHAP), which shows TROPOMI NO2 being the most important source for the modelled NO2 predictions. Furthermore, SHAP values also highlight the role of VIIRS night light radiance in deriving finer detailed spatial patterns of surface NO2 estimates. Despite the complex non-linear relationship of the input features, the trained XGBoost model requires an average of 570 seconds to predict single day surface NO2 concentrations for the large study area of continental scale. Thus, this work evaluates the importance of TROPOMI data and reliability of machine learning models for estimating surface NO2 concentrations on a larger spatial scale.

How to cite: Shetty, S., Schneider, P., Stebel, K., Kylling, A., Koren Berntsen, T., and Hamer, P.: Evaluation of TROPOMI observations for estimating surface NO2 concentrations over Europe using XGBoost Model , EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-7796, https://doi.org/10.5194/egusphere-egu23-7796, 2023.