AN AI-DRIVEN HYBRID LSTM-RF FRAMEWORK FOR DECOUPLING ENVIRONMENTAL AND ANTHROPOGENIC IMPACTS ON AIR QUALITY
Keywords:
Meteorological normalization, Machine learning, Long short-term memory, Random forest, Decoupling analysisAbstract
Evaluating atmospheric emission policies requires accurately separating anthropogenic activities from meteorological influences. Traditional statistical approaches often fail to capture the non-linear, high-dimensional dynamics of atmospheric environments. This research introduces a hybrid machine learning framework integrating an LSTM neural network and Random Forest meteorological normalization to decouple these overlapping effects. A dynamic inversion model was constructed using 2021 hourly observations of PM2.5, PM10, temperature, humidity, and wind speed from Jinan's Guodian station. The LSTM captured long-term temporal dependencies, achieving a coefficient of determination exceeding 0.80 across seasonal variations. To isolate meteorological impacts, the RF model employed a 14-day random resampling technique, calculating the conditional expectation of pollutant concentrations to marginalize weather randomness. Results revealed distinct seasonal contribution rates: meteorological factors contributed 47.36% to PM2.5 concentrations during severe December pollution events, driven by stagnant conditions and high humidity. Conversely, this contribution dropped to 24.87% in February, indicating primarily anthropogenic origins. Feature analysis identified humidity and temperature as the primary meteorological drivers of particulate accumulation. This framework provides an interpretable methodology to evaluate the actual efficacy of emission interventions under varying weather scenarios.References
[1] Zhao Y, Sun H, Ma D. Research on the impact of urban green finance reform on synergizing the reduction of pollution and carbon emissions. Industrial Economics Research, 2024(3): 15-28.
[2] Yu W, Zhang T, Shen D. Evolution of spatial pattern and influencing factors of county-level carbon emission intensity in China based on random forest model. China Environmental Science, 2022, 42(6): 2788-2798.
[3] Tao Y, Du J. Temperature prediction using long short-term memory network based on random forest. Computer Engineering and Design, 2019, 40(3): 737-743.
[4] Li C, Yin Y, Cui W. Study on carbon emission prediction and its interaction with green finance index. Statistical Theory and Practice, 2023(12): 18-26.
[5] Gao H, Wang H, Dong Z. Intelligent drive and enterprise green innovation: a quasi-natural experiment based on the national artificial intelligence pilot zone. Journal of Jiangnan University (Humanities and Social Sciences), 2024, 23(6): 55-69.
[6] Zhang H, Deng H. Photovoltaic evaluation in buildings integrally based on entropy weight and grey relational analysis// 2025 8th International Conference on Power and Energy Applications (ICPEA). 2025: 526-531.
[7] Liang W, Li Y, Liu X, et al. AI-based Bayesian structural time series modeling for assessing PM2.5 air quality improvements during the Beijing 2022 Winter Olympics. Atmospheric Environment, 2025, 358: 121328.
[8] Zhao J, Deng F, Cai Y, et al. Long short-term memory - fully connected (LSTM-FC) neural network for PM2.5 concentration prediction. Chemosphere, 2019, 220: 486-492.
[9] Chen M, Xu P, Liu Z, et al. Air pollution prediction based on optimized deep learning neural networks: PSO-LSTM. Atmospheric Pollution Research, 2025, 16(3): 102413.
[10] Huang G, Li X, Wu D. PM2.5 concentration forecasting at surface monitoring sites using GRU neural network based on empirical mode decomposition. Science of the Total Environment, 2021, 768: 144516.