AIR QUALITY ANALYSIS BASED ON OUTLIER DETECTION ALGORITHMS

Authors

  • QiZhi Zhang, Northeast Forestry University, Harbin 150040, Heilongjiang, China.
  • Nan Jiang (Corresponding Author), Northeast Forestry University, Harbin 150040, Heilongjiang, China.

Keywords

Data mining, Two-Step Clustering algorithm, Outlier detection algorithm, Air quality analysis

Abstract

This study applies an outlier detection algorithm, built on the distance metric of the Two-Step clustering algorithm, to identify potentially untrustworthy data in pollutant concentration records across the Beijing-Tianjin-Hebei region. For each monitoring area, anomaly indices were calculated from the designated formulas and variable contribution rates were evaluated in order to identify abnormal data points. Subsequent analysis of these anomalies provides substantial evidence that the dataset contains unreliable records.
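As a rough illustration of how an anomaly index and per-variable contribution rates can be computed once records have been assigned to clusters, the following is a minimal sketch and not the formulas used in the paper. It assumes cluster labels are already available, and it substitutes a squared standardized deviation for the log-likelihood distance of the Two-Step algorithm. The function name `anomaly_scores`, the three synthetic pollutant columns, and the cutoff of 2 are all hypothetical choices made for the example.

```python
import numpy as np

def anomaly_scores(X, labels, eps=1e-12):
    """Return (anomaly_index, contribution) for each record.

    Simplifying assumption: a record's deviation on each variable is its
    squared standardized distance from the mean of its own cluster.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n, p = X.shape
    index = np.zeros(n)
    contrib = np.zeros((n, p))
    for c in np.unique(labels):
        mask = labels == c
        members = X[mask]
        mu = members.mean(axis=0)
        sigma = members.std(axis=0) + eps       # avoid division by zero
        dev = ((members - mu) / sigma) ** 2     # per-variable deviation
        total = dev.sum(axis=1)                 # per-record deviation
        # Anomaly index: record deviation relative to the cluster average.
        index[mask] = total / (total.mean() + eps)
        # Contribution rate: share of the record deviation due to each variable.
        contrib[mask] = dev / (total[:, None] + eps)
    return index, contrib

# Synthetic data only: 50 records, 3 hypothetical pollutant columns.
rng = np.random.default_rng(0)
X = rng.normal(loc=[50.0, 20.0, 30.0], scale=[5.0, 2.0, 3.0], size=(50, 3))
X[7, 0] += 60.0                                 # inject one implausible reading
labels = np.zeros(50, dtype=int)                # single cluster for simplicity

index, contrib = anomaly_scores(X, labels)
flagged = np.flatnonzero(index > 2.0)           # illustrative cutoff, an assumption
print("flagged records:", flagged)
print("dominant variable for record 7:", int(contrib[7].argmax()))
```

In this sketch, a record whose index is well above 1 deviates more than the typical member of its cluster, and the contribution row shows which variable drives that deviation. In practice the labels would come from the clustering step rather than being fixed by hand.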

Published

2025-05-14

How to Cite

QiZhi Zhang, Nan Jiang. Air quality analysis based on outlier detection algorithms. Eurasia Journal of Science and Technology. 2025, 7(3): 25-29. DOI: https://doi.org/10.61784/ejst3083