Science, Technology, Engineering and Mathematics.
Open Access

THE PREDICTION AND INFLUENCING FACTORS OF BREAST CANCER RECURRENCE BASED ON RANDOM FOREST

Download as PDF

Volume 2, Issue 1, Pp 21-25, 2025

DOI: https://doi.org/10.61784/asat3011

Author(s)

Xi Yang*, WenBei Zheng, WenYun Xia

Affiliation(s)

Guangxi Normal University, Guilin 541000, Guangxi, China.

Corresponding Author

Xi Yang

ABSTRACT

Breast cancer (BC) is one of the most common malignant tumors in women. In 2022, it has become the second most common cancer after lung cancer. Although medical technology has made great progress in recent years and the survival rate of breast cancer patients has been greatly improved, according to research, about 40% of patients still relapse after treatment. Constructing a breast cancer recurrence prediction model and finding factors that affect breast cancer recurrence are of great significance for clinical treatment and prolonging patient survival. This study used the TCGA dataset and randomly divided the patients into training and test sets in a ratio of 8:2. Seven algorithms, including decision tree, logistic regression, support vector machine, K nearest neighbor, random forest, neural network, and adaptive boosting, were used to construct the model, and the performance of each model was evaluated. The results showed that the random forest model had the best effect, with an accuracy of 97.77%, a sensitivity of 94.23%, a specificity of 99.21%, a false positive rate of 0.79%, an F1 of 96.08%, and an AUC value of 96.72%. The features obtained by the model classification were ranked according to their importance. The top three features were: Age_at_Initial_Pathologic_Diagnosis_nature2012, lymph_node_examined_count and number_of_lymphnodes_Postive _by_he. The model provides more robust feature importance analysis results, providing an important reference for clinicians in breast cancer recurrence risk assessment and individualized treatment decision-making.

KEYWORDS

Breast cancer; Random forest; Recurrence prediction; Influencing factor

CITE THIS PAPER

Xi Yang, WenBei Zheng, WenYun Xia. The prediction and influencing factors of breast cancer recurrence based on random forest. Journal of Trends in Applied Science and Advanced Technologies. 2025, 2(1): 21-25. DOI: https://doi.org/10.61784/asat3011.

REFERENCES

[1] Huang Xiaolin, He Ningning, Chen Shuzhen, et al. Retrospective study on the changes of PRL levels in female breast cancer patients aged 30 to 50 years old from Zhanjiang. Smart Health, 2018, 4(22).

[2] Rainey Linda, Eriksson Mikael, Trinh Thang, et al. The impact of alcohol consumption and physical activity on breast cancer: The role of breast cancer risk. International journal of cancer, 2020.

[3] Whitaker K D, Sheth D, Olopade O I. Dynamic contrast enhanced magnetic resonance imaging for risk-stratified screening in women with BRCA mutations or high familial risk for breast cancer: are we there yet? Breast Cancer Res Treat, 2020, 183(2).

[4] Lepucki A, Orlińska K, Mielczarek-Palacz A, et al. The Role of Extracellular Matrix Proteins in Breast Cancer. Journal of Clinical Medicine, 2022, 11(5):1250

[5] Heitmeir B, Deniz M, Janni W, et al. Circulating Tumor Cells in Breast Cancer Patients:A Balancing Act between Stemness, EMT Features and DNA Damage Responses. Cancers(Basel), 2022, 14(4): 997

[6] Bilski J, Smolag J. Parallel architectures for learning the RTRN and Elman dynamic neural networks. IEEETransactions on Parallel and Distributed Systems, 2015, 26(9): 2561.

[7] Wu Ying. Research on urban waterlogging risk assessment and prediction based on machine learning. Beijing University of Civil Engineering and Architecture, 2024. DOI:10.26943/d.cnki.gbjzc.2024.000067.

[8] Xun L, Peng Z, Yichen L, et al. Influencing Factors and Risk Assessment of Precipitation-Induced Flooding in Zhengzhou, China, Based on Random Forest and XGBoost Algorithms. International Journal of Environmental Research and PublicHealth, 2022, 19(24): 16544.

[9] Liu Chao. Regression Analysis: Methods, Data and Application of R. Beijing: Higher Education Press, 2019.

All published work is licensed under a Creative Commons Attribution 4.0 International License. sitemap
Copyright © 2017 - 2025 Science, Technology, Engineering and Mathematics.   All Rights Reserved.