ANALYSIS AND PREDICTION OF INFLUENCING FACTORS ON THE PROBABILITY OF STROKE
Volume 2, Issue 4, Pp 73-77, 2024
DOI: 10.61784/tsshr3013
Author(s)
XinChun Wang
Affiliation(s)
School of Mathematics and Statistics, Guangxi Normal University, Guilin 541006, Guangxi, China.
Corresponding Author
XinChun Wang
ABSTRACT
With the continuous development of big data, statistical model analysis has permeated various fields of life, particularly in clinical medicine. In addressing the clinical prediction of patients' stroke probability, modeling methods such as ANOVA, support vector machines, binomial logistic regression analysis and random forest models are essential. Given that the dependent variable in this study is a binary classification problem, logistic regression analysis and random forest models were selected for the analysis. This paper elaborates on the principles of logistic regression analysis and random forest models and random forest models, providing the regression equation for the regression model and the importance scores of each variable in the random forest model. Additionally, the predictive capabilities of these two models were evaluated, including an assessment of prediction accuracy.Through the application of regression and random forest models, we aim to enhance the clinical prediction accuracy of patients' stroke probability, thereby providing a more reliable basis for clinical decision-making.
KEYWORDS
Stroke; Influencing factors; Logistic Regression Analysis; Random forest regression model; Prediction accuracyt
CITE THIS PAPER
XinChun Wang. Analysis and prediction of influencing factors on the probability of stroke. Trends in Social Sciences and Humanities Research. 2024, 2(4): 73-77. DOI: 10.61784/tsshr3013.
REFERENCES
[1] Breiman L. Random forests. Machine learning, 2001, 45(1): 5-32.
[2] Liaw A., Wiener, M. Classification and regression by randomForest. R news, 2(3): 18-22.
[3] Diaz-Uriarte, R., Alvarez de Andres, S. Gene selection and classification of microarray data using random forest. BMC bioinformatics, 2006, 7(1): 3.
[4] Ishwaran H., Kogalur UB. Random forests for survival, regression and classification (RF-SRC). R package version 1, 2007, 4.
[5] Ishwaran H., Kogalur UB., Blackstone EH., Lauer MS. Random survivalforests. The annals of applied statistics, 2008, 2(3): 841-860.
[6] Genuer Robin, Jean-Michel Poggi, Christine Tuleau-Malot. Variable selection using random forests. Pattern Recognition Letters, 2010, 31(14): 2225-2236.