A Risk Prediction Model for Ischemic Stroke Construction of a Risk Prediction Model for Ischemic Stroke Based on Lasso-Logistic Regression

Expand
  • School of Public Health Qiqihar Medical University Heilongjiang Qiqihar 161006, China

Received date: 2025-08-07

  Revised date: 2026-01-21

  Accepted date: 2026-03-18

  Online published: 2026-04-21

Abstract

Objective: To construct a risk prediction model for ischemic stroke providing a reference for medical institutions to improve relevant prevention and treatment measures.Methods: Based on real data research 5605 physical examination subjects from three tertiary hospitals in 20232024 were selected as the research subjects. T-tests and χ2 tests were used to compare the differences in clinical characteristics. Lasso regression was applied to screen the related variables of the disease onset. The bootstrap method was used for repeated sampling 1000 times. The C-index of the lambda.min model and the lambda.1se model were calculated rank the importance of related variables using the random forest model and construct a risk prediction model for ischemic stroke onset using multivariate Logistic regression. The accuracy of the prediction model was evaluated in the training set and validation set by using the ROC curve and calibration curve and the DCA curve and CIC curve were plotted to analyze the predictive value.Results: Based on the Akaike Information Criterion AIC), the more parsimonious lambda.1se model was selected. Using the optimal λ value λ=0.017), the screened independent variables in descending order of importance were homocysteine platelet count hyperlipidemia hypertension fasting blood glucose triglycerides and diabetes. Multivariate logistic regression confirmed all these variables as significant risk factors for ischemic stroke P<0.05. The model demonstrated good discrimination with AUC values of 0.771 95% CI 0.739-0.802 in the training set and 0.786 95% CI 0.736-0.835 in the validation set. Calibration was also satisfactory with mean absolute errors of 0.003 and 0.011 in the training and validation sets respectively. DCA indicated favorable net intervention benefits across a reasonable threshold range. CIC further showed high prediction efficiency for onset risk when the threshold probability exceeded 0.4.Conclusion: The constructed prediction model exhibits good discrimination calibration and provides net clinical benefit across a range of threshold probabilities. It can serve as a useful tool to support clinical decision-making in the prevention of ischemic stroke.


Cite this article

HUO Lifeng, WANG Xi, ZHANG Hanqi, SUN Panan, HAN Yunfeng .

A Risk Prediction Model for Ischemic Stroke Construction of a Risk Prediction Model for Ischemic Stroke Based on Lasso-Logistic Regression

[J]. CHINESE JOURNAL OF MEDICINAL GUIDE, 2026 , 28(3) : 317 -324 . DOI: 10.1009-0959.2026.040003

References

    [1 刘明波,何新叶,杨晓红,等.《中国心血管健康与疾病报告2023》要点解读[J.临床心血管病杂志,2024408):599-616.

         2  Ma Qingfeng Li Rui Wang Lijunet al. Temporal trend and attributable risk factors of stroke burden in China 19902019 an analysis for the Global Burden of Disease Study 2019J.Lancet Public Health 2021612):e897-e906.

         3  GBD 2016 neurology collaborators. Global regional and national burden of neurological disorders 19902016 a systematic analysis for the Global Burden of Disease Study 2016J.Lancet Neurol 2019185):459-480.

         4  Lian Zhengqi Gao Yikun Xiong Jiali et al. Temporal trend and attributable risk factors of ischemic stroke burden in China 19902021J. BMC Neurology 2025251):245.

         5  Nayem HM Aziz S Kibria BMG et al. Evaluating estimator performance under multicollinearity a trade-off between MSE and accuracy in logistic lasso elastic net and ridge regression with varying penalty parametersJ.Stats 202582):45.

         6  Qi Duan Li Wenlong Zhang Ye et al. Nomogram established on account of Lasso-logistic regression for predicting hemorrhagic transformation in patients with acute ischemic stroke after endovascular thrombectomyJ.Clin Neurol Neurosurg 20242431):108389.

         7  Huang Qianqian Zeng Tianshu Zhang Jiaoyueet al. Framingham risk score conventional risk factors are potent to predict all-cause mortality using machine learning algorithms a population-based prospective cohort study over 40 years in ChinaJ.J Investig Med 2023716):586-590.

         8  北京高血压防治协会,中国卒中学会高血压预防与管理分会,中国老年保健协会养老与健康专业委员会,等.基层冠心病与缺血性脑卒中共患管理专家共识2022J.中国心血管病研究,2022209):772-793.

         9  程露,梁晓峰,吴静,等.中国5省农村地区脱贫居民吸烟现状及影响因素研究[J. 中国慢性病预防与控制,2023318):601-605.

         10 Ng R Sutradhar R Yao Z et al. Smoking drinkingdiet and physical activity-modifiable lifestyle risk factors and their associations with age to first chronic diseaseJ.Int J Epidemiol 2020491):113-130.

         11 中国高血压防治指南修订委员会.中国高血压防治指南(2024年修订版)[J.中华高血压杂志(中英文),2024327):603-700.

         12 中华医学会糖尿病学分会.中国糖尿病防治指南(2024版)[J.中华糖尿病杂志,2025171):16-139.

         13 中国血脂管理指南修订联合专家委员会.中国血脂管理指南(2023年)[J.中国循环杂志,2023383):237-271

         14 Zhang Juanjuan Wang Weixian Yang Mingming et al.Variational Bayesian Variable Selection in Logistic Regression Based on Spike-and-Slab LassoJ.Mathematics 20251313):2205.

         15 Gou Dengqun Min Changhang Peng Xiaofeng et al. Associating factors of cognitive frailty among older people with chronic heart failure Based on LASSO-logistic regressionJ.J Adv Nurs 2024813):1399-1411.

         16 Li Yun Zhao Lina Wang Ye et al. Development and validation of prediction models for neurocognitive disorders in adult patients admitted to the ICU with sleep disturbanceJ.CNS Neurosci Ther 2021284):554-565.

         17 Li Qiaowei Lin Fan Gao Zhonghai et al. Chinese ASCVD risk equations rather than pooled cohort equations are better to identify macro and microcirculation abnormalitiesJ.BMC Cardiovasc Disord 2020201):145.

         18 Sakamoto T Endo H Yamamoto H et al. Machine learning prediction of anastomotic leak after low anterior resection nationwide database analysisJ.Medicine 202510434):e443977.

         19 Jiang Yan Wang Weikai Xu Ruifeng et al. Construction and efficacy evaluation of a model for early diagnosis of pediatric sepsis based on LASSO-logistic regressionJ.Front Pediatr 2025131):1624278.

         20 许海婷,魏晓霞,吴文华,等.基于Markov模型的血塞通软胶囊治疗缺血性中风的成本效果分析[J.中国药物评价,2024412):155-159.

         21 Jin Xiuli Yang Xiangxi Li Fengjuan. Prediction of ischemic stroke in elderly hypertensive patients using carotid plaque superb microvascular imaging characteristicsa lasso-logistic regression modelJ.J Med Ultrason 2025522):1-9.

         22 Nam KW Kim CKYU S et al. Plasma total homocysteine level is related to unfavorable outcomes in ischemic stroke with atrial fibrillationJ.J Am Heart Assoc 2022119):e022138.

         23 Li Keliang Xu Min Zhang Yun et al. Prognostic values of homocysteine and potassium levels in acute ischemic stroke patients after intravenous thrombolysis with recombinant tissue-type plasminogen activatorJ.Crit Rev Eukaryot Gene Expr 2025352):65-73.

         24 Burlard P Vögtle TNieswandt B. Platelets in thromboinflammationconcepts mechanismsand therapeutic strategies for ischemic strokeJ.Hamostaseologie 2020402):153-164.

         25 李丽,刘婉如,张征,等.逐瘀通脉胶囊对大鼠急性局灶性脑缺血再灌注损伤的影响[J.中国药物评价,2023406):502-505.

         26 Wen Huijun Wang Ning Lv Min et al. The early predictive value of platelet-to-lymphocyte ratio to hemorrhagic transformation of young acute ischemic strokeJ.Asian Biomed 2023176):267-272.

         27 Zeng Nimei Shen Yue Li Yuan et al. Association between remnant cholesterol and subclinical carotid atherosclerosis among Chinese general population in health examinationJ.J Stroke Cerebrovasc Dis2023328):107234.

         28 Li Chujun Chen Yuzhen Ou Xiuli et al. Factors influencing the occurrence of ischemic stroke in elderly patients with hypertension and type 2 diabetes mellitusa case-control studyJ.BMC Neurology2025251):35.

         29 Xu Tianqi Yang Jianhong Xu Yao et al. Post-acute ischemic stroke hyperglycemia aggravates destruction of the blood-brain barrierJ.Neural Regen Res 2024196):1344-1350.

Outlines

/