Harrop, J. S. et al. Congress of neurological surgeons systematic review and evidence-based guidelines for perioperative spine: preoperative surgical risk assessment. Neurosurgery 89, S9–S18 (2021).
Ton, A. et al. The evolution of risk assessment in spine surgery: a narrative review. World Neurosurg. 188, 1–14 (2024).
Bakhsheshian, J. et al. The performance of frailty in predictive modeling of short-term outcomes in the surgical management of metastatic tumors to the spine. Spine J. 22, 605–615 (2022).
Ton, A. et al. The impact of frailty on postoperative complications in geriatric patients undergoing multi-level lumbar fusion surgery. Eur. Spine J. 31, 1745–1753 (2022).
Shahrestani, S. et al. Inclusion of frailty improves predictive modeling for postoperative outcomes in surgical management of primary and secondary lumbar spine tumors. World Neurosurg. 153, e454–e463 (2021).
Thommen, R., Bowers, C. A., Segura, A. C., Roy, J. M. & Schmidt, M. H. baseline frailty measured by the risk analysis index and 30-day mortality after surgery for spinal malignancy: analysis of a prospective registry (2011–2020). Neurospine 21, 404–413 (2024).
Chan, V. et al. Frailty adversely affects outcomes of patients undergoing spine surgery: a systematic review. Spine J. 21, 988–1000 (2021).
Shahrestani, S. et al. Integration of chronological age does not improve the performance of a mixed-effect model using comorbidity burden and frailty to predict 90-day readmission after surgery for degenerative scoliosis. World Neurosurg. 187, e560–e567 (2024).
Miller, E. K. et al. External validation of the adult spinal deformity (ASD) frailty index (ASD-FI). Eur. Spine J. 27, 2331–2338 (2018).
Miller, E. K. et al. External validation of the Adult Spinal Deformity (ASD) Frailty Index (ASD-FI) in the scoli-RISK-1 patient database. Spine (Phila Pa 1976) 43, 1426–1431 (2018).
Adida, S. et al. Machine learning in spine surgery: a narrative review. Neurosurgery 94, 53–64 (2024).
Karandikar, P. et al. Machine learning applications of surgical imaging for the diagnosis and treatment of spine disorders: current state of the art. Neurosurgery 90, 372–382 (2022).
Ghosh, A. et al. Preoperative anemia is an unsuspecting driver of machine learning prediction of adverse outcomes after lumbar spinal fusion. Spine J. https://doi.org/10.1016/j.spinee.2025.01.031 (2025).
Yahanda, A. T. et al. Current applications and future implications of artificial intelligence in spine surgery and research: a narrative review and commentary. Global Spine J. 15, 1445–1454 (2025).
Shahrestani, S. et al. Optimizing predictive model performance in adult spinal deformity surgery: a comparative head-to-head analysis of learning models for perioperative complications. Neurosurg. Focus 58, E12 (2025).
van Buuren, S. & Groothuis-Oudshoorn, K. Mice: multivariate imputation by chained equations in R. J. Stat. Soft. 45, 1–67 (2011).
Pedregosa, F. et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Azur, M. J., Stuart, E. A., Frangakis, C. & Leaf, P. J. Multiple imputation by chained equations: what is it and how does it work?. Int. J. Methods Psychiatr. Res. 20, 40–49 (2011).
Hancock, J. T. & Khoshgoftaar, T. M. Survey on categorical data for neural networks. J. Big Data 7, 28 (2020).
Poslavskaya, E. & Korolev, A. Encoding categorical data: is there yet anything ‘hotter’ than one-hot encoding? Preprint at https://doi.org/10.48550/arXiv.2312.16930 (2023).
Gilbert, T. et al. Development and validation of a hospital frailty risk score focusing on older people in acute care settings using electronic hospital records: an observational study. The Lancet 391, 1775–1782 (2018).
Clegg, A. et al. Development and validation of an electronic frailty index using routine primary care electronic health record data. Age Ageing 45, 353–360 (2016).
Chin, M. et al. Comparing the hospital frailty risk score and the clinical frailty scale among older adults with chronic obstructive pulmonary disease exacerbation. JAMA Netw. Open 6, e2253692 (2023).
Pajewski, N. M., Lenoir, K., Wells, B. J., Williamson, J. D. & Callahan, K. E. Frailty screening using the electronic health record within a medicare accountable care organization. J. Gerontol. A Biol. Sci. Med. Sci. 74, 1771–1777 (2019).
Krell, R. W., Girotti, M. E. & Dimick, J. B. Extended length of stay after surgery: complications, inefficient practice, or sick patients?. JAMA Surg. 149, 815–820 (2014).
Olson, R. S. & Moore, J. H. TPOT: A tree-based pipeline optimization tool for automating machine learning. In Proc. of the Workshop on Automatic Machine Learning 66–74 (PMLR 2016).
Olson, R. S. et al. automating biomedical data science through tree-based pipeline optimization. In applications of evolutionary computation (eds. Squillero, G. & Burelli, P.) 123–137 (Springer International Publishing, Cham). https://doi.org/10.1007/978-3-319-31204-0_9 (2016)
Koza, J. R. Genetic programming as a means for programming computers by natural selection. Stat. Comput. 4, 87–112 (1994).
Shanthi, D. L. & Chethan, N. Genetic algorithm based hyper-parameter tuning to improve the performance of machine learning models. SN Comput. Sci. 4, 119 (2022).
Alibrahim, H. & Ludwig, S. A. Hyperparameter optimization: comparing genetic algorithm against grid search and bayesian optimization. In 2021 IEEE Congress on Evolutionary Computation (CEC) 1551–1559 https://doi.org/10.1109/CEC45853.2021.9504761 (2021)
Vincent, A. M. & Jidesh, P. An improved hyperparameter optimization framework for AutoML systems using evolutionary algorithms. Sci. Rep. 13, 4737 (2023).
Salehi, F., Abbasi, E. & Hassibi, B. The impact of regularization on high-dimensional logistic regression. In advances in neural information processing systems 32 (Curran Associates, Inc., 2019).
Winter, E. Chapter 53 The shapley value. In handbook of game theory with economic applications 3 2025–2054 (Elsevier 2002).
Lundberg, S. M. & Lee, S.-I. A Unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems 30 (Curran Associates, Inc., 2017).
Band, S. S. et al. Application of explainable artificial intelligence in medical health: A systematic review of interpretability methods. Inform. Med. Unlocked 40, 101286 (2023).
Yang, C. C. Explainable artificial intelligence for predictive modeling in healthcare. J. Healthc. Inform Res. 6, 228–239 (2022).
Youden, W. J. Index for rating diagnostic tests. Cancer 3, 32–35 (1950).
Urbanowicz, R. J., Olson, R. S., Schmitt, P., Meeker, M. & Moore, J. H. Benchmarking relief-based feature selection methods for bioinformatics data mining. J. Biomed. Inform. 85, 168–188 (2018).
Ritchie, M. D. et al. Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am. J. Hum. Genetics 69, 138–147 (2001).
Granizo-Mackenzie, D. & Moore, J. H. Multiple threshold spatially uniform relieff for the genetic analysis of complex human diseases. In Evolutionary Computation Machine Learning and Data Mining in Bioinformatics (eds. Vanneschi, L., Bush, W. S. & Giacobini, M.) 1–10 (Springer, Berlin, Heidelberg). https://doi.org/10.1007/978-3-642-37189-9_1 (2013)
Freda, P. J., Ye, S., Zhang, R., Moore, J. H. & Urbanowicz, R. J. Assessing the limitations of relief-based algorithms in detecting higher-order interactions. BioData Mining 17, 37 (2024).
World Health Organization. Haemoglobin concentrations for the diagnosis of anaemia and assessment of severity. https://www.who.int/publications/i/item/WHO-NMH-NHD-MNM-11.1.
Shahrestani, S., Brown, N. J., Yue, J. K. & Tan, L. A. Developing mixed-effects models to optimize prediction of postoperative outcomes in a modern sample of over 450,000 patients undergoing elective cervical spine fusion surgery. Clin. Spine Surg. 36, E536–E544 (2023).
Kim, D. U. et al. Central sarcopenia, frailty and comorbidity as predictor of surgical outcome in elderly patients with degenerative spine disease. J. Korean Neurosurg. Soc. 64, 995–1003 (2021).
Han, D. et al. Comparison of three frailty evaluation tools in predicting postoperative adverse events in older patients undergoing lumbar fusion surgery: a prospective cohort study of 240 patients. Eur. Spine J. 34, 1741–1749 (2025).
Hersh, A. M. et al. Comparison of frailty metrics and the charlson comorbidity Index for predicting adverse outcomes in patients undergoing surgery for spine metastases. J. Neurosurg. Spine 36, 849–857 (2022).
Song, X., Mitnitski, A., Cox, J. & Rockwood, K. Comparison of machine learning techniques with classical statistical models in predicting health outcomes. Stud. Health Technol. Inform 107, 736–740 (2004).
Huang, Y., Li, W., Macheret, F., Gabriel, R. A. & Ohno-Machado, L. A tutorial on calibration measurements and calibration models for clinical prediction models. J. Am. Med. Inform. Assoc. 27, 621–633 (2020).
Stevens, R. J. & Poppe, K. K. Validation of clinical prediction models: what does the “calibration slope” really measure?. J. Clin. Epidemiol. 118, 93–99 (2020).
Van Calster, B. et al. Calibration: the Achilles heel of predictive analytics. BMC Med. 17, 230 (2019).
Fluss, R., Faraggi, D. & Reiser, B. Estimation of the youden index and its associated cutoff point. Biom. J. 47, 458–472 (2005).
Raj, R., Kannath, S. K., Mathew, J. & Sylaja, P. N. AutoML accurately predicts endovascular mechanical thrombectomy in acute large vessel ischemic stroke. Front. Neurol. 14, 1259958 (2023).
Wang, J., Xue, Q., Zhang, C. W. J., Wong, K. K. L. & Liu, Z. Explainable coronary artery disease prediction model based on AutoGluon from AutoML framework. Front. Cardiovasc. Med. 11, 1360548 (2024).
Zhu, Z. et al. Integrating machine learning and the SHapley Additive exPlanations (SHAP) framework to predict lymph node metastasis in gastric cancer patients based on inflammation indices and peripheral lymphocyte subpopulations. J. Inflamm. Res. 17, 9551–9566 (2024).
Wang, L. et al. Identification of testicular cancer with T2-weighted MRI-based radiomics and automatic machine learning. BMC Cancer 25, 563 (2025).