Your browser doesn't support javascript.
loading
How Good Is Machine Learning in Predicting All-Cause 30-Day Hospital Readmission? Evidence From Administrative Data.
Li, Qing; Yao, Xueqin; Échevin, Damien.
Afiliación
  • Li Q; The Innovation Research Team on Eco-Logistics, Industrial Agglomeration and Regional Development, School of Economics and Management, Xinjiang University, Urumqi, China.
  • Yao X; The Innovation Research Team on Eco-Logistics, Industrial Agglomeration and Regional Development, School of Economics and Management, Xinjiang University, Urumqi, China.
  • Échevin D; Research Center, Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, Canada; Department of Economics, Université Laval, Quebec City, Canada; Apexmachina, Quebec, Canada. Electronic address: damien.echevin@usherbrooke.ca.
Value Health ; 23(10): 1307-1315, 2020 10.
Article en En | MEDLINE | ID: mdl-33032774
OBJECTIVES: Hospital readmission is a main cost driver for healthcare systems, but existing works often had poor or moderate predictive results. Although the available information differs in different studies, improving prediction is different from the search for important explanatory variables. With large sample size and abundant information, this study explores state-of-the-art machine-learning algorithms and shows their performance in prediction. METHODS: Using administrative data on 1 631 611 hospital stays from Quebec between 1995 and 2012, we predict the probability of 30-day readmission at hospital admission and discharge. We compare the performance between traditional logistic regression, logistic regression with penalization, and more recent machine-learning algorithms such as random forest, deep learning, and extreme gradient boosting. RESULTS: After a 10-fold cross-validation on the training set (80% of the data), machine learning produced very good results on a separate hold-out test set (20% of the data). The importance of explanatory variables is not the same for different algorithms. The area under receiver operating characteristic curve (AUC) reached above 0.79 at hospital admission and above 0.88 at hospital discharge. Diagnostic codes, which include many different categories, are among the most predictive variables. Logistic regression with penalization also produced good results, but a standard logistic regression failed without penalization. The good results are confirmed by calibration curves. CONCLUSION: Although the identification of those at highest risk of readmission is just 1 step to preventing hospital readmissions, 30-day readmission is highly predictable with machine learning.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Readmisión del Paciente / Aprendizaje Automático Tipo de estudio: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Límite: Female / Humans / Male / Middle aged País/Región como asunto: America do norte Idioma: En Revista: Value Health Asunto de la revista: FARMACOLOGIA Año: 2020 Tipo del documento: Article País de afiliación: China Pais de publicación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Readmisión del Paciente / Aprendizaje Automático Tipo de estudio: Diagnostic_studies / Etiology_studies / Prognostic_studies / Risk_factors_studies Límite: Female / Humans / Male / Middle aged País/Región como asunto: America do norte Idioma: En Revista: Value Health Asunto de la revista: FARMACOLOGIA Año: 2020 Tipo del documento: Article País de afiliación: China Pais de publicación: Estados Unidos