Emergency Department Readmission Risk Prediction: A Case Study in Chile

Authors: Arkaitz Artetxe Vallejo Manuel Graña Andoni Beristain Iraola Sebastian Rios

Date: 19.06.2017


PDF

Abstract

Short time readmission prediction in Emergency Departments (ED) is a valuable tool to improve both the ED management and the healthcare quality. It helps identifying patients requiring further post-discharge attention as well as reducing healthcare costs. As in many other medical domains, patient readmission data is heavily imbalanced, i.e. the minority class is very infrequent, which is a challenge for the construction of accurate predictors using machine learning tools. We have carried computational experiments on a dataset composed of ED admission records spanning more than 100000 patients in 3 years, with a highly imbalanced distribution. We employed various approaches for dealing with this highly imbalanced dataset in combination with different classification algorithms and compared their predictive power for the estimation of the ED readmission probability within 72 h after discharge. Results show that random undersampling and Bagging (RUSBagging) in combination with Random Forest achieves the best results in terms of Area Under ROC Curve (AUC).

BIB_text

@Article {
title = {Emergency Department Readmission Risk Prediction: A Case Study in Chile},
pages = {11-20},
volume = {10338},
keywds = {

Readmission risk, Imbalanced data, Classification, Bagging


}
abstract = {

Short time readmission prediction in Emergency Departments (ED) is a valuable tool to improve both the ED management and the healthcare quality. It helps identifying patients requiring further post-discharge attention as well as reducing healthcare costs. As in many other medical domains, patient readmission data is heavily imbalanced, i.e. the minority class is very infrequent, which is a challenge for the construction of accurate predictors using machine learning tools. We have carried computational experiments on a dataset composed of ED admission records spanning more than 100000 patients in 3 years, with a highly imbalanced distribution. We employed various approaches for dealing with this highly imbalanced dataset in combination with different classification algorithms and compared their predictive power for the estimation of the ED readmission probability within 72 h after discharge. Results show that random undersampling and Bagging (RUSBagging) in combination with Random Forest achieves the best results in terms of Area Under ROC Curve (AUC).


}
isbn = {978-3-319-59773-7},
isi = {1},
doi = {10.1007/978-3-319-59773-7_2},
date = {2017-06-19},
year = {2017},
}
Vicomtech

Parque Científico y Tecnológico de Gipuzkoa,
Paseo Mikeletegi 57,
20009 Donostia / San Sebastián (Spain)

+(34) 943 309 230

close overlay