Machine Learning and Causal Approaches to Predict Readmissions and Its Economic Consequences Among Canadian Patients With Heart Disease: Retrospective Study

doi:10.2196/41725

Original Paper

¹Department of Chemistry, Faculty of Science, The University of British Columbia, Vancouver, BC, Canada

²Department of Computer Science, Faculty of Science, The University of British Columbia, Vancouver, BC, Canada

³School of Biomedical Engineering, Faculty of Applied Sciences, University of British Columbia, Vancouver, BC, Canada

*these authors contributed equally

Corresponding Author:

Ethan Rajkumar

Department of Chemistry, Faculty of Science

The University of British Columbia

2036 Main Mall

Vancouver, BC

Canada

Phone: 1 (604) 822 3266

Email: er12da@student.ubc.ca

Background: Unplanned patient readmissions within 30 days of discharge pose a substantial challenge in Canadian health care economics. To address this issue, risk stratification, machine learning, and linear regression paradigms have been proposed as potential predictive solutions. Ensemble machine learning methods, such as stacked ensemble models with boosted tree algorithms, have shown promise for early risk identification in specific patient groups.

Objective: This study aims to implement an ensemble model with submodels for structured data, compare metrics, evaluate the impact of optimized data manipulation with principal component analysis on shorter readmissions, and quantitatively verify the causal relationship between expected length of stay (ELOS) and resource intensity weight (RIW) value for a comprehensive economic perspective.

Methods: This retrospective study used Python 3.9 and streamlined libraries to analyze data obtained from the Discharge Abstract Database covering 2016 to 2021. The study used 2 sub–data sets, clinical and geographical data sets, to predict patient readmission and analyze its economic implications, respectively. A stacking classifier ensemble model was used after principal component analysis to predict patient readmission. Linear regression was performed to determine the relationship between RIW and ELOS.

Results: The ensemble model achieved precision and slightly higher recall (0.49 and 0.68), indicating a higher instance of false positives. The model was able to predict cases better than other models in the literature. Per the ensemble model, readmitted women and men aged 40 to 44 and 35 to 39 years, respectively, were more likely to use resources. The regression tables verified the causality of the model and confirmed the trend that patient readmission is much more costly than continued hospital stay without discharge for both the patient and health care system.

Conclusions: This study validates the use of hybrid ensemble models for predicting economic cost models in health care with the goal of reducing the bureaucratic and utility costs associated with hospital readmissions. The availability of robust and efficient predictive models, as demonstrated in this study, can help hospitals focus more on patient care while maintaining low economic costs. This study predicts the relationship between ELOS and RIW, which can indirectly impact patient outcomes by reducing administrative tasks and physicians’ burden, thereby reducing the cost burdens placed on patients. It is recommended that changes to the general ensemble model and linear regressions be made to analyze new numerical data for predicting hospital costs. Ultimately, the proposed work hopes to emphasize the advantages of implementing hybrid ensemble models in forecasting health care economic cost models, empowering hospitals to prioritize patient care while simultaneously decreasing administrative and bureaucratic expenses.

JMIR Form Res 2023;7:e41725

doi:10.2196/41725

Keywords

patient readmission (11); health care economics (1); ensemble (5); prediction model (107); classification (90); linear regression resource intensity value (1); hospital (162); health care (577); principal component analysis (14); PCA (2)

Background

An open problem that has arisen in Canadian health care economics is the detrimental cost caused by unplanned patient readmissions in hospitals. North American Hospitals have defined patient readmissions as the admittance of patients within 30 days after discharge [Goldfield NI, McCullough EC, Hughes JS, Tang AM, Eastman B, Rawlins LK, et al. Identifying potentially preventable readmissions. Health Care Financ Rev 2008;30(1):75-91 [FREE Full text] [Medline]1]. In Canada, 1 in 11 patients experience readmittance, resulting in expenses of >2.3 billion Canadian dollars per year [Goldfield NI, McCullough EC, Hughes JS, Tang AM, Eastman B, Rawlins LK, et al. Identifying potentially preventable readmissions. Health Care Financ Rev 2008;30(1):75-91 [FREE Full text] [Medline]1,Samsky MD, Ambrosy AP, Youngson E, Liang L, Kaul P, Hernandez AF, et al. Trends in readmissions and length of stay for patients hospitalized with heart failure in Canada and the United States. JAMA Cardiol 2019 May 01;4(5):444-453 [FREE Full text] [CrossRef] [Medline]2]. Consequently, this enormous expense exemplifies the bidirectional consequences of patient readmission by placing strain on individualized patient care while creating additional expenses for hospitals [Goldfield NI, McCullough EC, Hughes JS, Tang AM, Eastman B, Rawlins LK, et al. Identifying potentially preventable readmissions. Health Care Financ Rev 2008;30(1):75-91 [FREE Full text] [Medline]1,Samsky MD, Ambrosy AP, Youngson E, Liang L, Kaul P, Hernandez AF, et al. Trends in readmissions and length of stay for patients hospitalized with heart failure in Canada and the United States. JAMA Cardiol 2019 May 01;4(5):444-453 [FREE Full text] [CrossRef] [Medline]2]. Furthermore, the COVID-19 pandemic has exacerbated many inequities that revolved around patient readmission owing to inflation. For example, patients with lower income residing in less wealthy neighborhoods were at a higher risk of being readmitted after treatment [Brahmania M, Wiskar K, Walley KR, Celi LA, Rush B. Lower household income is associated with an increased risk of hospital readmission in patients with decompensated cirrhosis. J Gastroenterol Hepatol 2021 Apr 14;36(4):1088-1094 [FREE Full text] [CrossRef] [Medline]3]. Reducing these high readmission rates would prove useful in improving patient outcomes while alleviating financial concerns, for patients and hospitals alike [Hellsten E, Liu G, Yue E, Gao G, Sutherland JM. Improving hospital quality through payment reforms: a policy impact analysis in British Columbia. Healthc Manage Forum 2016 Jan 08;29(1):33-38. [CrossRef] [Medline]4,Cropley S. The relationship-based care model: evaluation of the impact on patient satisfaction, length of stay, and readmission rates. J Nurs Adm 2012 Jun;42(6):333-339. [CrossRef] [Medline]5].

One of the ways to help reduce patient readmissions is to adopt a preventive approach [Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC Med Res Methodol 2021 May 06;21(1):96 [FREE Full text] [CrossRef] [Medline]6]. Risk stratification provides a standardized criterion for assigning a risk status to patients for direct care and to improve overall health outcomes. Machine learning (ML) paradigms have been used to guide clinicians in their efforts to enhance diagnosis and risk stratification [Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC Med Res Methodol 2021 May 06;21(1):96 [FREE Full text] [CrossRef] [Medline]6,Wang H, Cui Z, Chen Y, Avidan M, Abdallah AB, Kronzer A. Predicting hospital readmission via cost-sensitive deep learning. IEEE/ACM Trans Comput Biol Bioinf 2018 Nov 1;15(6):1968-1978. [CrossRef]7]. Using ML, clinicians can be guided to make accurate diagnoses, improve patient outcomes, and even identify patients at risk of developing certain conditions that can be translatable to readmission and its economic cost. A study by Baruah [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8] adopted a detailed approach by analyzing electronic health records using a word convolutional neural network using a “Bag-of-Words.” Although using discharge summaries can allow for the personalization of patient prediction, a work-around for the number of resources required to train a high-throughput model such as word convolutional neural network is of high concern [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8]. Furthermore, Baruah’s [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8] model was limited in addressing the high class imbalance in shorter time frame readmission tasks in contrast to longer time frame readmission tasks [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8]. Solving this short time frame readmission problem can allow for a faster prevention of unplanned patient readmission [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8,Kripalani S, Theobald CN, Anctil B, Vasilevskis EE. Reducing hospital readmission rates: current strategies and future directions. Annu Rev Med 2014 Jan 14;65(1):471-485 [FREE Full text] [CrossRef] [Medline]9].

Although deep learning models were used for risk stratification in health care, they had limited success because of the large amount of data required for training [Wang H, Cui Z, Chen Y, Avidan M, Abdallah AB, Kronzer A. Predicting hospital readmission via cost-sensitive deep learning. IEEE/ACM Trans Comput Biol Bioinf 2018 Nov 1;15(6):1968-1978. [CrossRef]7,Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8]. In addition, incorporating comorbidities and their time periods in models could lead to the confounding of other variables [Kripalani S, Theobald CN, Anctil B, Vasilevskis EE. Reducing hospital readmission rates: current strategies and future directions. Annu Rev Med 2014 Jan 14;65(1):471-485 [FREE Full text] [CrossRef] [Medline]9-Kansagara D, Englander H, Salanitro A, Kagen D, Theobald C, Freeman M, et al. Risk prediction models for hospital readmission: a systematic review. JAMA 2011 Oct 19;306(15):1688-1698 [FREE Full text] [CrossRef] [Medline]12]. However, Ben-Assuli et al [Ben-Assuli O, Padman R. Trajectories of repeated readmissions of chronic disease patients: risk stratification, profiling, and prediction. MIS Q 2020 Jan 01;44(1):201-226. [CrossRef]13] found that using multiple time periods and ensemble ML methods on large-scale data enabled early risk identification in specific patient groups [Ben-Assuli O, Padman R. Trajectories of repeated readmissions of chronic disease patients: risk stratification, profiling, and prediction. MIS Q 2020 Jan 01;44(1):201-226. [CrossRef]13]. Stacked ensemble models, including those with boosted tree algorithms, demonstrated strong performance in predicting unplanned patient readmissions by reducing bias from individual models and sensitivities to rare classes [Kripalani S, Theobald CN, Anctil B, Vasilevskis EE. Reducing hospital readmission rates: current strategies and future directions. Annu Rev Med 2014 Jan 14;65(1):471-485 [FREE Full text] [CrossRef] [Medline]9,Ikemura K, Bellin E, Yagi Y, Billett H, Saada M, Simone K, et al. Using automated machine learning to predict the mortality of patients with COVID-19: prediction model development study. J Med Internet Res 2021 Feb 26;23(2):e23458 [FREE Full text] [CrossRef] [Medline]10]. These models also offered better interpretability for health care workers and nonexperts in ML, thanks to their transparent results [Kripalani S, Theobald CN, Anctil B, Vasilevskis EE. Reducing hospital readmission rates: current strategies and future directions. Annu Rev Med 2014 Jan 14;65(1):471-485 [FREE Full text] [CrossRef] [Medline]9-Ko H, Chung H, Kang WS, Park C, Kim DW, Kim SE, et al. An artificial intelligence model to predict the mortality of COVID-19 patients at hospital admission time using routine blood samples: development and validation of an ensemble model. J Med Internet Res 2020 Dec 23;22(12):e25442 [FREE Full text] [CrossRef] [Medline]11].

After determining whether the patients will be readmitted within the next few days, the economic consequences to both the hospital and the patient will be estimated [Hansen KN, Morbitzer KA, Waldron KM, Amerine LB. Development of novel formulas to determine hospital and pharmacy opportunities to reduce extended length of stay. J Pharmacy Technol 2016 Nov 14;33(1):15-22. [CrossRef]14,Barrett B, Way C, McDonald J, Parfrey P. Hospital utilization, efficiency and access to care during and shortly after restructuring acute care in Newfoundland and Labrador. J Health Serv Res Policy 2005 Oct;10 Suppl 2:S2:31-S2:37. [CrossRef] [Medline]15]. This involves finding the causal relationship between patients’ expected length of stay (ELOS) and their resource use, which are both continuous variables for determining the economic aftermath of hospital readmission [Hansen KN, Morbitzer KA, Waldron KM, Amerine LB. Development of novel formulas to determine hospital and pharmacy opportunities to reduce extended length of stay. J Pharmacy Technol 2016 Nov 14;33(1):15-22. [CrossRef]14,Barrett B, Way C, McDonald J, Parfrey P. Hospital utilization, efficiency and access to care during and shortly after restructuring acute care in Newfoundland and Labrador. J Health Serv Res Policy 2005 Oct;10 Suppl 2:S2:31-S2:37. [CrossRef] [Medline]15]. However, if given a time period, linear regressions may prove useful in predicting and comparing the trends behind the relationships between variables such as ELOS and readmission in real time [Hansen KN, Morbitzer KA, Waldron KM, Amerine LB. Development of novel formulas to determine hospital and pharmacy opportunities to reduce extended length of stay. J Pharmacy Technol 2016 Nov 14;33(1):15-22. [CrossRef]14,Barrett B, Way C, McDonald J, Parfrey P. Hospital utilization, efficiency and access to care during and shortly after restructuring acute care in Newfoundland and Labrador. J Health Serv Res Policy 2005 Oct;10 Suppl 2:S2:31-S2:37. [CrossRef] [Medline]15].

Goal of This Study

The objectives of the proposed work were 3-fold. The first, main goal of the project was to implement an ensemble model with individual submodels on the structured data and compare the resulting metrics to metrics resulting from other models that have also explored patient readmission in a heart-disease context. The second goal was to determine the contribution of optimized data manipulation through principal component analysis (PCA) to solving the problem of shorter time frame readmissions. The study also aimed to verify the causal relationship between the ELOS and resource intensity weight (RIW) value. Providing an understanding of this relationship in a quantitative and causal manner can allow for an in-depth economic perspective, as opposed to only readmittance within 30 days.

Ultimately, the economic and predictive aspects of this model are intended to provide a view on resource allocation for health institutes to better predict readmittance and improve patient-clinician outcomes [Cropley S. The relationship-based care model: evaluation of the impact on patient satisfaction, length of stay, and readmission rates. J Nurs Adm 2012 Jun;42(6):333-339. [CrossRef] [Medline]5].

Resources Used

Population Study

The study used a systematic methodology with Python 3.9 and streamlined libraries to analyze the data obtained from the Discharge Abstract Database (DAD) covering 2016 to 2021 [Discharge abstract database metadata (DAD). Canadian Institute for Health Information. URL: https://www.cihi.ca/en/discharge-abstract-database-metadata-dad#:~:text=Overview,DAD%20to%20capture%20day%20surgery [accessed 2023-03-14] 16]. Access to the database was facilitated through the Abacus Data Network, a collaborative effort between several universities [Abacus data network. re3data.org. URL: http://doi.org/10.17616/R3692H [accessed 2023-03-21] 17]. The study used 2 sub–data sets, clinical and geographical data sets, to predict patient readmission and analyze economic implications, respectively. The comprehensive documentation provided by Statistics Canada allowed for a robust analysis of the data set. The workflow, illustrated in Figure 1, shows the process of data analysis and visualization using the matplotlib and seaborn libraries.

Figure 1. Study workflow: data collection (blue), data preparation and machine learning implementation (orange), and outputs (green). DAD: Discharge Abstract Database; PCA: principal component analysis.

Design

Similar to the study by Baruah [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8], during clinical and geographical preprocessing, individuals were screened for specific criteria. Using the International Classification of Diseases, 10th revision (ICD-10) and major complication or comorbidity (MCC) codes similar to the models by Baruah [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8] and Liu et al [Liu S, Wang Y, Wen A, Wang L, Hong N, Shen F, et al. Implementation of a cohort retrieval system for clinical data repositories using the observational medical outcomes partnership common data model: proof-of-concept system validation. JMIR Med Inform 2020 Oct 06;8(10):e17376 [FREE Full text] [CrossRef] [Medline]18], the examination of adult patients and exclusion of individuals aged <18 years were performed to prevent any confounding variables “spilling” onto both models. The ICD-10 PCA codes for the diseases included I092, I098, I099, I100, I101, I11, I13, I500, I501, I509, I516, I518, I519, I520, I521, and I528 [Discharge abstract database metadata (DAD). Canadian Institute for Health Information. URL: https://www.cihi.ca/en/discharge-abstract-database-metadata-dad#:~:text=Overview,DAD%20to%20capture%20day%20surgery [accessed 2023-03-14] 16,Abacus data network. re3data.org. URL: http://doi.org/10.17616/R3692H [accessed 2023-03-21] 17]. As for MCC codes, only code 5 corresponded to cardiovascular diseases [Discharge abstract database metadata (DAD). Canadian Institute for Health Information. URL: https://www.cihi.ca/en/discharge-abstract-database-metadata-dad#:~:text=Overview,DAD%20to%20capture%20day%20surgery [accessed 2023-03-14] 16,Abacus data network. re3data.org. URL: http://doi.org/10.17616/R3692H [accessed 2023-03-21] 17]. Factors that were not considered were clinical gestation of delivery (“GES_AGRP”) along with weight group (“WGT_GRP”), as they were only a direct consequence of the age group that was eliminated [Discharge abstract database metadata (DAD). Canadian Institute for Health Information. URL: https://www.cihi.ca/en/discharge-abstract-database-metadata-dad#:~:text=Overview,DAD%20to%20capture%20day%20surgery [accessed 2023-03-14] 16,Abacus data network. re3data.org. URL: http://doi.org/10.17616/R3692H [accessed 2023-03-21] 17].

Clinical Data Set

Clinical Preprocessing

Figure 2 shows the manipulation done and models trained on the clinical data set of DAD. Isolating for a group of patients who share similar clinical characteristics or medical conditions can be useful for identifying trends and patterns in patient care and outcomes, as well as for conducting research on specific medical conditions such as heart disease.

A clinical preprocessing step was performed to isolate for specific criteria and remove any potential confounding variables. Arbitrary admission and discharge dates were chosen based on previous calculations to avoid errors or inconsistencies in the data set. To ensure that the minimum number of relative admission dates was ≥0, dates were shifted to a minimum of January 5 of the corresponding data set year. This adjustment enabled the creation of the “LTORET30Days” columns. For feature selection and dimensionality reduction, PCA was used, as it was a common methodology used for high-dimensionality data sets.

Figure 2. Clinical workflow: data collection (blue), data preparation and machine learning implementation (orange), implementations (purple), and outputs (green). LGBM: LightGBM; PCA: principal component analysis.

PCA Process

According to the PCA criterion, the components to use were described by the minimum number of features required to obtain a cumulative variance of at least 80% [Nidoi J, Muttamba W, Walusimbi S, Imoko JF, Lochoro P, Ictho J, et al. Impact of socio-economic factors on tuberculosis treatment outcomes in north-eastern Uganda: a mixed methods study. BMC Public Health 2021 Nov 26;21(1):2167 [FREE Full text] [CrossRef] [Medline]19,Xu Q, Ni S, Wu F, Liu F, Ye X, Mougin B, et al. Investigation of variation in gene expression profiling of human blood by extended principle component analysis. PLoS One 2011 Oct 27;6(10):e26905 [FREE Full text] [CrossRef] [Medline]20]. The aim was to reduce the dimensionality of the feature space while retaining as much of the original variance as possible [Nidoi J, Muttamba W, Walusimbi S, Imoko JF, Lochoro P, Ictho J, et al. Impact of socio-economic factors on tuberculosis treatment outcomes in north-eastern Uganda: a mixed methods study. BMC Public Health 2021 Nov 26;21(1):2167 [FREE Full text] [CrossRef] [Medline]19-Zhang W, Cheng L, Huang G. Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest. Genes Genomics 2021 Oct 07;43(10):1143-1155. [CrossRef] [Medline]21]. After obtaining an encoded vector in the form of an array, the data were run through several ensemble algorithms. The ensemble algorithm consisted of several submodels, including random forest classifiers, XGBoost (XGB), and LightGBM (LGBM). Each subclassifier’s output was stacked, allowing for a logistic regression to learn the weighted distribution of the subclassifiers to ensure high predictive accuracy. After dimensionality reduction and splitting into training and testing data sets, the final sample size was n=83,083 for nonreadmitted patients and n=10,271 for readmitted patients.

Submodels: LGBM and XGB

LGBM and XGB presented a relative advantage with regard to efficient computation and high accuracy on a wide range of data sets, including those with high dimensionality and categorical features [Chiu C, Wu C, Chien T, Kao L, Li C, Jiang H. Applying an improved stacking ensemble model to predict the mortality of ICU patients with heart failure. J Clin Med 2022 Oct 31;11(21):6460 [FREE Full text] [CrossRef] [Medline]22-Liang Y, Zheng W, Lee W. Nonlinear associations between medical expenditure, perceived medical attitude, and sociodemographics, and older adults' self-rated health in China: applying the extreme gradient boosting model. Healthcare (Basel) 2021 Dec 26;10(1):39 [FREE Full text] [CrossRef] [Medline]25]. Both methods required sequential decision tree generation via error combination or level-wise tree growth. Having dimensionality reduced data would have decreased the maximum function, δ_loss, for LGBM, allowing for lower error and lower changes in ∇_prediction [El-Rashidy N, El-Sappagh S, Abuhmed T, Abdelrazek S, El-Bakry HM. Intensive care unit mortality prediction: an improved patient-specific stacking ensemble model. IEEE Access 2020;8:133541-133564. [CrossRef]26,Pfob A, Lu S, Sidey-Gibbons C. Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison. BMC Med Res Methodol 2022 Nov 01;22(1):282 [FREE Full text] [CrossRef] [Medline]27]. Similarly, it was extrapolated that a higher maximum depth for XGB would be achieved, as the number of features was lower [El-Rashidy N, El-Sappagh S, Abuhmed T, Abdelrazek S, El-Bakry HM. Intensive care unit mortality prediction: an improved patient-specific stacking ensemble model. IEEE Access 2020;8:133541-133564. [CrossRef]26,Pfob A, Lu S, Sidey-Gibbons C. Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison. BMC Med Res Methodol 2022 Nov 01;22(1):282 [FREE Full text] [CrossRef] [Medline]27]. An in-depth analysis about LGBM and XGB can be found in Figures S1-S11 in

Multimedia Appendix 1

In-depth descriptions and mathematical formalisms for all of the submodels and ensemble models, cumulative variance and principal component analysis results and confusion matrices of all of the previous models and graphs for the linear regression, all the codes, and references.

DOCX File , 1057 KB Multimedia Appendix 1.

Random Forest

Random forest was chosen to improve the interpretability of the model when used in conjunction with PCA [Zhang W, Cheng L, Huang G. Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest. Genes Genomics 2021 Oct 07;43(10):1143-1155. [CrossRef] [Medline]21]. As the data set had a large number of features, random forest’s computational cost was high. However, after performing dimensionality reduction using PCA, the computational cost of random forest was substantially reduced, making it a practical option for large data sets [Zhang W, Cheng L, Huang G. Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest. Genes Genomics 2021 Oct 07;43(10):1143-1155. [CrossRef] [Medline]21]. During the testing phase, the random forest classifier predicted the final decision of a new data point, noted by C^Brf^(x), by aggregating the prediction results of all decision trees using a majority vote. The classifier selected the class with the highest number of votes as the final prediction, resulting in an accurate and interpretable model. The algorithm design for random forest is formulated in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1.

Ensemble Models: Logistic Regression

The ensemble model used in this study was a stacking classifier model with a metamodel (final estimator), which was a logistic regression model [El-Rashidy N, El-Sappagh S, Abuhmed T, Abdelrazek S, El-Bakry HM. Intensive care unit mortality prediction: an improved patient-specific stacking ensemble model. IEEE Access 2020;8:133541-133564. [CrossRef]26]. The metamodel took the outputs of the base models as inputs and optimally combined their predictions to ensure high predictive performance [El-Rashidy N, El-Sappagh S, Abuhmed T, Abdelrazek S, El-Bakry HM. Intensive care unit mortality prediction: an improved patient-specific stacking ensemble model. IEEE Access 2020;8:133541-133564. [CrossRef]26]. The ensemble model consisted of 4 base models and was defined, trained, and tested using the scikit-learn's ensemble module, which was the default. This produced an optimal workflow, which is presented in Figure 2. Detailed formalisms are provided in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1.

Hyperparameter Tuning

To optimize the performance of each base model, hyperparameter tuning was done using a range of values for each parameter [Pfob A, Lu S, Sidey-Gibbons C. Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison. BMC Med Res Methodol 2022 Nov 01;22(1):282 [FREE Full text] [CrossRef] [Medline]27]. The models were evaluated based on their F₁-score or recall, and scikit-learn's GridSearchCV and RandomizedSearchCV were used to fine-tune the parameters [Pfob A, Lu S, Sidey-Gibbons C. Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison. BMC Med Res Methodol 2022 Nov 01;22(1):282 [FREE Full text] [CrossRef] [Medline]27].

In addition, a custom function was used to optimize the final estimator of the stacking model, specifically for the logistic regression component [Zhang W, Cheng L, Huang G. Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest. Genes Genomics 2021 Oct 07;43(10):1143-1155. [CrossRef] [Medline]21]. Table 1 lists the parameters used for all the models. By tuning the hyperparameters of the base models and customizing the final estimator for the stacking model, we aimed to improve the overall performance and accuracy of the ML model [Pfob A, Lu S, Sidey-Gibbons C. Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison. BMC Med Res Methodol 2022 Nov 01;22(1):282 [FREE Full text] [CrossRef] [Medline]27-Koo A, Elsamadicy A, Lin IH, David W, Freedman IG, Isaac G, et al. Predictors of extended length of stay following treatment of unruptured adult cerebral aneurysms a study of the national inpatient sample. Neurosurgery 2020 Dec;67(Supplement_1). [CrossRef]30].

Table 1. Tuned parameters organized according to submodels and estimators.

Model	Parameters
XGB^a	max_depth, n_estimators, and learning_rate
Random forest	bootstrap and max_depth
LGBM^b	learning_rate, n_estimators, num_leaves, min_child_samples, subsample, max_depth, colsample_bytree, reg_alpha, reg_lambda, and min_data_in_leaf
Logistic regression (stacking ensemble)	solver, penalty, and C

^aXGB: XGBoost.

^bLGBM: LightGBM.

Evaluation of the Ensemble Model Outcomes

Evaluation Metrics

Statistical analysis was performed to ensure that the model was robust in and valid for improving the patient outcomes. Three evaluation metrics were used to evaluate the robustness of the model.

Precision is the ratio between the true positive observations and total positive observations obtained from the confusion matrix [Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]31]. In other words, it provides the number of retrieved items that are relevant. This was a crucial quantity, especially given that there was high class imbalance:
Recall is the ratio between the number of true positives and the sum of the number of true positives and number of false negatives [Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]31]. The recall score provides the number of relevant items retrieved [Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]31]. The recall score was useful in determining the model validity regardless of class imbalance owing to the measurement of false negatives:
Balancing the 2 quantities required the use of F₁-score, which serves as the harmonic mean of the precision score and recall score [Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]31]:

All the scores for the hyperparameter-tuned data were plotted on a bar graph to ensure a clear presentation of the data [Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]31].

Geographical Data Set

Feature Selections

To determine the relationship between ELOS and RIW, 2 continuous variables that have been shown to be positively correlated with improved patient outcomes, a linear regression analysis was conducted [Bach J. Causality in medicine. C R Biol 2019 Mar;342(3-4):55-57 [FREE Full text] [CrossRef] [Medline]32-van Oostveen CJ, Gouma DJ, Bakker PJ, Ubbink DT. Quantifying the demand for hospital care services: a time and motion study. BMC Health Serv Res 2015 Jan 22;15(1):15 [FREE Full text] [CrossRef] [Medline]34]. RIW is a weighted measure of the anticipated use of resources associated with various demographic, diagnostic, and surgical procedure characteristics of an individual [Pink GH, Bolley H. Physicians in health care management: 4. Case mix groups and resource intensity weights: physicians and hospital funding. CMAJ 1994 Apr 15;150(8):1255-1261.29,Koo A, Elsamadicy A, Lin IH, David W, Freedman IG, Isaac G, et al. Predictors of extended length of stay following treatment of unruptured adult cerebral aneurysms a study of the national inpatient sample. Neurosurgery 2020 Dec;67(Supplement_1). [CrossRef]30].

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1 discusses the requirements for the calculation and formulation of RIW []. Therefore, linear regression analysis has the potential to provide a quantifiable measure of the correlation between these variables, thereby meeting the third objective of the research paper, which is to conduct an in-depth economic analysis [].

To ensure that the results were not biased by confounding factors, the linear regression analyses were conducted separately for each age group, gender, and readmission column class [Auker-Howlett D. Evidence Evaluation and the Epistemology of Causality in Medicine. Canterbury, England: University of Kent; 2020.33]. This approach ensured that any potential effects of these variables were taken into account. Figure 3 demonstrates the approach used for the geographical data sets.

Figure 3. Geographic workflow: data collection (blue), data preparation and regression implementation (orange), and outputs (green). MCC: major complication or comorbidity.

Main and Controlled Geographic Data Set Variables

After the data were isolated for individuals aged >18 years and the MCC codes, Python pandas were used to condition the data set onto covariates. The entire data set was then placed into specific clusters based on this condition. First, individuals were clustered according to whether they had the same patient readmission column value, and then they were split by gender. Afterward, each data point was separated into age clusters. There were 2 gender data clusters for each of the 2 readmitted clusters and 15 age clusters for each of the 4 resulting clusters, resulting in 60 linear regressions being performed. To clarify, the main independent variable was ELOS, and the dependent variable was RIW. The data were split to verify the hypothesis that there was indeed an economic benefit to extending a patient’s length of stay rather than being readmitted.

Ethical Considerations

This study was exempt from research ethics review, as it was a secondary analysis of research data. As data were received directly from acute care facilities or from their respective health or regional authority or ministry or department of health, facilities in all provinces and territories except Quebec were required to report. The authors do not claim any right to the data, as they are the property of Statistics Canada along with the Abacus Student Network [Discharge abstract database metadata (DAD). Canadian Institute for Health Information. URL: https://www.cihi.ca/en/discharge-abstract-database-metadata-dad#:~:text=Overview,DAD%20to%20capture%20day%20surgery [accessed 2023-03-14] 16,Abacus data network. re3data.org. URL: http://doi.org/10.17616/R3692H [accessed 2023-03-21] 17].

The results of the main study are presented in this section. The results for the PCA, feature selection stages, and more data can be found in section B in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1.

Classification Reports

The evaluation metrics for the ensemble model were presented using classification reports (Table 2). In this context, class 0 represented the model’s performance for the negative class (ie, patients who did not return within 30 days), and class 1 represented the model’s performance for the positive class (ie, patients who did return within 30 days). The support column indicated how many examples of each class were there in the test set.

Table 2. Classification reports for different models.^a

Model type and class		Precision	Recall	F₁-score
XGBoost
	0^b	0.92	0.99	0.95
	1^c	0.79	0.31	0.44
Random forest
	0	0.93	0.97	0.95
	1	0.65	0.39	0.48
LightGBM
	0	0.96	0.91	0.93
	1	0.49	0.68	0.57
Ensemble model^d
	0	0.96	0.91	0.93
	1	0.49	0.68	0.57

^aAll of these models have been hyperparameter tuned.

^bFor all models, class 0 contains n=16,592.

^cFor all models, class 1 contains n=2079.

^dTuned submodels and tuned ensemble models.

Correlation Between Inpatient RIW and ELOS

A least squares linear regression model was fitted to the ELOS and RIW value columns of a geographical data set, and a summary of the best-fitted lines was obtained (Tables 3-5). The corresponding plots (Figures S6 and S7 in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1) and tables (-) produced by the least squares linear regression was also obtained, and the data were stratified by readmission status, age group, and gender. The coefficient of determination (R²) was included, and it took a value between 0 and 1, providing a sense of how correlated the 2 variables were, with a value of 1 indicating perfect correlation. Note that all age groups had a P<.001. The threshold was chosen as it was expected highly correlated and the root mean square error was used to measure the distance between predicted and actual values.

Table 3. Regression lines fitted for women who were readmitted within 30 days, separated by age groups.

Age group (years)	Slope (expected length of stay)	Intercept	R² adjusted	RMSE^a	F statistics	Sample size, n
18-24	0.196661	–0.038496	0.469363	1.222708	67.339641	76
25-29	0.236855	–0.032451	0.482341	2.067290	128.653235	138
30-34	0.195582	0.009696	0.308630	2.507248	69.299619	154
35-39	0.455326	–1.431780	0.587717	3.356237	213.402169	150
40-44	0.776407	–3.014587	0.573208	6.474491	284.385704	212
45-49	0.355912	–0.681188	0.626838	2.286141	572.132763	341
50-54	0.338375	–0.350636	0.656287	1.246735	990.071951	519
55-59	0.269042	0.021663	0.500513	1.631754	776.589838	775
60-64	0.266451	0.023355	0.401983	1.507652	755.872164	1124
65-69	0.361558	–0.433117	0.562777	1.813054	1821.045541	1415
70-74	0.346215	–0.373987	0.455203	2.111974	1393.857933	1668
75-79	0.280143	–0.088512	0.499735	1.388378	1795.095118	1797
>80	0.280403	–0.140523	0.321200	1.600127	1975.140727	4173

^aRMSE: root mean squared error.

Table 4. The 4 types of submodels.

Ensemble model	Tuned submodels (Y^a or N^b)	Tuned LR^c (Y or N)
1	N	N
2	N	Y
3	Y	N
4	Y	Y

^aY: yes.

^bN: no.

^cLR: logistic regression.

Table 5. Comparison of existing literature values.

Author name or literature values	Description of model	Comparison to current literature values with precision, recall, and F₁-score
Sharma et al [Sharma V, Kulkarni` V, McAlister F, Eurich D, Keshwani S, Simpson SH, et al. Predicting 30-day readmissions in patients with heart failure using administrative data: a machine learning approach. J Card Fail 2022 May;28(5):710-722 [FREE Full text] [CrossRef] [Medline]36]: “Predicting 30-Day Readmissions in Patients With Heart Failure Using Administrative Data: A Machine Learning Approach”	Sharma et al’s [Sharma V, Kulkarni` V, McAlister F, Eurich D, Keshwani S, Simpson SH, et al. Predicting 30-day readmissions in patients with heart failure using administrative data: a machine learning approach. J Card Fail 2022 May;28(5):710-722 [FREE Full text] [CrossRef] [Medline]36] implementation of XGBoost created a precision-recall curve. Their precision and recall balance for class 1 was significantly lower than that of the ensemble model. However, the ensemble model proposed in this work allows for high balance.	Sharma et al [Sharma V, Kulkarni` V, McAlister F, Eurich D, Keshwani S, Simpson SH, et al. Predicting 30-day readmissions in patients with heart failure using administrative data: a machine learning approach. J Card Fail 2022 May;28(5):710-722 [FREE Full text] [CrossRef] [Medline]36] used a precision-recall curve to evaluate the performance of their model. The bias-variance trade-off was observed to be high during the analysis. The primary evaluation metric used in the study was the AUC^a, which was not used for the proposed work here.
Jamei et al [Jamei M, Nisnevich A, Wetchler E, Sudat S, Liu E. Predicting all-cause risk of 30-day hospital readmission using artificial neural networks. PLoS One 2017 Jul 14;12(7):e0181173 [FREE Full text] [CrossRef] [Medline]37]: “Predicting All-Cause Risk of 30 Day Hospital Readmissions Using Artificial Neural Networks (ANN)”	Jamei et al [Jamei M, Nisnevich A, Wetchler E, Sudat S, Liu E. Predicting all-cause risk of 30-day hospital readmission using artificial neural networks. PLoS One 2017 Jul 14;12(7):e0181173 [FREE Full text] [CrossRef] [Medline]37] predicted patient readmission using a neural network. Their precision and recall balance was skewed, as the precision for their models was low, yet the recall was high. This results in high variance but low bias.	The following scores were given for the 2-layer neural network of Jamei et al [Jamei M, Nisnevich A, Wetchler E, Sudat S, Liu E. Predicting all-cause risk of 30-day hospital readmission using artificial neural networks. PLoS One 2017 Jul 14;12(7):e0181173 [FREE Full text] [CrossRef] [Medline]37], with the number of features being high: precision=23%, recall=59%, and F₁-score=16.5%. This indicates that the proposed model in this work has a significant advantage compared with an ANN^b.
Ho et al [Lin Ho ET, Tan IE, Lee I, Wu PY, Chong HF. Predicting readmission at early hospitalization using electronic health data: a customized model development. Int J Integr Care 2017 Oct 17;17(5):A506. [CrossRef]38]: “Predicting Readmission at Early Hospitalization Using Electronic Health Data: A Customized Model Development”	Ho et al [Lin Ho ET, Tan IE, Lee I, Wu PY, Chong HF. Predicting readmission at early hospitalization using electronic health data: a customized model development. Int J Integr Care 2017 Oct 17;17(5):A506. [CrossRef]38] predicted a within a 24 month period. The model they used was an XGBoost Model having access to specific laboratory data in addition to the variables addressed in our work.	The following scores were present in the readmission stage: recall score of 80% and a precision score of 76%. Although these scores may be higher overall due to the presence of more personalized data such as specific laboratory results for each patient. Furthermore, Ho et al [Lin Ho ET, Tan IE, Lee I, Wu PY, Chong HF. Predicting readmission at early hospitalization using electronic health data: a customized model development. Int J Integr Care 2017 Oct 17;17(5):A506. [CrossRef]38], does not seem to stratify based on specific diseases which could result in bias effecting this score.

^aAUC: area under the curve.

^bANN: artificial neural networks.

Table 6. Regression lines fitted for men who were not readmitted within 30 days, separated by age groups.

Age group (years)	Slope (expected length of stay)	Intercept	R² adjusted	RMSE^a	F statistics	Sample size, n
18-24	0.133004	0.603021	0.301397	1.165577	228.362523	528
25-29	0.332606	–0.276038	0.614197	1.609255	718.990166	452
30-34	0.492425	–1.069907	0.697588	2.279228	1521.145371	660
35-39	0.447525	–0.844292	0.653325	2.407347	1902.504340	1010
40-44	0.466519	–0.840117	0.647550	2.075903	3118.868054	1698
45-49	0.380460	–0.308439	0.583264	1.770296	3957.666828	2828
50-54	0.420954	–0.557944	0.626193	2.074092	7922.913006	4730
55-59	0.410799	–0.440937	0.570193	2.223704	9108.272364	6866
60-64	0.421378	–0.502756	0.471093	2.769979	7587.014726	8518
65-69	0.341375	–0.053999	0.514460	2.144153	10,030.832915	9467
70-74	0.327909	–0.015719	0.476918	1.992737	8977.140732	9846
75-79	0.331579	–0.114296	0.447201	2.177262	7031.801936	8692
>80	0.296201	–0.082061	0.269552	2.414906	6006.483269	16,275

^aRMSE: root mean squared error.

Table 7. Regression lines fitted for men who were readmitted within 30 days, separated by age groups.

Age group (years)	Slope (expected length of stay)	Intercept	R² adjusted	RMSE^a	F statistics	Sample size, n
18-24	0.13304	–0.123251	0.659408	1.936411	159.756983	83
25-29	0.434780	–1.444296	0.563547	3.009955	123.663857	96
30-34	0.205049	0.338259	0.414037	1.963718	109.108782	154
35-39	0.503076	–1.434408	0.597649	3.516315	387.201489	261
40-44	0.319638	–0.426751	0.633530	1.556234	708.053550	410
45-49	0.321520	–0.251261	0.543536	2.238206	883.346712	742
50-54	0.305131	–0.157782	0.546973	1.329511	1546.437115	1281
55-59	0.336327	–0.291609	0.516734	1.844321	1963.077772	1836
60-64	0.387830	–0.541003	0.524773	1.978364	2742.875413	2484
65-69	0.356209	–0.334169	0.526643	1.919789	3103.957220	2790
70-74	0.315150	–0.190539	0.503276	1.686930	2989.908128	2951
75-79	0.330981	–0.242378	0.515653	1.935420	2699.848355	2536
>80	0.320212	–0.272116	0.388353	1.864184	2708.979941	4266

^aRMSE: root mean squared error.

Table 8. Regression lines fitted for women who were not readmitted within 30 days, separated by age groups.

Age group (years)	Slope (expected length of stay)	Intercept	R² adjusted	RMSE^a	F statistics	Sample size, n
18-24	0.290987	–0.168320	0.650458	1.837020	733.188372	306
25-29	0.340779	–0.378737	0.570807	2.726389	589.170872	445
30-34	0.324827	–0.284349	0.584901	2.048568	789.076632	562
35-39	0.368889	–0.515399	0.631981	1.725110	1102.474089	644
40-44	0.253023	0.147074	0.531883	1.569319	1070.319261	944
45-49	0.324630	–0.232875	0.627709	1.493320	2394.218647	1422
50-54	0.301186	–0.073365	0.478817	1.655429	2019.326558	2200
55-59	0.389959	–0.484006	0.576303	1.886088	4468.179114	3287
60-64	0.339517	–0.190579	0.422514	2.111166	3013.638447	4121
65-69	0.297055	–0.029449	0.504601	1.616991	5405.580063	5309
70-74	0.333896	–0.262753	0.482080	2.016495	5622.958698	6043
75-79	0.348289	–0.358721	0.439302	2.057815	5139.697471	6562
>80	0.266803	–0.034627	0.179332	2.489782	4115.368718	18,835

^aRMSE: root mean squared error.

The proposed work aimed to use ensemble models and linear regressions for predicting patient readmissions and analyzing their economic consequences [Bach J. Causality in medicine. C R Biol 2019 Mar;342(3-4):55-57 [FREE Full text] [CrossRef] [Medline]32,Auker-Howlett D. Evidence Evaluation and the Epistemology of Causality in Medicine. Canterbury, England: University of Kent; 2020.33]. The results of this study demonstrate the potential of these models to accurately predict readmissions with a balanced degree of recall and precision, which could help health care providers identify patients who are at risk of readmission and take proactive measures to prevent it.

Notes About the Study

Although the study used cutting-edge algorithms for classification and regression, there are several critical notes that must be considered [Xing Y, Macq B. Improvement of Bragg peak shift estimation using dimensionality reduction techniques and predictive linear modeling. In: Proceedings of the 13th International Symposium on Medical Information Processing and Analysis. 2017 Presented at: 13th International Symposium on Medical Information Processing and Analysis; Oct 5-7, 2017; San Andres Island, Colombia. [CrossRef]39]. The primary evaluation metrics for the models were recall and F₁-scores, with a slight preference for false positives over false negatives to decrease the likelihood of unplanned readmissions [Hong AJ, Bratvold RB, Nævdal G. Robust production optimization with capacitance-resistance model as precursor. Comput Geosci 2017 Jun 24;21(5-6):1423-1442. [CrossRef]40]. However, it is crucial to note that this approach may not be suitable for all health care scenarios and should be evaluated on a case-by-case basis [Lei W, Wang J. Dynamic Stacking ensemble monitoring model of dam displacement based on the feature selection with PCA-RF. J Civil Struct Health Monit 2022 Mar 24;12(3):557-578. [CrossRef]41].

Another crucial consideration is the computational cost associated with clinical and graphical data [Lei W, Wang J. Dynamic Stacking ensemble monitoring model of dam displacement based on the feature selection with PCA-RF. J Civil Struct Health Monit 2022 Mar 24;12(3):557-578. [CrossRef]41]. Although the analysis for this study only took 2 to 3 hours, it is essential to consider the computational requirements for more substantial studies, particularly those with larger data sets or more complex models [Indrasiri PL, Lee E, Rupapara V, Rustam F, Ashraf I. Malicious traffic detection in IoT and local networks using stacked ensemble classifier. Comput MaterialContinua 2021 Nov;71(1):489-515. [CrossRef]42]. The computational cost may impact the feasibility of the study, and efficient models may be necessary to ensure valid and reliable results [Indrasiri PL, Lee E, Rupapara V, Rustam F, Ashraf I. Malicious traffic detection in IoT and local networks using stacked ensemble classifier. Comput MaterialContinua 2021 Nov;71(1):489-515. [CrossRef]42].

In addition, some features in the geographical data, such as the case mix group diagnosis type, could not be split in the geographical data sets because of their high computational cost. This could lead to omitted variable bias and negatively affect the models’ accuracy [Zulfiker M, Kabir N, Biswas AA, Chakraborty P. Predicting insomnia using multilayer stacked ensemble model. In: Advances in Computing and Data Sciences. Cham: Springer; 2021.43]. As the impact of not splitting these features was not taken into account in this study, future research should carefully evaluate the potential impact of not splitting features and consider alternatives to reduce the computational cost [Zulfiker M, Kabir N, Biswas AA, Chakraborty P. Predicting insomnia using multilayer stacked ensemble model. In: Advances in Computing and Data Sciences. Cham: Springer; 2021.43].

Clinical Data Set Result Analysis

In this section, the clinical data set results are analyzed and compared with those of other existing models in the literature.

The Effect of PCA on the Study and the Bias-Variance Trade-off

The use of PCA offered several advantages. The selection of the components that describe the minimum number of features required to achieve a cumulative variance of at least 80% proved to be effective in preventing overfitting [Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]31,Allam A, Nagy M, Thoma G, Krauthammer M. Neural networks versus Logistic regression for 30 days all-cause readmission prediction. Sci Rep 2019 Jun 26;9(1):9277 [FREE Full text] [CrossRef] [Medline]44-Altman N, Krzywinski M. The curse(s) of dimensionality. Nat Method 2018 Jun 31;15(6):399-400. [CrossRef] [Medline]46]. The data set had high dimensionality and a substantial number of data points, which would have led to high bias and low variance without the use of PCA [Xing Y, Macq B. Improvement of Bragg peak shift estimation using dimensionality reduction techniques and predictive linear modeling. In: Proceedings of the 13th International Symposium on Medical Information Processing and Analysis. 2017 Presented at: 13th International Symposium on Medical Information Processing and Analysis; Oct 5-7, 2017; San Andres Island, Colombia. [CrossRef]39]. This, in turn, would have resulted in a lower precision rate than recall rate. However, PCA prevented this issue by reducing the number of features in the model and substantially increasing computational efficiency [Tiwari A, Chugh A, Sharma A. Ensemble framework for cardiovascular disease prediction. Comput Biol Med 2022 Jul;146:105624. [CrossRef] [Medline]47].

Moreover, PCA eliminated the potential for collinearity, which can create unstable and unreliable estimates of the model parameters [Xing Y, Macq B. Improvement of Bragg peak shift estimation using dimensionality reduction techniques and predictive linear modeling. In: Proceedings of the 13th International Symposium on Medical Information Processing and Analysis. 2017 Presented at: 13th International Symposium on Medical Information Processing and Analysis; Oct 5-7, 2017; San Andres Island, Colombia. [CrossRef]39]. Collinearity makes it difficult to determine the unique contribution of each variable to the outcome [Lei W, Wang J. Dynamic Stacking ensemble monitoring model of dam displacement based on the feature selection with PCA-RF. J Civil Struct Health Monit 2022 Mar 24;12(3):557-578. [CrossRef]41]. Upon computing the covariance matrix and performing an eigenvector decomposition, the resulting eigenvectors were orthogonal to each other, thereby eliminating the presence of collinearity.

Furthermore, the implementation of PCA in conjunction with stacked classifiers enabled a higher interpretability of the models [Indrasiri PL, Lee E, Rupapara V, Rustam F, Ashraf I. Malicious traffic detection in IoT and local networks using stacked ensemble classifier. Comput MaterialContinua 2021 Nov;71(1):489-515. [CrossRef]42]. Stacked models can be challenging to interpret in high-dimensional data, as the layers can contribute to a high level of complexity [Zulfiker M, Kabir N, Biswas AA, Chakraborty P. Predicting insomnia using multilayer stacked ensemble model. In: Advances in Computing and Data Sciences. Cham: Springer; 2021.43]. Moreover, the curse of dimensionality and collinearity can make it difficult for models to isolate specific features, thereby decreasing transparency [Zulfiker M, Kabir N, Biswas AA, Chakraborty P. Predicting insomnia using multilayer stacked ensemble model. In: Advances in Computing and Data Sciences. Cham: Springer; 2021.43]. However, the addition of PCA allowed for a more comprehensive and explained model, as reflected in the submodel and ensemble model analyses in the subsequent sections.

Submodel Analyses

This study found that although the hyperparameter-tuned XGB model outperformed its base model, it was still less accurate than the other individual submodels. This result is consistent with a previous study conducted in Alberta that also found that XGB models did not provide substantial information on patient readmissions [Sharma V, Kulkarni` V, McAlister F, Eurich D, Keshwani S, Simpson SH, et al. Predicting 30-day readmissions in patients with heart failure using administrative data: a machine learning approach. J Card Fail 2022 May;28(5):710-722 [FREE Full text] [CrossRef] [Medline]36]. However, the tuned XGB model performed better than its base model and had a higher precision and recall score, indicating a better balance between precision and recall for both classes relative to the default XGB model.

By contrast, both the tuned random forest and LGBM models (Tables 6 and 7, respectively) demonstrated superior performance compared with their base models (Table S1 in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1) in predicting patient readmission for class 1, as evidenced by their higher F₁-score and precision. The recall for class 1 was lower for the tuned random forest model, whereas it was higher for the tuned LGBM model. LGBM was shown to balance a slightly higher recall rate and precision rate than its other decision tree counterparts, allowing it to provide substantial information regarding the use of this model.

Final Estimator Analysis

The ensemble model was created to ensure minimization and offset bias and variance between each of the models in discussion [Allam A, Nagy M, Thoma G, Krauthammer M. Neural networks versus Logistic regression for 30 days all-cause readmission prediction. Sci Rep 2019 Jun 26;9(1):9277 [FREE Full text] [CrossRef] [Medline]44,Lone NI, Lee R, Salisbury L, Donaghy E, Ramsay P, Rattray J, et al. Predicting risk of unplanned hospital readmission in survivors of critical illness: a population-level cohort study. Thorax 2019 Nov 05;74(11):1046-1054 [FREE Full text] [CrossRef] [Medline]45]. The 4 types of ensemble models and their classification reports are listed in Table 4 and Table S2 in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1, respectively.

Upon analyzing the data, it was observed that the default model configuration, which consisted of default submodels and a default final estimator logistic regression, exhibited high precision (0.92) and recall (0.98) for nonreadmitted patients (class 0). However, its ability to predict readmissions (class 1) was comparatively weaker, as evidenced by the lower F₁-score (0.46), precision (0.69), and recall (0.35) for class 1.

The second configuration, which used default submodels with a tuned logistic regression final estimator, demonstrated an improvement in the F₁-score (0.56) for class 1. Nonetheless, its precision (0.47) and recall (0.68) for class 1 remained lower than those for class 0.

The third configuration, which used tuned submodels with a default final estimator logistic regression, yielded high precision (0.92) and recall (0.99) for class 0. However, its performance in predicting readmissions (class 1) was weaker, with a precision of 0.77 and recall of 0.30, leading to an F₁-score of 0.43.

The fourth configuration, in which both submodels and final estimator logistic regression were tuned, resulted in the highest F₁-score (0.57) for class 1, indicating a better performance in predicting patient readmissions. Nevertheless, its precision (0.49) and recall (0.68) for class 1 remained lower than those for class 0.

The overall tuned ensemble model, when compared with the submodels, is identical to the LGBM model, as although recall is favored, the balance between precision and recall for class 1, compared with the other models, is useful in preventing too many false positives from occurring.

Comparison of Tuned Ensemble Models With Literature Value Predictions

The results of this study are not comparable with Baruah’s [Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09] 8] values because of the presence of unstructured data types, which cannot serve as a useful comparison to ordered data sets such as the DAD. However, other studies have used the DAD or other similar structured data before. The existing literature review comparisons with 30-day short-term studies are presented in Table 5.

Note that this list is not exhaustive and that there may be other studies that potentially use stacking classifier models and show better results. The comparison with other studies shows that the model has the potential to be viable and robust, but more tuning and comparison between submodels need to be performed.

Limitations of the Clinical Data Set Analysis

One notable limitation of the clinical data set used in this study was the high class imbalance problem. Specifically, there were considerably more training points for class 0 than for class 1, with n=83,083 for class 0 and n=10,271 for class 1. This issue could have led to the trained model being more prone to producing false negatives than to producing false positives, as it was more familiar with class 0 instances and thus had a tendency to classify more instances as class 0 [Artetxe A, Graña M, Beristain A, Ríos S. Balanced training of a hybrid ensemble method for imbalanced datasets: a case of emergency department readmission prediction. Neural Comput Appl 2017 Oct 14;32(10):5735-5744. [CrossRef]48,Du G, Zhang J, Luo Z, Ma F, Ma L, Li S. Joint imbalanced classification and feature selection for hospital readmissions. Knowl Base Syst 2020 Jul;200:106020. [CrossRef]49]. Consequently, this limitation could have negatively impacted the overall performance and accuracy of the model, as well as the reliability of the predictions it produced [Bukhari AA. 30-days All-cause Prediction Model for Readmissions for Heart Failure Patients A Comparative Study of Machine Learning Approaches. Boston, Massachusetts: Northeastern University; Dec 2019.50].

Another limitation of the data set was the encoding of the data, which could have influenced the interpretability and accuracy of the model. Specifically, if the model interpreted the encoded data as ordinal, it could have altered the ordinality of the classifier, thereby influencing the classification results. This limitation could have impacted the ability of the model to identify the most relevant features for predicting patient readmission, reducing its interpretability [Du G, Zhang J, Luo Z, Ma F, Ma L, Li S. Joint imbalanced classification and feature selection for hospital readmissions. Knowl Base Syst 2020 Jul;200:106020. [CrossRef]49]. Moreover, this limitation could have adversely impacted the accuracy of the model, as the model may have learned from the encoded data instead of the underlying features, resulting in a less accurate prediction of patient readmission [Bukhari AA. 30-days All-cause Prediction Model for Readmissions for Heart Failure Patients A Comparative Study of Machine Learning Approaches. Boston, Massachusetts: Northeastern University; Dec 2019.50].

Finally, the data set’s lack of information about the specific principal component that contributed to the accurate prediction of the patient data set was another limitation. This limitation could have constrained the model’s ability to explain how the variables were associated with patient readmission, resulting in a lack of transparency in the model’s predictions and reduced ability to elucidate the rationale behind its decision-making process. As such, identifying the principal components that contribute to the accurate prediction of the patient data set is critical to improving the interpretability and reliability of the model.

Geographical Data Set Result Analysis

Causality of the Linear Regression Model

The study results suggested that the model could potentially establish a causal relationship (albeit with a proper regression type) between ELOS and RIW. The anticipated hypothesis was well supported by the tables presented earlier, indicating the importance of the model. The analysis involved an explicit model of a continuous outcome (RIW) that was affected by a measured continuous variable (ELOS), and the results showed a notable impact. This finding encourages the establishment of causality in the relationship between ELOS and RIW.

ELOS Effects on RIW and Fit of the Linear Regression

The relationship between ELOS and RIW was investigated through a linear regression analysis, which produced the coefficient (slope) from the ELOS variables. The study findings indicated that more resources were expended and more time was spent among women aged 40 to 44 years who were readmitted than among those who were not readmitted. In addition, more resources were expended for men aged 35 to 39 years (Table 7) who were readmitted than for their nonreadmitted counterparts. Surprisingly, most of the slopes associated with ELOS are uniform in nature and are approximately the same across ages. However, a comparison between the results also suggested that ELOS had a significant effect on RIW owing to the low P values.

The F test of overall significance was used to ascertain that the model was better suited than a model with no independent variables [Hubbard A, Munoz I, Decker A, Holcomb JB, Schreiber MA, Bulger EM, PROMMTT Study Group. Time-dependent prediction and evaluation of variable importance using superlearning in high-dimensional clinical data. J Trauma Acute Care Surg 2013 Jul;75(1 Suppl 1):S53-S60 [FREE Full text] [CrossRef] [Medline]51,Sureiman O, Mangera C. F-test of overall significance in regression analysis simplified. J Pract Cardiovasc Sci 2020;6:116-122.52]. All the models had F statistic values significantly greater than their critical F values, which suggested that the linear regression model was a relatively accurate estimate of the relationship between ELOS and RIW.

However, the root mean squared error and R² values suggested otherwise. There was a high degree of error compared with the slope. The low R² values across all the studies implied that linear regression was not a good fit, which could imply that further data clustering into groups was necessary or that further manipulation of the data to perform a different regression was needed. These results were reasonable, considering that the function was not 1-to-1, as demonstrated by the graphs in Figures S6 and S7 in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1.

Future Directions

Many fundamental aspects of both the ensemble model and linear regression remain unexplored.

Therefore, the suggested future implementations for the ensemble model are as follows:

Including unstructured data (such as clinical data and text notes) in analysis by a deep neural network and performing logistic regression on all the models to give individuality to a specific patient [Liu S, Wang Y, Wen A, Wang L, Hong N, Shen F, et al. Implementation of a cohort retrieval system for clinical data repositories using the observational medical outcomes partnership common data model: proof-of-concept system validation. JMIR Med Inform 2020 Oct 06;8(10):e17376 [FREE Full text] [CrossRef] [Medline]18].
Using deep learning neural networks as a final estimator for the ensemble model and outputting evaluation metrics [Rai HM, Chatterjee K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. Appl Intell 2021 Aug 11;52(5):5366-5384. [CrossRef]53].
Adding more submodels and optimizing for computational resources such as space, time, and memory [Rai HM, Chatterjee K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. Appl Intell 2021 Aug 11;52(5):5366-5384. [CrossRef]53].

The suggested improvements for the linear regression include are as follows:

An instrumental variable that measures the relationship between ELOS and a selection decision variable should be implemented. The instrumental variables should only be involved in the selection decision process. Afterward, the relationship between RIW and the selection decision variable should be measured to ensure low omitted variable biases [Dunn A. Health insurance and the demand for medical care: instrumental variable estimates using health insurer claims data. J Health Econ 2016 Jul;48:74-88. [CrossRef] [Medline]54].
Logistic regression (logistic by the coefficients) should be performed to ensure that root mean squared error is minimized and a more accurate relationship between the ELOS and RIW can be derived [Lin H. Revisiting the relationship between nurse staffing and quality of care in nursing homes: an instrumental variables approach. J Health Econ 2014 Sep;37:13-24. [CrossRef] [Medline]55].

These applications can allow for a more in-depth analysis and provide a multifaceted perspective in the fields of ML, econometrics, and health care interventions.

Conclusions

The study’s implications are to validate the use of hybrid ensemble models and attempt to predict economic cost prediction models. The availability of robust and efficient predictive models, such as the one presented in this study, can enable hospitals to focus more on patients and less on the utility and bureaucratic costs associated with their readmission. As demonstrated by the evaluation metrics, the ensemble model plays a critical role in ensuring more precise results overall. By implementing a crowdsourcing approach, the model can also estimate the resources required to control future epidemics in an easier, time-sensitive manner while maintaining low economic costs. This is particularly relevant in decentralized, universal, publicly funded countries such as Canada, where high inflation on medical equipment, technologies, and maintenance has been observed in the aftermath of the COVID-19 pandemic.

Predicting the relationship between ELOS and RIW can also indirectly predict patient outcomes by reducing bureaucratic and utility costs, thereby reducing the cost burden placed on patients to implement administrative tasks and on physicians to ensure their execution. The ensemble model also considers the specific disease type, and the encoding process has resulted in the classification data being ordinal in nature, which takes into account patient utility in addition to risk stratification.

The linear regression has considered the differences in continuous variables while also allowing for a clear difference in the clustered groups. Further exploration of the cost-benefit economic model can enable hospitals to ensure more cost-free, patient-friendly outcomes. It is recommended that after making several changes to the general ensemble model and the linear regressions, they be used to analyze new and incoming numerical hospital cost data.

Acknowledgments

The authors express their profound gratitude to Adrian Stanley from JMIR for his unwavering support and to Benjamin D Fedoruk from STEM Fellowship for his invaluable assistance in ideation and manuscript. This research would not have been possible without the generous support of the sponsors from the 2022 Inter-University Big Data Challenge, including JMIR Publications, Roche, Statistical Analysis System Institute Inc, Canadian Science Publishing, Digital Science, and Overleaf, whose contributions enabled the authors to conduct this groundbreaking research.

This manuscript received first place in the STEM Fellowship Big Data Challenge Inter-University Innovation Award, which was sponsored by JMIR Publications. JMIR Publications provided APF support for the publication of this paper.

Data Availability

All codes have been made available by the authors in

Multimedia Appendix 1

DOCX File , 1057 KB Multimedia Appendix 1. The data set used in this study was obtained from Statistics Canada and is subject to copyright owned by Statistics Canada.

Authors' Contributions

ER assumed leadership in the administration, drafting, and computational efforts for this manuscript. KN and QG made equal contributions to the programming and algorithm development, demonstrating their expertise and commitment to this project. SR and JP contributed equally to the drafting of the manuscript.

Conflicts of Interest

None declared.

‎

Multimedia Appendix 1

DOCX File , 1057 KB

Goldfield NI, McCullough EC, Hughes JS, Tang AM, Eastman B, Rawlins LK, et al. Identifying potentially preventable readmissions. Health Care Financ Rev 2008;30(1):75-91 [FREE Full text] [Medline]
Samsky MD, Ambrosy AP, Youngson E, Liang L, Kaul P, Hernandez AF, et al. Trends in readmissions and length of stay for patients hospitalized with heart failure in Canada and the United States. JAMA Cardiol 2019 May 01;4(5):444-453 [FREE Full text] [CrossRef] [Medline]
Brahmania M, Wiskar K, Walley KR, Celi LA, Rush B. Lower household income is associated with an increased risk of hospital readmission in patients with decompensated cirrhosis. J Gastroenterol Hepatol 2021 Apr 14;36(4):1088-1094 [FREE Full text] [CrossRef] [Medline]
Hellsten E, Liu G, Yue E, Gao G, Sutherland JM. Improving hospital quality through payment reforms: a policy impact analysis in British Columbia. Healthc Manage Forum 2016 Jan 08;29(1):33-38. [CrossRef] [Medline]
Cropley S. The relationship-based care model: evaluation of the impact on patient satisfaction, length of stay, and readmission rates. J Nurs Adm 2012 Jun;42(6):333-339. [CrossRef] [Medline]
Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC Med Res Methodol 2021 May 06;21(1):96 [FREE Full text] [CrossRef] [Medline]
Wang H, Cui Z, Chen Y, Avidan M, Abdallah AB, Kronzer A. Predicting hospital readmission via cost-sensitive deep learning. IEEE/ACM Trans Comput Biol Bioinf 2018 Nov 1;15(6):1968-1978. [CrossRef]
Baruah P. Predicting Hospital Readmission using Unstructured Clinical Note Data. Thesis. Brown University. 2020. URL: https://cs.brown.edu/research/pubs/theses/ugrad/2020/baruah.prakrit.pdf [accessed 2023-05-09]
Kripalani S, Theobald CN, Anctil B, Vasilevskis EE. Reducing hospital readmission rates: current strategies and future directions. Annu Rev Med 2014 Jan 14;65(1):471-485 [FREE Full text] [CrossRef] [Medline]
Ikemura K, Bellin E, Yagi Y, Billett H, Saada M, Simone K, et al. Using automated machine learning to predict the mortality of patients with COVID-19: prediction model development study. J Med Internet Res 2021 Feb 26;23(2):e23458 [FREE Full text] [CrossRef] [Medline]
Ko H, Chung H, Kang WS, Park C, Kim DW, Kim SE, et al. An artificial intelligence model to predict the mortality of COVID-19 patients at hospital admission time using routine blood samples: development and validation of an ensemble model. J Med Internet Res 2020 Dec 23;22(12):e25442 [FREE Full text] [CrossRef] [Medline]
Kansagara D, Englander H, Salanitro A, Kagen D, Theobald C, Freeman M, et al. Risk prediction models for hospital readmission: a systematic review. JAMA 2011 Oct 19;306(15):1688-1698 [FREE Full text] [CrossRef] [Medline]
Ben-Assuli O, Padman R. Trajectories of repeated readmissions of chronic disease patients: risk stratification, profiling, and prediction. MIS Q 2020 Jan 01;44(1):201-226. [CrossRef]
Hansen KN, Morbitzer KA, Waldron KM, Amerine LB. Development of novel formulas to determine hospital and pharmacy opportunities to reduce extended length of stay. J Pharmacy Technol 2016 Nov 14;33(1):15-22. [CrossRef]
Barrett B, Way C, McDonald J, Parfrey P. Hospital utilization, efficiency and access to care during and shortly after restructuring acute care in Newfoundland and Labrador. J Health Serv Res Policy 2005 Oct;10 Suppl 2:S2:31-S2:37. [CrossRef] [Medline]
Discharge abstract database metadata (DAD). Canadian Institute for Health Information. URL: https://www.cihi.ca/en/discharge-abstract-database-metadata-dad#:~:text=Overview,DAD%20to%20capture%20day%20surgery [accessed 2023-03-14]
Abacus data network. re3data.org. URL: http://doi.org/10.17616/R3692H [accessed 2023-03-21]
Liu S, Wang Y, Wen A, Wang L, Hong N, Shen F, et al. Implementation of a cohort retrieval system for clinical data repositories using the observational medical outcomes partnership common data model: proof-of-concept system validation. JMIR Med Inform 2020 Oct 06;8(10):e17376 [FREE Full text] [CrossRef] [Medline]
Nidoi J, Muttamba W, Walusimbi S, Imoko JF, Lochoro P, Ictho J, et al. Impact of socio-economic factors on tuberculosis treatment outcomes in north-eastern Uganda: a mixed methods study. BMC Public Health 2021 Nov 26;21(1):2167 [FREE Full text] [CrossRef] [Medline]
Xu Q, Ni S, Wu F, Liu F, Ye X, Mougin B, et al. Investigation of variation in gene expression profiling of human blood by extended principle component analysis. PLoS One 2011 Oct 27;6(10):e26905 [FREE Full text] [CrossRef] [Medline]
Zhang W, Cheng L, Huang G. Towards fine-scale population stratification modeling based on kernel principal component analysis and random forest. Genes Genomics 2021 Oct 07;43(10):1143-1155. [CrossRef] [Medline]
Chiu C, Wu C, Chien T, Kao L, Li C, Jiang H. Applying an improved stacking ensemble model to predict the mortality of ICU patients with heart failure. J Clin Med 2022 Oct 31;11(21):6460 [FREE Full text] [CrossRef] [Medline]
Akbar N, Sunyoto A, Arief M, Caesarendra W. Improvement of decision tree classifier accuracy for healthcare insurance fraud prediction by using Extreme Gradient Boosting algorithm. In: Proceedings of the International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS). 2020 Presented at: International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS); Nov 19-20, 2020; Jakarta, Indonesia. [CrossRef]
Kadiyala A, Kumar A. Applications of python to evaluate the performance of decision tree-based boosting algorithms. Environ Prog Sustainable Energy 2018 Mar 01;37(2):618-623. [CrossRef]
Liang Y, Zheng W, Lee W. Nonlinear associations between medical expenditure, perceived medical attitude, and sociodemographics, and older adults' self-rated health in China: applying the extreme gradient boosting model. Healthcare (Basel) 2021 Dec 26;10(1):39 [FREE Full text] [CrossRef] [Medline]
El-Rashidy N, El-Sappagh S, Abuhmed T, Abdelrazek S, El-Bakry HM. Intensive care unit mortality prediction: an improved patient-specific stacking ensemble model. IEEE Access 2020;8:133541-133564. [CrossRef]
Pfob A, Lu S, Sidey-Gibbons C. Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison. BMC Med Res Methodol 2022 Nov 01;22(1):282 [FREE Full text] [CrossRef] [Medline]
Owen L. Hyperparameter Tuning with Python Boost Your Machine Learning Model's Performance Via Hyperparameter Tuning. Birmingham: Packt Publishing; 2022.
Pink GH, Bolley H. Physicians in health care management: 4. Case mix groups and resource intensity weights: physicians and hospital funding. CMAJ 1994 Apr 15;150(8):1255-1261.
Koo A, Elsamadicy A, Lin IH, David W, Freedman IG, Isaac G, et al. Predictors of extended length of stay following treatment of unruptured adult cerebral aneurysms a study of the national inpatient sample. Neurosurgery 2020 Dec;67(Supplement_1). [CrossRef]
Handelman GS, Kok HK, Chandra RV, Razavi AH, Huang S, Brooks M, et al. Peering into the black box of artificial intelligence: evaluation metrics of machine learning methods. AJR Am J Roentgenol 2019 Jan;212(1):38-43. [CrossRef] [Medline]
Bach J. Causality in medicine. C R Biol 2019 Mar;342(3-4):55-57 [FREE Full text] [CrossRef] [Medline]
Auker-Howlett D. Evidence Evaluation and the Epistemology of Causality in Medicine. Canterbury, England: University of Kent; 2020.
van Oostveen CJ, Gouma DJ, Bakker PJ, Ubbink DT. Quantifying the demand for hospital care services: a time and motion study. BMC Health Serv Res 2015 Jan 22;15(1):15 [FREE Full text] [CrossRef] [Medline]
Spithoff S, Stockdale J, Rowe R, McPhail B, Persaud N. The commercialization of patient data in Canada: ethics, privacy and policy. CMAJ 2022 Jan 24;194(3):E95-E97 [FREE Full text] [CrossRef] [Medline]
Sharma V, Kulkarni` V, McAlister F, Eurich D, Keshwani S, Simpson SH, et al. Predicting 30-day readmissions in patients with heart failure using administrative data: a machine learning approach. J Card Fail 2022 May;28(5):710-722 [FREE Full text] [CrossRef] [Medline]
Jamei M, Nisnevich A, Wetchler E, Sudat S, Liu E. Predicting all-cause risk of 30-day hospital readmission using artificial neural networks. PLoS One 2017 Jul 14;12(7):e0181173 [FREE Full text] [CrossRef] [Medline]
Lin Ho ET, Tan IE, Lee I, Wu PY, Chong HF. Predicting readmission at early hospitalization using electronic health data: a customized model development. Int J Integr Care 2017 Oct 17;17(5):A506. [CrossRef]
Xing Y, Macq B. Improvement of Bragg peak shift estimation using dimensionality reduction techniques and predictive linear modeling. In: Proceedings of the 13th International Symposium on Medical Information Processing and Analysis. 2017 Presented at: 13th International Symposium on Medical Information Processing and Analysis; Oct 5-7, 2017; San Andres Island, Colombia. [CrossRef]
Hong AJ, Bratvold RB, Nævdal G. Robust production optimization with capacitance-resistance model as precursor. Comput Geosci 2017 Jun 24;21(5-6):1423-1442. [CrossRef]
Lei W, Wang J. Dynamic Stacking ensemble monitoring model of dam displacement based on the feature selection with PCA-RF. J Civil Struct Health Monit 2022 Mar 24;12(3):557-578. [CrossRef]
Indrasiri PL, Lee E, Rupapara V, Rustam F, Ashraf I. Malicious traffic detection in IoT and local networks using stacked ensemble classifier. Comput MaterialContinua 2021 Nov;71(1):489-515. [CrossRef]
Zulfiker M, Kabir N, Biswas AA, Chakraborty P. Predicting insomnia using multilayer stacked ensemble model. In: Advances in Computing and Data Sciences. Cham: Springer; 2021.
Allam A, Nagy M, Thoma G, Krauthammer M. Neural networks versus Logistic regression for 30 days all-cause readmission prediction. Sci Rep 2019 Jun 26;9(1):9277 [FREE Full text] [CrossRef] [Medline]
Lone NI, Lee R, Salisbury L, Donaghy E, Ramsay P, Rattray J, et al. Predicting risk of unplanned hospital readmission in survivors of critical illness: a population-level cohort study. Thorax 2019 Nov 05;74(11):1046-1054 [FREE Full text] [CrossRef] [Medline]
Altman N, Krzywinski M. The curse(s) of dimensionality. Nat Method 2018 Jun 31;15(6):399-400. [CrossRef] [Medline]
Tiwari A, Chugh A, Sharma A. Ensemble framework for cardiovascular disease prediction. Comput Biol Med 2022 Jul;146:105624. [CrossRef] [Medline]
Artetxe A, Graña M, Beristain A, Ríos S. Balanced training of a hybrid ensemble method for imbalanced datasets: a case of emergency department readmission prediction. Neural Comput Appl 2017 Oct 14;32(10):5735-5744. [CrossRef]
Du G, Zhang J, Luo Z, Ma F, Ma L, Li S. Joint imbalanced classification and feature selection for hospital readmissions. Knowl Base Syst 2020 Jul;200:106020. [CrossRef]
Bukhari AA. 30-days All-cause Prediction Model for Readmissions for Heart Failure Patients A Comparative Study of Machine Learning Approaches. Boston, Massachusetts: Northeastern University; Dec 2019.
Hubbard A, Munoz I, Decker A, Holcomb JB, Schreiber MA, Bulger EM, PROMMTT Study Group. Time-dependent prediction and evaluation of variable importance using superlearning in high-dimensional clinical data. J Trauma Acute Care Surg 2013 Jul;75(1 Suppl 1):S53-S60 [FREE Full text] [CrossRef] [Medline]
Sureiman O, Mangera C. F-test of overall significance in regression analysis simplified. J Pract Cardiovasc Sci 2020;6:116-122.
Rai HM, Chatterjee K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. Appl Intell 2021 Aug 11;52(5):5366-5384. [CrossRef]
Dunn A. Health insurance and the demand for medical care: instrumental variable estimates using health insurer claims data. J Health Econ 2016 Jul;48:74-88. [CrossRef] [Medline]
Lin H. Revisiting the relationship between nurse staffing and quality of care in nursing homes: an instrumental variables approach. J Health Econ 2014 Sep;37:13-24. [CrossRef] [Medline]

‎

DAD: Discharge Abstract Database

ELOS: expected length of stay

ICD-10: International Classification of Diseases, 10th revision

LGBM: LightGBM

MCC: major complication or comorbidity

ML: machine learning

PCA: principal component analysis

RIW: resource intensity weight

XGB: XGBoost

Edited by A Mavragani; submitted 06.08.22; peer-reviewed by N Marotta, D Gartner; comments to author 22.11.22; revised version received 21.03.23; accepted 07.04.23; published 26.05.23

©Ethan Rajkumar, Kevin Nguyen, Sandra Radic, Jubelle Paa, Qiyang Geng. Originally published in JMIR Formative Research (https://formative.jmir.org), 26.05.2023.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.

This paper is in the following e-collection/theme issue:

Machine Learning and Causal Approaches to Predict Readmissions and Its Economic Consequences Among Canadian Patients With Heart Disease: Retrospective Study