Abstract
Background: Point-of-care ultrasonography has become a valuable tool for assessing diaphragmatic function in critically ill patients receiving invasive mechanical ventilation. However, conventional diaphragm ultrasound assessment remains highly operator-dependent and subjective. Previous research introduced automatic measurement of diaphragmatic excursion and velocity using 2D speckle-tracking technology.
Objective: This study aimed to develop an artificial intelligence–multimodal learning framework to improve the prediction of weaning failure and guide individualized weaning strategies.
Methods: This prospective study enrolled critically ill patients older than 18 years who received mechanical ventilation for more than 48 hours and were eligible for a spontaneous breathing trial in 2 intensive care units in Guangzhou, China. Before the spontaneous breathing trial, diaphragm ultrasound videos were collected using a standardized protocol, and automatic measurements of excursion and velocity were obtained. A total of 88 patients were included, with 50 successfully weaned and 38 experiencing weaning failure. Each patient record included 27 clinical and 6 diaphragmatic indicators, selected based on previous literature and phenotyping studies. Clinical variables were preprocessed using OneHotEncoder, normalization, and scaling. Ultrasound videos were interpolated to a uniform resolution of 224×224×96. Artificial intelligence–multimodal learning based on clinical characteristics, laboratory parameters, and diaphragm ultrasonic videos was established. Four experiments were conducted in an ablation setting to evaluate model performance using different combinations of input data: (1) diaphragmatic excursion only, (2) clinical and diaphragmatic indicators, (3) ultrasound videos only, and (4) all modalities combined (multimodal). Metrics for evaluation included classification accuracy, area under the receiver operating characteristic curve (AUC), average precision in the precision-recall curve, and calibration curve. Variable importance was assessed using SHAP (Shapley Additive Explanation) to interpret feature contributions and understand model predictions.
Results: The multimodal co-learning model outperformed all single-modal approaches. The accuracy improved when predicted through diaphragm ultrasound video data using Video Vision Transformer (accuracy=0.8095, AUC=0.852), clinical or ultrasound indicators (accuracy=0.7381, AUC=0.746), and the multimodal co-learning (accuracy=0.8331, AUC=0.894). The proposed co-learning model achieved the highest score (average precision=0.91) among the 4 experiments. Furthermore, calibration curve analysis demonstrated that the proposed colearning model was well calibrated, as the curve was closest to the perfectly calibrated line.
Conclusions: Combining ultrasound and clinical data for colearning improved the accuracy of the weaning outcome prediction. Multimodal learning based on automatic measurement of point-of-care ultrasonography and automated collection of objective clinical indicators greatly enhanced the practical operability and user-friendliness of the system. The proposed model offered promising potential for widespread clinical application in intensive care settings.
doi:10.2196/72482
Keywords
Introduction
Invasive mechanical ventilation (MV) is a life-saving therapeutic strategy in critically ill patients, yet 20%‐40% of invasive MV patients endure difficult weaning to varying extents []. WEAN-SAFE study, a prospective cohort of 10,232 invasive MV patients in 481 intensive care units (ICU) among 50 countries, showed that the incidence of difficult weaning, prolonged weaning, and weaning failure reached up to 35.3% []. Diaphragm dysfunction is a major concern in critically ill patients due to its potential association with prolonged MV dependency [,], as the diaphragm plays a central role in respiratory mechanics.
The use of point-of-care ultrasonography (POCUS) to assess the diaphragm has seen rapid growth within the ICU because it is readily available, portable, noninvasive, visual, easily repeatable, and free of ionizing radiation []. Previous systematic reviews have examined the accuracy and usefulness of diaphragmatic ultrasound indicators in predicting weaning outcomes, mainly focusing on diaphragm thickening fraction (DTF), excursion, and rapid shallow breathing index [-]. Encouragingly, our serial research, published in January 2023, developed an automatic measurement of diaphragm excursion and velocity based on 2D speckle-tracking technology, showing that the mean diaphragm excursion and velocity had a high diagnostic value on the weaning outcomes [].
In parallel, multimodal artificial intelligence (AI), or multimodal learning, enabling the AI/machine learning (ML) model to learn from and process multiple modes and types of data (image, text, audio, and video) rather than just one, is a rising trend and has the potential to reshape both the AI landscape and clinical research for its novel implications and benefits. By integrating AI into the evaluation process, clinicians can receive more accurate and consistent predictions, allowing for better decision-making. Furthermore, AI-guided prediction models can adapt to the individual characteristics of patients, offering personalized insights that improve the safety and efficacy of the weaning process [,].
To further extend our findings, this study aims to develop and validate a multimodal AI model that integrates clinical characteristics, laboratory parameters, and diaphragm ultrasound videos from the abovementioned cohort. This might provide multisystem and multiorgan morphological and functional status information, which helps predict weaning failure and guide the personalized, accurate weaning strategies in the ICU.
Methods
Study Design
Patients aged older than 18 years who received MV for more than 48 hours, suitable for a spontaneous breathing trial (SBT), were prospectively enrolled in ICUs within 2 hospitals in Guangzhou City, China. Ultrasound video and diaphragm excursion and velocity were acquired, followed by a standardized protocol using the TE7 Diagnostic Ultrasound System (C5-2 array probe, Shenzhen Mindray Biomedical) in a supine or semirecumbent position before SBT by a well-trained expert. Of the total 88 critically ill patients, 50 (57%) patients maintained spontaneous breathing for ≥48 h with no need for any level of ventilator support after extubating, while 38 (43%) patients failed to wean. Clinical indicators, diaphragm ultrasound video, and diaphragmatic indicators (using automatic speckle-tracking measurement) were collected to construct the clinical/diaphragmatic dataset and diaphragm ultrasound dataset.
Indicators Selection
Among the clinical or diaphragmatic indicators, each case contained 27 clinical and 6 diaphragmatic indicators (Table S1 of ). In this study, we selected height, weight, BMI, tidal volume, positive end-expiratory pressure, albumin, hemoglobin, age, temperature, systolic blood pressure, diastolic blood pressure, heart rate, respiratory rate, FiO2, PaO2, PaCO2, pH, serum sodium, serum potassium, white blood cell count, serum creatinine, serum calcium, serum magnesium, serum phosphorus, APACHE II score, SOFA (Sequential Organ Failure Assessment), and N-terminal pro-brain natriuretic peptide. In our previous retrospective study published in 2023 [], clinical phenotypes of ventilated ICU patients to predict outcomes were derived from the eICU (electronic intensive care unit) Collaborative Research Database cohort and validated in the MIMIC-IV (Medical Information Mart for Intensive Care) cohort. Four clinical phenotypes were identified, which correlate with different clinical characteristics and outcomes, including 28-day mortality and extubation success. A total of 24 clinical variables were selected, including patient demographics, vital signs, blood gas analysis, laboratory measurements, scores, and MV parameters for phenotyping. Further, 18 of 24 variables in the abovementioned study were collected in the present cohort. Demographics were collected as baseline data upon ICU admission. Arterial blood gas analysis was performed at the end of the SBT. Ventilator parameters were recorded during the ultrasound examination. Additionally, a systematic review identified 21 factors associated with increased risk for reintubation, including higher APACHE II scores, low hemoglobin, low albumin, and high brain natriuretic peptide []. A case was reported in 2022 of iatrogenic hypermagnesemia, which presented as respiratory depression, preventing weaning from MV following cardiac surgery []. Hypophosphatemia was associated with respiratory muscular weakness, leading to weaning failure [,]. SOFA score during the first 24 hours of ICU admission represented significant independent predictors of diaphragm dysfunction and thus led to difficulties weaning from MV []. Laboratory markers, APACHE II, and SOFA scores were collected within 24 h before extubation. Furthermore, 6 diaphragmatic indicators were calculated using automatic speckle-tracking measurements based on the diaphragm ultrasound video. The detailed method and material were published previously [].
Data
For the diaphragm ultrasound videos, they were recorded with a high resolution of 900*1100 pixels and the length ranged from 48 to 556 frames. The percentage of the videos of 48‐150 frames is 30% and videos of 350‐556 frames is 30%. The overview of the patients and the details of the ultrasound video frame were presented in . During data preprocessing, the numeric variables and the categorical variables within the clinical indicators were preprocessed separately using 1-hot encoding, scaling, and normalization methods. The ultrasound videos were interpolated to a uniform size of 224*224*96. The size was unified considering the diaphragmatic excursion cycle and the model structure. The interpolation method was adapted from the bilinear interpolation algorithm in the Python (Python Software Foundation) package SciPy. Additionally, the data were normalized to a specific range of (0,1) before being encoded by the model.
| Values | ||
| Age, mean (SD) | 73 (14) | |
| BMI, mean (SD) | 22.40 (3.12) | |
| Gender, n (%) | ||
| Male | 60 (68) | |
| Female | 28 (32) | |
| Weaning outcome, n (%) | ||
| Success | 50 (57) | |
| Failure | 38 (43) | |
| Ultrasound length (frame), n (%) | ||
| 0‐50 | 8 (9) | |
| 50‐150 | 17 (21) | |
| 150‐250 | 16 (25) | |
| 250‐350 | 28 (44) | |
| 350‐450 | 13 (21) | |
| 450‐560 | 6 (10) | |
Multimodal Colearning Model
A multimodal colearning model was then constructed (), which simultaneously encoded both the video and clinical data and subsequently predicted weaning success from the integrated features. In this case, features would come from both the ultrasound videos (eg, image-based features) and the clinical indicators. The co-learning model had 3 components. First, Video Vision Transformer (ViViT) [] was run as the ultrasound data encoder (), aiming at extracting the ultrasound features. It included a spatial information encoder and a temporal information encoder based on the attention mechanism. Moreover, the ViViT captured temporal and spatial information at the same time, and for long videos, it also realized long-range sequence interaction through an attention mechanism. A small-scale grid search was used for pretraining. The learning rate for ViViT was between 1e-5 and 1e-4. The dropout rate was 0.5 to 0.7. We experimented with batch sizes of 4, 8, and 16 and set weight decay between 1e-4 and 1e-2. We trained the model for 50 to 100 epochs and applied an early stopping strategy (patience) with a range of 10 to 15 epochs.
Second, for the clinical and diaphragmatic indicators, a multilayer perceptron model, consisting of a sequence of fully connected layers and dropout layers, was adopted for clinical data encoding. Finally, in the fusion module, the ultrasound and clinical features were integrated. A multilayer perceptron classifier was used to obtain the final weaning outcome. In the multilayer perceptron classifier, 2 dropout layers were applied to avoid overfitting.
In the 5-fold cross-validation process, the 88 cases were randomly divided into training sets (n=60) and testing sets (n=18) in the ratio of 8:2. In each fold, 60 cases were randomly chosen as the training set, while the test set was the remaining 18 cases. In each fold, the model was trained on the training set and tested on the test set. The final result was calculated using the average results in the 5 folds. In the training process, the early stopping method was managed to avoid overfitting and improve model generalization performance. Receiver operating characteristic–area under the curve (AUC), precision-recall curve, and calibration curve for deep learning models were used to evaluate the results of the variant models. The model was developed and fit using Keras (Google LLC). The hyperparameters were optimized in the training process, with balanced class weights, binary cross-entropy loss, and 0.005 learning rate using the Adam optimizer.
Variable importance analysis was conducted to identify the key features influencing the prediction of weaning outcomes and to assess their relative impact. In deep learning models, currently, no universally accepted sensitivity analysis method fits all types of models. Therefore, we used the SHAP (Shapley Additive Explanation) method, a powerful technique for explaining individual predictions made by complex models. SHAP was applied to the classification of clinical indicators. Categorical features were processed using OneHotEncoder, and continuous features were normalized. SHAP values provide a measure of the contribution of each feature to a particular prediction, quantifying both the magnitude and direction (positive or negative) of their influence on the model’s output. SHAP values enabled a deeper understanding of the model’s behavior and the sensitivity of predictions to changes in individual indicators []. It should be noted that in this study, we incorporated SHAP to understand the importance of different features and enhance the interpretability, rather than to investigate causal relationships in clinical scenarios. SHAP plots generally do not directly show statistical significance (such as P values). To observe whether there is statistical significance, it needs to be verified through statistical tests. SHAP analysis can serve as a computer-aided diagnosis tool for clinicians, and its application requires integration with clinicians’ domain expertise to achieve better diagnostic outcomes.


Ethical Considerations
All subjects’ families provided written informed consent. The study was performed following the approval of the ethics committee of Guangdong Provincial People’s Hospital (No. 2020-246H-1). All procedures were followed per the ethical standards of the responsible committee on human experimentation (institutional or regional) and with the Helsinki Declaration. Measures were taken to ensure participant privacy, including data anonymization and secure data storage. No financial or material compensation was provided to the participants or their families for participation in this study.
Results
Four experiments were conducted on the prediction of weaning failure in an ablation setting, testing each component of the colearning model with corresponding data. To demonstrate the advantages of the proposed co-learning model in terms of modal interaction, the prediction performance was compared in a multimodal setting with 3 single-modal settings (diaphragm excursion, clinical or diaphragmatic indicators, and diaphragm ultrasound video). The 88 cases of ultrasound and clinical data were divided into training sets and test sets at a ratio of 8:2, and a 5-fold cross-validation was performed (). The results of the 4 experiments were demonstrated by classification accuracy, receiver operating characteristic curves, precision-recall curve, and calibration curve.
Initially, using automatic mean excursion as the sole predictor yielded a moderate performance with an AUC score of 0.659 [], while the use of diaphragm ultrasound video data, processed with the ViViT model, resulted in an enhanced prediction performance, achieving a notable accuracy of 0.8095 and an AUC of 0.852. The prediction model based on a combination of clinical and ultrasound indicators provided a respectable performance (accuracy=0.7381, AUC=0.746). The most accurate and effective model was the multimodal co-learning approach. This model achieved an accuracy of 0.8331 and an AUC of 0.894, reflecting a significant improvement over the 3 single-modality approaches ().
Moreover, in the precision-recall curve, the average precision score was calculated to summarize a precision-recall curve as the weighted mean of precisions achieved at each threshold. The multimodal co-learning model ran the best compared to the other 3 single modalities. The proposed co-learning model achieved the highest score (average precision=0.91) among the 4 experiments ().
At last, the calibration curve was presented as a measurement of the confidence level of a proposed model and provided support for probability prediction (). The proposed co-learning model was demonstrated to be well calibrated as the pink curve was closest to the perfectly calibrated line (the diagonal line at a 45-degree angle), showing that the predicted probabilities were highly consistent with the true outcomes.
| Data | Colearning (AUC) | ViViT (AUC) |
| 1st-fold (n=18) | 0.857 | 0.889 |
| 2nd-fold (n=18) | 0.843 | 0.778 |
| 3rd-fold (n=18) | 0.892 | 0.861 |
| 4th-fold (n=18) | 0.924 | 0.843 |
| 5th-fold (n=18) | 0.956 | 0.889 |
aAUC: area under curve.
bViViT: Video Vision Transformer.



The experiments showed that the proposed co-learning model fully exploited the single-modal features and multimodal complementary features and achieved feature interaction between the modalities. Using both ultrasound and clinical data for colearning improved the accuracy of the weaning outcome prediction.
The SHAP values for the features were shown in . The SHAP plot ranks features by their importance, with the highest importance at the top. The vertical axis ranked the features by the sum of the SHAP values across all samples, while the horizontal axis represented the SHAP value for each sample. Each dot corresponded to an individual sample, with the sample size indicated by vertical stacking of dots. On one hand, the color of each dot reflected the feature’s value, with redder colors corresponding to higher values and bluer colors to lower values. For example, tidal volume had the highest overall SHAP value. High tidal volumes (in red) clustered on the negative side of the SHAP axis (located to the left of the SHAP=0 line), suggesting that in this cohort, it negatively impacted weaning success. Furthermore, hemoglobin level was identified as a significant predictor, with higher hemoglobin values being associated with a greater likelihood of successful weaning. Albumin also emerged as an important factor in predicting weaning outcomes. The red dots were primarily concentrated on the right side of the SHAP=0 line, indicating that higher albumin levels were associated with successful weaning. However, lower albumin levels were linked to a higher probability of weaning failure, as evidenced by the blue dots located on the leftmost side of the albumin row. On the other hand, the importance of each feature was determined by the sum of its SHAP values, and the features with the highest sum of SHAP values were ranked at the top. Tidal volume, with the largest total SHAP value, emerged as the most influential feature for accurately predicting weaning outcomes. It should be noted that in this study, we incorporated SHAP to understand the importance of different features and enhance the interpretability, rather than to investigate causal relationships in clinical scenarios. SHAP plots generally do not directly show statistical significance (such as P values), which needs to be verified through statistical tests.

Discussion
The Clinical Challenge of Weaning and the Role of Diaphragm Dysfunction
About 26%‐39% of patients have trouble weaning [,] after resolution of the cause of respiratory failure and passing an SBT. Diaphragm dysfunction, resulting from weakness, atrophy, neurological damage, or mechanical disruption, is linked with prolonged MV duration, higher risk of weaning failure, longer length of ICU stay, and poor clinical outcomes []. Unsurprisingly, there is growing interest in investigating noninvasive bedside tools to evaluate the diaphragm and detect diaphragm dysfunction accurately. This is an important consideration when selecting which multiple parameters may be clinically useful to assess outcomes and for the prognostication of patients who may benefit from targeted interventions. Our serial research indicated that the incidence of diaphragm dysfunction in severe patients was as high as 75%, and diaphragm dysfunction was an important cause of weaning failure []. Over the past 10 years, diaphragm ultrasound has gradually become an extensively used tool for assessing diaphragm function []. As demonstrated in our previous studies, the ability to detect diaphragm dysfunction might become an important part of predicting weaning failure or prolonged weaning and holds the potential for determining weaning techniques. However, different studies have different predictors in addition to diaphragmatic indicators. Peer colleagues have also pointed out that the optimal prediction model has not been established, which is worth further exploration [].
Existing AI-Based Weaning Prediction Studies
This report discusses the potential of POCUS assessment of the diaphragm using automatic speckle-tracking measurement plus other clinical characteristics in predicting weaning outcomes. Integrated with diaphragmatic indicators and 27 clinical indexes, the multimodal model was constructed to achieve accurate and intelligent prediction of weaning outcomes (accuracy=0.8331, AUC=0.894). The results reflected a significant improvement over the other 3 single-modality approaches. The high AUC value indicated that the multimodal model had a strong ability to correctly classify the outcomes, suggesting that the fusion of diverse data types (clinical or ultrasound video) generated from the critical care scenarios allowed the model to capture a more comprehensive and complex set of patterns for better predictions. The use of AI algorithms or ML to build predictive models for the successful weaning process was promising. Derivation of different clinical phenotypes of ventilated patients from the MIMIC-IV cohort was associated with extubation success. A study conducted at Chi Mei Medical Center selected 26 feature variables to build the predictive models with AI/ML algorithms, with a high AUC ranging from 0.792 to 0.868 []. Cheng Hsin General Hospital collected 28 variables and created a simple AI model for predicting weaning time []. A retrospective cohort study in Central Taiwan showed that explainable ML could predict successful weaning among patients requiring prolonged MV (AUC=0.908) []. Moreover, the 2-stage AI prediction models effectively and precisely predicted the optimal timing to wean intubated patients, with the AUCs of optimal models ranging from 0.889 to 0.944 [].
Future Optimization 1: AI Integrates Multisystem POCUS Assessment
Of note, the result of this study was compromised due to a small sample. The AI model required a prospective cohort of a larger sample to validate its efficacy. Multiple risk factors were associated with weaning failure, such as cardiopulmonary dysfunction and older adults with severe comorbidities. It was urgent to optimize the decision-making process of weaning based on merely curtailing a few indicators. POCUS provides the dynamic assessment of multiorgan function during the weaning process. Even though high operator dependency limited the implementation and promotion of multisystem ultrasound monitoring in invasive MV patients, AI-integrated multisystem ultrasonic parameters rapidly and accurately assess, including for heart, lung, pleural cavity, diaphragm, and volume assessment.
Future Optimization 2: Diaphragm-Protective Ventilation Strategy
For the concern raised by Sabourin et al [], suggesting that diaphragm ultrasonography should be done in patients with spontaneous breathing without pressure support, we hope to refine the objective parameters such as laboratory indexes and MV settings to improve the prediction in a clinical scenario. We hypothesize that, with the assistance of AI, the multimodal model might potentially distinguish between MV support and the patient’s spontaneous breathing efforts, resembling clinical patterns seen in pressure support ventilation or synchronized intermittent mandatory ventilation mode. In our previous study, the excursion did not fully reflect whether the diaphragm movement was active or passive. Moreover, the value of diaphragmatic indicators was not only appreciated merely through excursion, but also in combination with the automatic measurement of diaphragm thickness and DTF. The abovementioned integration would become a strategy for diaphragm-protective ventilation.
Limitations
First, an automatic measurement for DTF, a reliable indicator for the weaning outcome and supposedly less impacted by pressure support ventilation [], was under development. Moreover, the success of the proposed model relied on the quality of diaphragm ultrasound videos. In clinical practice, variations in equipment quality and ICU doctors’ experience in performing POCUS could affect the reliability of these measurements. Future algorithms, integrating DTF and AI, would be explored to enhance accuracy and improve inter- and intraoperator reproducibility. Second, while the co-learning model achieved high predictive performance, the underlying decision-making process of the model was not fully interpretable. The translation of the findings of this study into clinical strategies still requires further refinement of the algorithm. Third, the relatively small sample size was not formally calculated, which limited the generalizability of the results to larger populations. A larger, multicenter study was needed to confirm the robustness and applicability of our findings across diverse patient cohorts. Recent literature, including 2 reviews published in 2022 [] and 2024 [], lacked evidence that combined ultrasound videos with AI for predicting weaning outcomes. As such, our proposed model from 2022 [], integrating diaphragm ultrasound videos and AI techniques, represented its novelty in this specific context. In conclusion, the weaning outcome was closely related to multiple dynamic factors and the function of the diaphragm and even various organs. The integration of multimodal AI, using automatic measurement from POCUS and automated collection of objective clinical and laboratory indicators, greatly enhanced the prediction of weaning outcomes. This approach held promise for broader clinical application, a personalized tool that supported decision-making and was conducive to clinical promotion.
Acknowledgments
This research was supported by the self-funded project within the Scientific Research Institute of Guangdong Provincial People's Hospital(No.8237080090, No.8247080256), and the Guangdong Basic and Applied Basic Research Foundation (No. 2024A1515220052). To reflect appropriate contributors to the study, we thank Mingliu Zhu, PhD; Jiaming Jiao, PhD; and Shuo Liu, PhD, who are the system research engineers from the ultrasound imaging system development department of Shenzhen Mindray Bio-Medical Electronics Co, Ltd, for technical support.
Data Availability
The data generated or analyzed during this study are not publicly available to protect the privacy of participants, but are available from the corresponding author upon reasonable request.
Authors' Contributions
FS investigated and wrote the original draft. H Liu worked on the formal analysis. HM reviewed and edited the writing. XC curated the data. SW handled the project administration. TQ collected the resources. H Liang supervised this study. DH conceptualized and supervised. All authors read and approved the final paper.
Authors DH (hdzdoctor@139.com) and H Liang (lianghuiying@hotmail.com) are co-corresponding authors for this article
Conflicts of Interest
None declared.
Table of clinical or diaphragm ultrasound indicators used in the model.
DOCX File, 27 KBReferences
- McConville JF, Kress JP. Weaning patients from the ventilator. N Engl J Med. Dec 6, 2012;367(23):2233-2239. [CrossRef] [Medline]
- Pham T, Heunks L, Bellani G, et al. Weaning from mechanical ventilation in intensive care units across 50 countries (WEAN SAFE): a multicentre, prospective, observational cohort study. Lancet Respir Med. May 2023;11(5):465-476. [CrossRef] [Medline]
- Dres M, Dubé BP, Mayaux J, et al. Coexistence and impact of limb muscle and diaphragm weakness at time of liberation from mechanical ventilation in medical intensive care unit patients. Am J Respir Crit Care Med. Jan 1, 2017;195(1):57-66. [CrossRef] [Medline]
- Goligher EC, Dres M, Fan E, et al. Mechanical ventilation–induced diaphragm atrophy strongly impacts clinical outcomes. Am J Respir Crit Care Med. Jan 15, 2018;197(2):204-213. [CrossRef]
- Lee J, Geyer B, Naraghi L, et al. Advanced imaging use in intensive care units has decreased, resulting in lower charges without negative effects on patient outcomes. J Crit Care. Jun 2015;30(3):460-464. [CrossRef] [Medline]
- Truong D, Abo S, Whish-Wilson GA, et al. Methodological and clinimetric evaluation of inspiratory respiratory muscle ultrasound in the critical care setting: a systematic review and meta-analysis. Crit Care Med. Feb 1, 2023;51(2):e24-e36. [CrossRef] [Medline]
- Tashiro N, Hasegawa T, Nishiwaki H, et al. Clinical utility of diaphragmatic ultrasonography for mechanical ventilator weaning in adults: a study protocol for systematic review and meta-analysis. Health Sci Rep. Jul 2023;6(7):e1378. [CrossRef] [Medline]
- Llamas-Álvarez AM, Tenza-Lozano EM, Latour-Pérez J. Diaphragm and lung ultrasound to predict weaning outcome: systematic review and meta-analysis. Chest. Dec 2017;152(6):1140-1150. [CrossRef] [Medline]
- Mahmoodpoor A, Fouladi S, Ramouz A, Shadvar K, Ostadi Z, Soleimanpour H. Diaphragm ultrasound to predict weaning outcome: systematic review and meta-analysis. Anaesthesiol Intensive Ther. 2022;54(2):164-174. [CrossRef] [Medline]
- Le Neindre A, Philippart F, Luperto M, et al. Diagnostic accuracy of diaphragm ultrasound to predict weaning outcome: a systematic review and meta-analysis. Int J Nurs Stud. May 2021;117:103890. [CrossRef] [Medline]
- Huang D, Song F, Luo B, et al. Using automatic speckle tracking imaging to measure diaphragm excursion and predict the outcome of mechanical ventilation weaning. Crit Care. Jan 14, 2023;27(1):18. [CrossRef] [Medline]
- Jia Y, Kaul C, Lawton T, Murray-Smith R, Habli I. Prediction of weaning from mechanical ventilation using convolutional neural networks. Artif Intell Med. Jul 2021;117:102087. [CrossRef] [Medline]
- Liu CF, Hung CM, Ko SC, et al. An artificial intelligence system to predict the optimal timing for mechanical ventilation weaning for intensive care unit patients: a two-stage prediction approach. Front Med (Lausanne). 2022;9:935366. [CrossRef] [Medline]
- Chen X, Li J, Liu G, et al. Identification of distinct clinical phenotypes of heterogeneous mechanically ventilated ICU patients using cluster analysis. J Clin Med. Feb 14, 2023;12(4):1499. [CrossRef] [Medline]
- Li W, Zhang Y, Wang Z, et al. The risk factors of reintubation in intensive care unit patients on mechanical ventilation: a systematic review and meta-analysis. Intensive Crit Care Nurs. Feb 2023;74:103340. [CrossRef] [Medline]
- Premkumar CR, Parthasarathy R, Kumar S, Vijaykumar S. Postoperative respiratory depression caused by iatrogenic hypermagnesaemia. Anaesth Rep. 2022;10(1):e12154. [CrossRef] [Medline]
- Talakoub R, Bahrami M, Honarmand A, Abbasi S, Gerami H. The predicting ability of serum phosphorus to assess the duration of mechanical ventilation in critically ill patients. Adv Biomed Res. 2017;6:51. [CrossRef] [Medline]
- Zhao Y, Li Z, Shi Y, et al. Effect of hypophosphatemia on the withdrawal of mechanical ventilation in patients with acute exacerbations of chronic obstructive pulmonary disease. Biomed Rep. Apr 2016;4(4):413-416. [CrossRef] [Medline]
- Pu H, Doig GS, Lv Y, et al. Modifiable risk factors for ventilator associated diaphragmatic dysfunction: a multicenter observational study. BMC Pulm Med. Sep 13, 2023;23(1):343. [CrossRef] [Medline]
- Arnab A, Dehghani M, Heigold G, Sun C, Lucic M, Schmid C. ViViT: a Video Vision Transformer. Preprint posted online on Mar 29, 2021. URL: https://ui.adsabs.harvard.edu/abs/2021arXiv210315691A [Accessed 2025-07-08] [CrossRef]
- Li J, Liu S, Hu Y, Zhu L, Mao Y, Liu J. Predicting mortality in intensive care unit patients with heart failure using an interpretable machine learning model: retrospective cohort study. J Med Internet Res. Aug 9, 2022;24(8):e38082. [CrossRef] [Medline]
- Funk GC, Anders S, Breyer MK, et al. Incidence and outcome of weaning from mechanical ventilation according to new categories. Eur Respir J. Jan 2010;35(1):88-94. [CrossRef] [Medline]
- Peñuelas O, Frutos-Vivar F, Fernández C, et al. Characteristics and outcomes of ventilated patients according to time to liberation from mechanical ventilation. Am J Respir Crit Care Med. Aug 15, 2011;184(4):430-437. [CrossRef] [Medline]
- Purro A, Appendini L, De Gaetano A, Gudjonsdottir M, Donner CF, Rossi A. Physiologic determinants of ventilator dependence in long-term mechanically ventilated patients. Am J Respir Crit Care Med. Apr 2000;161(4 Pt 1):1115-1123. [CrossRef] [Medline]
- Huang D, Ma H, Zhong W, et al. Using M-mode ultrasonography to assess diaphragm dysfunction and predict the success of mechanical ventilation weaning in elderly patients. J Thorac Dis. Sep 2017;9(9):3177-3186. [CrossRef] [Medline]
- Haaksma ME, Smit JM, Boussuges A, et al. Expert Consensus On Diaphragm Ultrasonography in the Critically Ill (EXODUS): a Delphi consensus statement on the measurement of diaphragm ultrasound-derived parameters in a critical care setting. Crit Care. Apr 8, 2022;26(1):99. [CrossRef] [Medline]
- Wang L, Muhetaer Y, Zhu L, Wang Q, Wu W. Is it reasonable to predict weaning by measuring diaphragm activity under ultrasound especially its reduction of excursion? Crit Care. 2023;27(1):309. [CrossRef]
- Liao KM, Ko SC, Liu CF, et al. Development of an interactive AI system for the optimal timing prediction of successful weaning from mechanical ventilation for patients in respiratory care centers. Diagnostics (Basel). Apr 13, 2022;12(4):975. [CrossRef] [Medline]
- Chen WT, Huang HL, Ko PS, Su W, Kao CC, Su SL. A simple algorithm using ventilator parameters to predict successfully rapid weaning program in cardiac intensive care unit patients. J Pers Med. Mar 21, 2022;12(3):501. [CrossRef] [Medline]
- Lin MY, Li CC, Lin PH, et al. Explainable machine learning to predict successful weaning among patients requiring prolonged mechanical ventilation: a retrospective cohort study in central Taiwan. Front Med (Lausanne). 2021;8:663739. [CrossRef] [Medline]
- Sabourin E, Carpentier C, Lai C, Monnet X, Pham T. “Under pressure”: should we use diaphragm excursion to predict weaning success in patients receiving pressure support ventilation? Crit Care. Jun 15, 2023;27(1):238. [CrossRef] [Medline]
- Gallifant J, Zhang J, Del Pilar Arias Lopez M, et al. Artificial intelligence for mechanical ventilation: systematic review of design, reporting standards, and bias. Br J Anaesth. Feb 2022;128(2):343-351. [CrossRef] [Medline]
- Stivi T, Padawer D, Dirini N, Nachshon A, Batzofin BM, Ledot S. Using artificial intelligence to predict mechanical ventilation weaning success in patients with respiratory failure, including those with acute respiratory distress syndrome. J Clin Med. Mar 5, 2024;13(5):1505. [CrossRef] [Medline]
Abbreviations
| AI: artificial intelligence |
| AUC: area under the curve |
| DTF: diaphragm thickening fraction |
| eICU: electronic intensive care unit |
| ICU: intensive care unit |
| MIMIC-IV: Medical Information Mart for Intensive Care |
| ML: machine learning |
| MV: mechanical ventilation |
| POCUS: point-of-care ultrasonography |
| SBT: spontaneous breathing trial |
| SHAP: Shapley Additive Explanation |
| SOFA: Sequential Organ Failure Assessment |
| ViViT: Video Vision Transformer |
Edited by Javad Sarvestan; submitted 11.02.25; peer-reviewed by Ahmed N Elshaer, Guo Tu; final revised version received 01.06.25; accepted 04.06.25; published 08.09.25.
Copyright© Feier Song, Huazhang Liu, Huan Ma, Xuanhui Chen, Shouhong Wang, Tiehe Qin, Huiying Liang, Daozheng Huang. Originally published in JMIR Formative Research (https://formative.jmir.org), 8.9.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.

