Search Articles

Search Results (1 to 10 of 71 Results)

Peer Review of “Estimating Variance of Log Standardized Incidence Ratios Assessing Health Care Providers’ Performance: Comparative Analysis Using Bayesian, Bootstrap, and Delta Method Approaches”

This is a peer-review report for “Estimating Variance of Log Standardized Incidence Ratios Assessing Health Care Providers’ Performance: Comparative Analysis Using Bayesian, Bootstrap, and Delta Method Approaches.”

Emmanuel Oluwagbade

JMIRx Med 2025;6:e83798


Authors’ Response to the Peer Review of “Estimating Variance of Log Standardized Incidence Ratios Assessing Health Care Providers’ Performance: Comparative Analysis Using Bayesian, Bootstrap, and Delta Method Approaches”

This is the authors’ response to the peer-review report of “Estimating Variance of Log Standardized Incidence Ratios Assessing Health Care Providers’ Performance: Comparative Analysis Using Bayesian, Bootstrap, and Delta Method Approaches.”

Solomon Woldeyohannes, Yomei Jones, Paul Lawton

JMIRx Med 2025;6:e83796


Estimating Variance of Log Standardized Incidence Ratios Assessing Health Care Providers’ Performance: Comparative Analysis Using Bayesian, Bootstrap, and Delta Method Approaches

Although funnel plots have been used in meta-analyses, in particular to detect publication bias, they have recently been strongly recommended as the most appropriate way to display performance indicators such as comparisons of risk-adjusted rates between health care units [8]. SMRs are the commonly used performance index for institutional comparisons [9].

Solomon Woldeyohannes, Yomei Jones, Paul Lawton

JMIRx Med 2025;6:e77415


Performance of Natural Language Processing for Information Extraction From Electronic Health Records Within Cancer: Systematic Review

Some variations between the average F1-score performance differences were observed (see Figure 3). Our results show that more advanced models outperform less advanced ones. The largest difference between the category performance F1-scores was observed between the BT category and the rule-based category. BT models were compared with rule-based models in 4 studies, yielding an average performance difference of 0.2335 in terms of F1-score. BT was the best-performing category.

Simon Dahl, Martin Bøgsted, Tomer Sagi, Charles Vesteghem

JMIR Med Inform 2025;13:e68707


Improving the Predictive Accuracy of the National Early Warning Score 2: Protocol for Algorithm Refinement

In an evaluation of its performance, the original National Early Warning Score (NEWS) demonstrated consistent predictive accuracy across multiple patient groups [5]. However, the performance of NEWS2 varies between care settings and between patients with different characteristics [11,16,17].

Chris Plummer, Cen Cong, Madison Milne-Ives, Lynsey Threlfall, Peta Le Roux, Edward Meinert

JMIR Res Protoc 2025;14:e70303


Technology-Assisted Motor-Cognitive Training Among Older Adults: Rapid Systematic Review of Randomized Controlled Trials

If a study demonstrated statistical significance for a particular outcome (any related outcome indicator in physical, cognitive, or dual-task performance), it was considered valid in that area (eg, physical, cognitive, or dual-task performance). In these cases, outcome indicators that reached statistical significance were marked with a checkmark. The literature search yielded 5874 potentially relevant studies.

Yaqin Li, Yaqian Liu, Angela YM Leung, Jed Montayre

JMIR Serious Games 2025;13:e67250


A Practical Guide and Assessment on Using ChatGPT to Conduct Grounded Theory: Tutorial

Reliability is a key metric for evaluating ChatGPT’s coding performance, assessing whether different coders (human or machine) produce similar results [9-11]. Typical measures include percent agreement (comparing human and artificial intelligence [AI] coding similarity) and the κ coefficient (measuring agreement beyond chance) [12,13].

Yongjie Yue, Dong Liu, Yilin Lv, Junyi Hao, Peixuan Cui

J Med Internet Res 2025;27:e70122
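
The two reliability measures named in the excerpt above can be illustrated with a minimal sketch. It assumes two coders and Cohen's κ specifically (the excerpt says only "the κ coefficient"); the function names and sample labels are hypothetical and are not taken from the tutorial.

```python
from collections import Counter


def percent_agreement(coder_a, coder_b):
    """Share of items on which the two coders assigned the same label."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("both coders must label the same non-empty set of items")
    matches = sum(x == y for x, y in zip(coder_a, coder_b))
    return matches / len(coder_a)


def cohen_kappa(coder_a, coder_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(coder_a)
    p_observed = percent_agreement(coder_a, coder_b)
    counts_a, counts_b = Counter(coder_a), Counter(coder_b)
    # Chance agreement: probability that both coders pick the same label
    # if each labels independently according to their own label frequencies.
    labels = counts_a.keys() | counts_b.keys()
    p_expected = sum(counts_a[label] * counts_b[label] for label in labels) / (n * n)
    if p_expected == 1.0:
        # Both coders used a single identical label throughout; kappa is
        # undefined (0/0), and 1.0 is returned here by convention.
        return 1.0
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical labels for four text segments.
human = ["theme_a", "theme_a", "theme_b", "theme_b"]
ai = ["theme_a", "theme_a", "theme_b", "theme_a"]
print(percent_agreement(human, ai))
print(cohen_kappa(human, ai))
```

Percent agreement alone can look high when one label dominates, which is why a chance-corrected measure such as κ is usually reported alongside it.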


Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis

Recent studies have extensively evaluated the performance of generative AI models in medical question-answering scenarios. These models have shown promising results in medical licensing examinations [4,5], clinical case discussions, and diagnostic reasoning [6,7]. However, their performance varies significantly based on task complexity.

Mélanie Suppan, Pietro Elias Fubini, Alexandra Stefani, Mia Gisselbaek, Caroline Flora Samer, Georges Louis Savoldelli

JMIR AI 2025;4:e66796


The Applications of Large Language Models in Mental Health: Scoping Review

Most studies measured the performance of these LLMs with various metrics, such as F1-score (54/95, 57%), precision (34/95, 36%), accuracy (45/95, 47%), and recall (32/95, 34%). The number of studies mapped by country is presented in Figure 3A (a higher resolution version of the figure is also available in Multimedia Appendix 5).

Yu Jin, Jiayi Liu, Pan Li, Baosen Wang, Yangxinyu Yan, Huilin Zhang, Chenhao Ni, Jing Wang, Yi Li, Yajun Bu, Yuanyuan Wang

J Med Internet Res 2025;27:e69284


Encouraging the Voluntary Mobilization of Mental Resources by Manipulating Task Design: Explorative Study

Thus, in this context, it should be considered in relation to the corresponding perceived performance and frustration when performing the task. Positive mental effort should be accompanied by high perceived performance and low frustration. To make the notion of positive mental effort operational, we return to the definition of this concept, specifying that it is part of the MWL construct.

Lina-Estelle Louis, Saïd Moussaoui, Sébastien Ravoux, Isabelle Milleville-Pennel

JMIR Form Res 2025;9:e63491