Published on in Vol 8 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/59267, first published .
Evaluating ChatGPT-4’s Accuracy in Identifying Final Diagnoses Within Differential Diagnoses Compared With Those of Physicians: Experimental Study for Diagnostic Cases

Evaluating ChatGPT-4’s Accuracy in Identifying Final Diagnoses Within Differential Diagnoses Compared With Those of Physicians: Experimental Study for Diagnostic Cases

Evaluating ChatGPT-4’s Accuracy in Identifying Final Diagnoses Within Differential Diagnoses Compared With Those of Physicians: Experimental Study for Diagnostic Cases

Journals

  1. Dalal A, Plombon S, Konieczny K, Motta-Calderon D, Malik M, Garber A, Lam A, Piniella N, Leeson M, Garabedian P, Goyal A, Roulier S, Yoon C, Fiskio J, Schnock K, Rozenblum R, Griffin J, Schnipper J, Lipsitz S, Bates D. Adverse diagnostic events in hospitalised patients: a single-centre, retrospective cohort study. BMJ Quality & Safety 2025;34(6):377 View
  2. Barabucci G, Shia V, Chu E, Harack B, Laskowski K, Fu N. Combining Multiple Large Language Models Improves Diagnostic Accuracy. NEJM AI 2024;1(11) View
  3. Sanchez Tena M, Alvarez‐Peregrina C, Martinez‐Perez C. Evaluation of the perception of information from ChatGPT in myopia education: Perspectives of students and professionals. Ophthalmic and Physiological Optics 2025;45(3):883 View
  4. Padovan M, Palla A, Marino R, Porciatti F, Cosci B, Carlucci F, Nerli G, Petillo A, Necciari G, Dell’Amico L, Lucisano V, Scarinci S, Foddis R. ChatGPT-4 vs. Google Bard: Which Chatbot Better Understands the Italian Legislative Framework for Worker Health and Safety?. Applied Sciences 2025;15(3):1508 View
  5. Saglam S, Uludag V, Karaduman Z, Arıcan M, Yücel M, Dalaslan R. Comparative evaluation of artificial intelligence models GPT-4 and GPT-3.5 in clinical decision-making in sports surgery and physiotherapy: a cross-sectional study. BMC Medical Informatics and Decision Making 2025;25(1) View
  6. Sarı M, Tufenkci P. Evaluation of the Competency of Large Language Models GPT-4o and Claude 3.5 Sonnet in Endodontic Emergencies. European Annals of Dental Sciences 2025;52(1):10 View
  7. Bolgova O, Ganguly P, Mavrych V. Comparative analysis of LLMs performance in medical embryology: A cross‐platform study of ChatGPT, Claude, Gemini, and Copilot. Anatomical Sciences Education 2025;18(7):718 View
  8. Bridges J, Jiang X, Ige M, Toyobo O. Computerized diagnostic decision support systems—Isabel Pro versus ChatGPT-4 part II. JAMIA Open 2025;8(3) View
  9. Zhang A, Chen J. AI-Driven Network Biology Identifies SRC as a Therapeutic Target in Metastatic Pancreatic Adenocarcinoma. Intelligent Oncology 2025 View