Published in Vol 9 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/77357.
Performance of DeepSeek-R1, ChatGPT (GPT-o3-mini), and Gemini 2.0 Flash on German Medical Multiple-Choice Questions: Comparative Evaluation

Annika Meyer 1, Dr med; Yassin Karay 2, Dr rer med; Andrea U Steinbicker 1, Prof Dr Med; Thomas Streichert 3, Prof Dr Med; Remco Overbeek 1, Dr med

1 Department of Anesthesiology and Operative Intensive Care, Faculty of Medicine and University Hospital, University Hospital Cologne, Cologne, Germany

2 Dean’s Office for Student Affairs, Faculty of Medicine, University Hospital Cologne, Cologne, Germany

3 Institute for Clinical Chemistry, Faculty of Medicine and University Hospital, University Hospital Cologne, Cologne, Germany

Corresponding Author:

  • Annika Meyer, Dr med
  • Department of Anesthesiology and Operative Intensive Care
  • Faculty of Medicine and University Hospital
  • University Hospital Cologne
  • Kerpener Str. 62
  • Cologne 50937
  • Germany
  • Phone: 1 0000000000
  • Email: annika.meyer1@uk-koeln.de