Published on in Vol 8 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/57592, first published .
Evaluating Bard Gemini Pro and GPT-4 Vision Against Student Performance in Medical Visual Question Answering: Comparative Case Study

Evaluating Bard Gemini Pro and GPT-4 Vision Against Student Performance in Medical Visual Question Answering: Comparative Case Study

Evaluating Bard Gemini Pro and GPT-4 Vision Against Student Performance in Medical Visual Question Answering: Comparative Case Study

Authors of this article:

Jonas Roos1 Author Orcid Image ;   Ron Martin2 Author Orcid Image ;   Robert Kaczmarczyk3 Author Orcid Image

Journals

  1. Cornelius J, Knitza J, Hack J, Pavlovic M, Kuhn S. Einsatzmöglichkeiten von Large Language Models in der Unfallchirurgie. Die Unfallchirurgie 2025;128(8):587 View
  2. Bashah A, Salem A, Al-waqeerah A, Ghaleb E, Wahan N, Awad A, Al-tos O, Chen G. Evaluation of deepseek, gemini, ChatGPT-4o, and perplexity in responding to salivary gland cancer. BMC Oral Health 2025;25(1) View
  3. Baldıran Ş, Eryılmaz B. YAPAY ZEKA ARAÇLARININ SEYAHAT PLANLAMADA KULLANIMI: KARŞILAŞTIRMALI BİR ANALİZ. Pamukkale University Journal of Social Sciences Institute 2025;(71) View
  4. Senngam M, Pornwattanakavee S, Leelakanok N, Todsarot T, Guinto G, Takun R, Sumativit A. The Effectiveness of ChatGPT, Google Gemini, and Microsoft Copilot in Answering Thai Drug Information Queries: a Cross-sectional Study (Preprint). JMIR AI 2025 View
  5. Luo D, Liu M, Zhang H, Wang X, Gao Q, Kuang N, Yin T, Zheng Z. Comparative performance of Chinese and international large language models on the Chinese radiology attending physician qualification examination. Scientific Reports 2025;15(1) View

Conference Proceedings

  1. Ngo A, Doan T, Ngo T, Nguyen V. Proceedings of the 2025 10th International Conference on Intelligent Information Technology. Legal Documents Query Application for Vietnamese Law Using LLM and RAG Techniques View
  2. Rughiniș C, Dascălu M, Rasnayake S. 2025 25th International Conference on Control Systems and Computer Science (CSCS). GenAI Reliability in Content Analysis: Assessing Agreement Between LLMs in Measuring Discursive Violence View