Search Articles

View query in Help articles search

Search Results (1 to 10 of 45 Results)

Download search results: CSV END BibTex RIS


Beyond Benchmarks: Evaluating Generalist Medical Artificial Intelligence With Psychometrics

Beyond Benchmarks: Evaluating Generalist Medical Artificial Intelligence With Psychometrics

To measure a certain construct in GMAI, psychometrics specifies a variety of assessment formats that are not limited to test-based assessment such as benchmarks. Other formats include practical assessment, as shown in the example with AMIE [3], observational assessment, situational assessment, interactive assessment, among others, all of which are commonly used to evaluate skills and behaviors in psychometrics [16].

Luning Sun, Christopher Gibbons, José Hernández-Orallo, Xiting Wang, Liming Jiang, David Stillwell, Fang Luo, Xing Xie

J Med Internet Res 2025;27:e70901

Developing and Validating a Self-Care Self-Efficacy Scale for Oral Anticoagulation Therapy in Patients With Nonvalvular Atrial Fibrillation: Protocol for a Mixed Methods Study

Developing and Validating a Self-Care Self-Efficacy Scale for Oral Anticoagulation Therapy in Patients With Nonvalvular Atrial Fibrillation: Protocol for a Mixed Methods Study

This second phase (ie, validation phase) aims to define the construct intended to be measured, providing evidence of psychometrics validity for each translated version of SCSE-OAC. Two main substeps will be performed (1) content and face validity and (2) construct validity. According to the COSMIN guideline, the content validity of the scale will be assessed based on the key criteria of relevance, comprehensiveness, and comprehensibility.

Arianna Magon, Jeroen Hendriks, Rosario Caruso

JMIR Res Protoc 2024;13:e51489