TY - JOUR AU - Wang, Andrew AU - Fulton, Rachel AU - Hwang, Sy AU - Margolis, David J AU - Mowery, Danielle PY - 2024 DA - 2024/1/26 TI - Patient Phenotyping for Atopic Dermatitis With Transformers and Machine Learning: Algorithm Development and Validation Study JO - JMIR Form Res SP - e52200 VL - 8 KW - atopic dermatitis KW - classification KW - classifier KW - dermatitis KW - dermatology KW - EHR KW - electronic health record KW - health records KW - health KW - informatics KW - machine learning KW - natural language processing KW - NLP KW - patient phenotyping KW - phenotype KW - skin KW - transformer KW - transformers AB - Background: Atopic dermatitis (AD) is a chronic skin condition that millions of people around the world live with each day. Performing research into identifying the causes and treatment for this disease has great potential to provide benefits for these individuals. However, AD clinical trial recruitment is not a trivial task due to the variance in diagnostic precision and phenotypic definitions leveraged by different clinicians, as well as the time spent finding, recruiting, and enrolling patients by clinicians to become study participants. Thus, there is a need for automatic and effective patient phenotyping for cohort recruitment. Objective: This study aims to present an approach for identifying patients whose electronic health records suggest that they may have AD. Methods: We created a vectorized representation of each patient and trained various supervised machine learning methods to classify when a patient has AD. Each patient is represented by a vector of either probabilities or binary values, where each value indicates whether they meet a different criteria for AD diagnosis. Results: The most accurate AD classifier performed with a class-balanced accuracy of 0.8036, a precision of 0.8400, and a recall of 0.7500 when using XGBoost (Extreme Gradient Boosting). Conclusions: Creating an automated approach for identifying patient cohorts has the potential to accelerate, standardize, and automate the process of patient recruitment for AD studies; therefore, reducing clinician burden and informing the discovery of better treatment options for AD. SN - 2561-326X UR - https://formative.jmir.org/2024/1/e52200 UR - https://doi.org/10.2196/52200 UR - http://www.ncbi.nlm.nih.gov/pubmed/38277207 DO - 10.2196/52200 ID - info:doi/10.2196/52200 ER -