Search Articles

View query in Help articles search

Search Results (1 to 10 of 82 Results)

Download search results: CSV END BibTex RIS

CSV download: Download all 82 search results (up to 5,000 articles maximum)

Monitoring Opioid-Related Social Media Chatter Using Natural Language Processing and Large Language Models: Temporal Analysis

Monitoring Opioid-Related Social Media Chatter Using Natural Language Processing and Large Language Models: Temporal Analysis

The proposed system uses supervised LLM to automatically classify opioid-related discussions into 4 predefined categories, such as personal experience, external experience, information, and not related, and integrates temporal data from Reddit to monitor the opioid crisis in specific time frames. For this purpose, we used 6 years of publicly available social media posts from Reddit, dated between January 1, 2018, and December 30, 2023.

Grigori Sidorov, Muhammad Ahmad, Pierpaolo Basile, Muhammad Waqas, Rita Orji, Ildar Batyrshin

JMIR Infodemiology 2025;5:e77279


What Does Text Mining of Reddit Forums Reveal About Factors Surrounding Mental Health in Singapore?

What Does Text Mining of Reddit Forums Reveal About Factors Surrounding Mental Health in Singapore?

Our study is not considered to meet federal definitions under the jurisdiction of an IRB and falls outside the purview of the Human Research Protect Office as the data was obtained from social media posts consistent with the Reddit terms of use applicable at the time. All the secondary data used were anonymous. Reddit posts spanning January 2015 to December 2022 were retrieved from four Singapore-based subreddits – r/ask Singapore, r/singapore, r/singapore R, and r/singapore Raw.

Charles Alba, Abel Beng Heng Ang, Vahid Abbasian, Gerard Chung

J Med Internet Res 2025;27:e72959


Assessing Large Language Models in Building a Structured Dataset From AskDocs Subreddit Data: Methodological Study

Assessing Large Language Models in Building a Structured Dataset From AskDocs Subreddit Data: Methodological Study

Recent studies on social media platforms such as Reddit have highlighted active user engagement in health-related discussions, such as medication abortion [4] and dermatology [5]. The r/Ask Docs subreddit has been a focal point for analyzing user demographics and health topic trends, marked by a dramatic increase in user posts over time [3].

Quinn Snell, Chase Westhoff, John Westhoff, Ethan Low, Carl L Hanson, E Shannon Neeley Tass

J Med Internet Res 2025;27:e74094


Real-World Use of Topical Ruxolitinib in Vitiligo: Retrospective Cross-Sectional Mixed Methods Infodemiology Study of the r/Vitiligo Subreddit

Real-World Use of Topical Ruxolitinib in Vitiligo: Retrospective Cross-Sectional Mixed Methods Infodemiology Study of the r/Vitiligo Subreddit

Reddit, in particular, has emerged as a valuable ecological data source for infodemiology studies due to its topic-specific communities (subreddits) that foster candid discussions. For instance, Reddit data were used to understand the emotional journey of patients and caregivers navigating brain cancer diagnoses [12].

Michael Constantin Kirchberger, Carola Berking, Andreas Eisenried

J Med Internet Res 2025;27:e78247


Analyzing Reddit Social Media Content in the United States Related to H5N1: Sentiment and Topic Modeling Study

Analyzing Reddit Social Media Content in the United States Related to H5N1: Sentiment and Topic Modeling Study

The study incorporates data sourced from various Reddit communities segmented by US states, allowing for a more localized analysis of public sentiments and concerns. This approach provides a granular understanding of regional differences, which is crucial for tailoring public health strategies.

Oscar Pang, Zahra Movahedi Nia, Murray Gillies, Doris Leung, Nicola Bragazzi, Itlala Gizo, Jude Dzevela Kong

J Med Internet Res 2025;27:e70746


Stressors Disclosed on Reddit by Caregivers of Older Adults and Social Support Received: Content Analysis

Stressors Disclosed on Reddit by Caregivers of Older Adults and Social Support Received: Content Analysis

Among these, Reddit stands out for its unique features and popularity among informal caregivers. As an assemblage of community forums, Reddit enables users to engage in discussions on a specific topic within peer-to-peer communities (ie, subreddits) [15]. The anonymous nature of Reddit encourages users to discuss sensitive topics freely without fear of judgment, making it easier for caregivers to disclose their challenges [16,17].

Nova Mengxia Huang, Bryan Timothy, Shirley S Ho

JMIR Aging 2025;8:e71452


Self-Disclosure and Social Support in a Web-Based Opioid Recovery Community: Machine Learning Analysis

Self-Disclosure and Social Support in a Web-Based Opioid Recovery Community: Machine Learning Analysis

To develop the opioid use and recovery stage classifier, we leveraged the Drug Abuse Reddit dataset from Ghosh et al [31], which provided a more balanced distribution of recovery stages. This dataset contains 3151 Reddit posts categorized into 5 recovery stages: “Addicted,” “E-Recovery” (Early), “M-Recovery” (Maintaining), “A-Recovery” (Advanced), and “Others.”

Yu Chi, Huai-yu Chen, Khushboo Thaker

JMIR Form Res 2025;9:e71207


Using Natural Language Processing to Describe the Use of an Online Community for Abortion During 2022: Dynamic Topic Modeling Analysis of Reddit Posts

Using Natural Language Processing to Describe the Use of an Online Community for Abortion During 2022: Dynamic Topic Modeling Analysis of Reddit Posts

Reddit data have historically been accessible through free, publicly available application programming interfaces (APIs). Researchers have often used Pushshift’s Reddit API and Reddit’s official API to obtain compiled information about content shared on Reddit, including creation date, submission (post or comment) text, community interactions (likes, upvotes, and downvotes), and more.

Elizabeth Pleasants, Ndola Prata, Ushma D Upadhyay, Cassondra Marshall, Coye Cheshire

JMIR Infodemiology 2025;5:e72771


Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction

Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction

Our study addresses these critical gaps through a systematic evaluation of LLM embeddings on a Reddit dataset [4]. Specifically, we aim to investigate the predictive efficacy of LLM embeddings on a psychology domain and analyze their characterization of key psychology dimensions across each trait. In addition to this, we also investigate the benchmark performance-cost trade-offs across model architecture.

Julina Maharjan, Ruoming Jin, Jianfeng Zhu, Deric Kenne

J Med Internet Res 2025;27:e75347


Exploring Inflammatory Bowel Disease Discourse on Reddit Throughout the COVID-19 Pandemic Using OpenAI’s GPT-3.5 Turbo Model: Classification Model Validation and Case Study

Exploring Inflammatory Bowel Disease Discourse on Reddit Throughout the COVID-19 Pandemic Using OpenAI’s GPT-3.5 Turbo Model: Classification Model Validation and Case Study

Another study by Rohde et al [8] characterized topics associated with IBD and distress on Reddit and Twitter, finding that symptoms (n=23,294, 47.8%) and medication (n=12,218, 30.1%) were the most prevalent topics. Additionally, a 2023 study by Stemmer et al [9] analyzed the content and sentiments expressed in posts by patients with IBD, revealing that they expressed more sadness and fear compared with a control group of healthy users.

Tyler Babinski, Sara Karley, Marita Cooper, Salma Shaik, Y Ken Wang

J Med Internet Res 2025;27:e53332