TY - JOUR AU - Ahmed, Arfan AU - Aziz, Sarah AU - Khalifa, Mohamed AU - Shah, Uzair AU - Hassan, Asma AU - Abd-Alrazaq, Alaa AU - Househ, Mowafa PY - 2022 DA - 2022/3/11 TI - Thematic Analysis on User Reviews for Depression and Anxiety Chatbot Apps: Machine Learning Approach JO - JMIR Form Res SP - e27654 VL - 6 IS - 3 KW - anxiety KW - depression KW - chatbots KW - conversational agents KW - topic modeling KW - latent Dirichlet allocation KW - thematic analysis KW - mobile phone AB - Background: Anxiety and depression are among the most commonly prevalent mental health disorders worldwide. Chatbot apps can play an important role in relieving anxiety and depression. Users’ reviews of chatbot apps are considered an important source of data for exploring users’ opinions and satisfaction. Objective: This study aims to explore users’ opinions, satisfaction, and attitudes toward anxiety and depression chatbot apps by conducting a thematic analysis of users’ reviews of 11 anxiety and depression chatbot apps collected from the Google Play Store and Apple App Store. In addition, we propose a workflow to provide a methodological approach for future analysis of app review comments. Methods: We analyzed 205,581 user review comments from chatbots designed for users with anxiety and depression symptoms. Using scraper tools and Google Play Scraper and App Store Scraper Python libraries, we extracted the text and metadata. The reviews were divided into positive and negative meta-themes based on users’ rating per review. We analyzed the reviews using word frequencies of bigrams and words in pairs. A topic modeling technique, latent Dirichlet allocation, was applied to identify topics in the reviews and analyzed to detect themes and subthemes. Results: Thematic analysis was conducted on 5 topics for each sentimental set. Reviews were categorized as positive or negative. For positive reviews, the main themes were confidence and affirmation building, adequate analysis, and consultation, caring as a friend, and ease of use. For negative reviews, the results revealed the following themes: usability issues, update issues, privacy, and noncreative conversations. Conclusions: Using a machine learning approach, we were able to analyze ≥200,000 comments and categorize them into themes, allowing us to observe users’ expectations effectively despite some negative factors. A methodological workflow is provided for the future analysis of review comments. SN - 2561-326X UR - https://formative.jmir.org/2022/3/e27654 UR - https://doi.org/10.2196/27654 UR - http://www.ncbi.nlm.nih.gov/pubmed/35275069 DO - 10.2196/27654 ID - info:doi/10.2196/27654 ER -