Accessibility settings

Published on in Vol 10 (2026)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/86025, first published .
Detection of Posttraumatic Stress Disorder With Rest-Activity Data: Machine Learning Approach Using Wearable and Self-Report Data

Detection of Posttraumatic Stress Disorder With Rest-Activity Data: Machine Learning Approach Using Wearable and Self-Report Data

Detection of Posttraumatic Stress Disorder With Rest-Activity Data: Machine Learning Approach Using Wearable and Self-Report Data

1Department of Psychology, University of California, Irvine, 214 Pereira Dr, Irvine, CA, United States

2Department of Behavioral Sciences and Psychiatry, Rush University Medical Center, Chicago, IL, United States

3Department of Pediatrics, Johns Hopkins Medicine, Baltimore, MD, United States

4Department of Psychiatry, University of Michigan Medicine, Ann Arbor, MI, United States

Corresponding Author:

Katherine Wislocki, MA


Background: Growing evidence suggests that disruptions in rest-activity rhythms may serve as relevant markers of posttraumatic stress disorder (PTSD). Despite the emergence of machine learning methods applied to actigraphy and self-report data, few studies have used these approaches to identify individuals with clinically diagnosed PTSD. Prior work has focused on predicting probable PTSD based on self-report measures, yet discrepancies exist between clinical diagnoses and probable PTSD derived from self-reports.

Objective: This study explored whether wrist actigraphy and sleep logs could be used to accurately predict clinician-rated PTSD diagnosis and probable diagnosis of PTSD based on established self-report cutoffs (PTSD Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition [PCL-5] ≥31 and ≥38) among trauma-exposed service members and veterans. We also explored which features were most strongly predictive of each outcome and whether models were able to predict PTSD diagnosis even when accounting for other mental health disorders.

Methods: Wrist actigraphy data and daily sleep logs were collected over 1 week from trauma-exposed male service members and veterans (N=36; mean age 41, SD 5.3 y). Candidate features were identified using univariate feature selection. Extreme gradient boosting models were trained using leave-one-subject-out cross-validation to predict the diagnosis of PTSD and probable diagnosis of PTSD based on 2 self-report cutoffs (PCL-5≥31 and ≥38). Performance metrics were then calculated at the person level. Linear regression was used to assess the discriminant validity of model-predicted scores and each PTSD outcome specifically, relative to other mental health diagnoses.

Results: Machine learning models predicting PTSD diagnosis and probable PTSD based on the PCL-5≥31 threshold demonstrated satisfactory performance in this sample. The diagnosis model achieved an area under the curve (AUC) of 0.83 (95% CI 0.61‐1.00), with high accuracy (88%) and specificity (96%) and moderate sensitivity (63%). The PCL-5≥31 model yielded comparable performance (AUC=0.84, 95% CI 0.71‐0.98) with balanced sensitivity (73%) and specificity (82%). For both models, a combination of subjective and objective features was the most impactful. These models were able to predict PTSD even when accounting for non-PTSD mental health diagnoses, as model-predicted scores were significantly associated with 2 outcomes: clinician-rated PTSD (B=0.19; P=.002) and probable PTSD based on a PCL-5≥31 cutoff (B=0.24; P=.003). In contrast, the model predicting probable PTSD based on the PCL-5≥38 threshold performed poorly (AUC=0.47, 95% CI 0.24‐0.69), with a nonsignificant relationship between model-predicted scores and the outcome (B<0.01; P=.89).

Conclusions: Both subjective and objective rest-activity features may improve the prediction of PTSD. Further research is needed to validate these findings and explore the use of integrating wearable sensor data and subjective information to support PTSD assessment.

JMIR Form Res 2026;10:e86025

doi:10.2196/86025

Keywords



The circadian system is the intricate internal clock that aligns various biological processes in accordance with the 24-hour cycle of day and night [1]. Disruptions in one’s circadian rhythm can impact the functioning of different physiological processes, including those related to sleep regulation, metabolic functioning, and hormone secretion [1]. Increasing research has demonstrated evidence of circadian disruption, or “disturbances in biological timing,” in individuals experiencing mental health issues, including depression [2], bipolar disorder [3], schizophrenia [4], and posttraumatic stress disorder (PTSD) [5,6]. Past work has indicated that circadian disruption may be central to the development and maintenance of PTSD [1,7-9]. Trauma exposure, and the onset of traumatic stress, may result in short- and long-term circadian disruption [1,8]. In turn, circadian disruption is thought to potentially mediate the neurobiological consequences associated with traumatic stress [1,8,9].

Research has increasingly focused on using objective forms of measurement to capture information regarding circadian disruption through individual rest-activity rhythms [1,10]. This has become more feasible with the introduction of technology capable of passively sensing objective data related to rest and activity [11]. Actigraphy is a promising tool for passively sensing information regarding rest and activity patterns [11], and research has demonstrated that actigraphy can reliably capture certain objective information about rest and activity [12]. In addition, wrist actigraphy provides greater objectivity in capturing sleep characteristics than self-report [13]. For example, a prior study comparing the use of self-reported sleep characteristics (eg, sleep onset, sleep offset, and nightly awakenings) and actigraphy to capture nightly sleep information in a veteran sample demonstrated that less than half of participants demonstrated adequate reliability between self-report of sleep characteristics and sleep characteristics captured through actigraphy [13]. By providing continuous, real-time data on physical activity and sleep, actigraphy offers an approach to feasibly understanding information about rest-activity rhythms in individuals with PTSD.

Historically, actigraphy data have been used to understand sleep-related impairments in individuals with PTSD [11,14, 15, 16, 17]. Numerous studies have found little difference in sleep-related actigraphy measures between individuals with and without PTSD, including total sleep time [17], sleep onset latency [16,17], nocturnal awakenings [16,17], and sleep efficiency [15,17]. In contrast, some research has demonstrated specific differences in rest-activity rhythms in individuals with PTSD compared to other clinical populations and healthy controls [5,18], such as significantly greater frequency of awakenings during sleep and variability in daily activity levels [18]. Beyond specific rest-activity metrics, the timing of rest and activity patterns is also particularly important for identifying and understanding rest-activity rhythms among individuals with PTSD symptoms. For example, Sandahl et al [6] previously found a significant positive association between PTSD symptoms and the timing of the onset of the most active period of the day, such that individuals with more severe PTSD symptoms were more likely to be most active later in the day compared to earlier in the day. In contrast to healthy controls, individuals with PTSD have also been shown to demonstrate lower relative amplitude (ie, the ratio between daytime and nighttime activity levels) and day-to-day stability in activity [5]. A recent meta-analysis of 17 studies encompassing 1847 participants demonstrated that relative amplitude was significantly related to PTSD symptoms [19]. As such, differences in rest-activity rhythms, as well as the variability in those rhythms, may be helpful in identifying individuals with PTSD.

Actigraphy data can provide access to thousands of measurements per day related to rest, sleep, and light exposure [10,20]. Common measures related to rest-activity rhythms can be nonlinear (eg, acrophase) or linear (eg, mean activity during sleep). Analytic methods must be able to effectively accommodate the breadth, scope, and nature of actigraphy data. Machine learning methods have been used to analyze actigraphy data [21-23]. Past work has used health data (eg, self-report and medical record data) and actigraphy data to help identify individuals with mental health conditions other than PTSD, such as depression, through machine learning [22]. However, limited previous research has used actigraphy data and machine learning to successfully predict PTSD-related outcomes [21,24]. Accurately identifying individuals with a PTSD diagnosis is critical for efforts to assess and intervene to address their symptoms. Rest-activity data may provide insights into the assessment of PTSD symptoms beyond traditional assessment methods, providing an opportunity to differentiate individuals with and without PTSD with reduced burden.

One past study used wrist actigraphy data to predict probable PTSD (ie, defined as PTSD Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition [PCL-5] scores≥31) in a national sample 2 months following exposure to traumatic events, resulting in a mean accuracy rate of 56% and a mean area under the curve (AUC) value of 0.56 [21]. Rest-activity features, including intradaily variability of activity periods, interdaily stability in rest periods, interdaily stability in activity periods, and circadian rhythm strength, were among the most important actigraphy features in predicting different PTSD outcomes at 8 weeks [21]. When combining both baseline self-report surveys focused on stress and trauma, and wrist actigraphy data, performance in predicting probable PTSD increased (AUC=0.73) [21]. However, the nationwide sample from this study obfuscates how rest-activity data can be used to assess PTSD among other populations, including those at greater risk of experiencing trauma exposure and developing PTSD, such as veterans [25]. More research is needed to understand whether the use of actigraphy data can effectively differentiate individuals with and without PTSD, particularly in individuals with clinically diagnosed PTSD in populations at higher risk for PTSD.

Notably, past research has used a common threshold for establishing a probable PTSD diagnosis through self-report (ie, PCL-5≥31). Research has demonstrated that self-report assessments of PTSD may not always align with clinical assessments of PTSD [26], and self-reported PTSD symptoms are highly variable across time [27,28]. Furthermore, existing research has established a range of “optimal” diagnostic cutoff scores depending on the sample and context of the validation study (eg, PCL-5 scores ranging from 22‐49) [29]. As a result, predicting a probable diagnosis based on a self-report cutoff may not fully reflect the effectiveness of actigraphy in identifying individuals who may meet formal diagnostic criteria for PTSD. Moreover, probable diagnoses based on self-report cutoffs and predicting clinical diagnoses based on clinician-administered interviews represent distinct outcomes; for example, self-report scores offer greater insights into symptom severity, while clinician-administered interviews represent formal clinical evaluations of symptoms based on existing diagnostic standards.

This study is a secondary analysis of wrist actigraphy data and sleep logs in trauma-exposed service members and veterans. Information from wrist actigraphy data and sleep logs from trauma-exposed service members and veterans were used to predict PTSD diagnosis through diagnostic interviews and probable PTSD diagnosis through self-report (PCL-5≥31 and ≥38). While past research has demonstrated a range of potential self-report cutoff scores for identifying PTSD based on self-report (eg, 22‐49), PCL-5 cutoffs of 31 and 38 have been identified as potential diagnostic cutoffs in military populations [29]. We hypothesized that the prediction of PTSD diagnosis and probable PTSD diagnosis would be fair (AUC≥0.70). Feature selection was conducted to assess the most important features for predicting PTSD diagnosis through diagnostic interviews and probable PTSD diagnosis through self-report. To determine whether models could uniquely predict PTSD rather than mental health symptoms more broadly, the associations between co-occurring mental health diagnoses and model predictions were assessed to understand the discriminant validity of wrist actigraphy and sleep logs in differentiating those with PTSD and other mental health concerns. It was hypothesized that there would be adequate discriminant validity, demonstrated through stronger relationships between PTSD diagnosis and probable PTSD diagnosis based on self-report and model-predicted scores, compared to the relationship between non-PTSD mental health diagnosis and model-predicted scores for each model.


Procedures

This study was a secondary analysis of data from a study focused on the intergenerational transmission of trauma among service members, veterans, and their families. Military families (N=36; ie, composed of a male-identifying veteran, one of the veteran’s biological children, and the child’s biological mother) were recruited through several methods, including digital recruitment via social media; ResearchMatch; listservs; and in-person recruitment through schools, local communities, and resource fairs. Participants were eligible to participate if they were male veterans or service members deployed in service of US military, had previously been exposed to a Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) criterion A traumatic event during military service, had a biological child aged 7 to 16 years, were at least aged 18 years, and were fluent in English. Though the initial inclusion criteria required participants to have an index trauma within the context of their military service (determined via phone screening), several participants identified a different index trauma during their clinical interview. These participants were retained in the sample given that all participants had a history of military-related criterion A trauma. Participants were excluded if they had a lifetime history of mania or psychosis, significant cognitive impairment, significant suicidality (defined as suicidal ideation or behavior in the past 6 months), or significant unstable illness. The analysis sample included 33 participants, who had a mean age of 41.36 (SD 5.28) years and an average PCL-5 score of 22.88 (SD 19.04). A total of 8 (24.24%) participants met the criteria for PTSD diagnosis, while 11 (33.33%) and 7 (21.21%) participants met or exceeded the PCL-5 cutoffs of 31 and 38, respectively. Additional information regarding the analysis sample can be found in Table 1.

Table 1. Demographic characteristics of participants (n=33)a.
VariableParticipants, n (%)
Race
Asian2 (6.06)
Black or African American2 (6.06)
Mixed race or other4 (12.12)
White25 (75.76)
Ethnicity
Hispanic, Latino, or Spanish origin7 (21.21)
Employment status
Employed full time27 (81.81)
Retired1 (3.03)
Student and employed2 (6.06)
Full-time student3 (9.09)
Highest degree earned
 High school diploma or GEDb5 (15.15)
 Some college or associate’s degree5 (15.15)
 Baccalaureate degree14 (42.42)
 Master’s degree9 (27.27)
Annual income (US $)
0-49,9992 (6.06)
50,000-99,99911 (33.33)
100,000-149,99911 (33.33)
150,0009 (27.27)
Index trauma on SCID-5c
Combat29 (87.87)
Accidental or violent death3 (9.09)
Physical assault1 (3.03)
Non-PTSDd mental health diagnosis9 (27.27)
Current mental health diagnoses
PTSD8 (24.24)
Major depressive disorder4 (12.12)
Generalized anxiety disorder4 (12.12)
Social anxiety disorder4 (12.12)
Alcohol use disorder3 (9.09)
Specific phobia2 (6.06)
Persistent depressive disorder1 (3.03)
Obsessive-compulsive disorder1 (3.03)
Agoraphobia1 (3.03)
PCL-5e
≥3111 (33.33)
≥387 (21.21)

aIndex trauma information was not available through the SCID-5–Research Version assessment for 1 participant as they were unable to complete it. As such, index trauma for this participant was established through the PCL-5.

bGED: General Education Development.

cSCID-5: Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition.

dPTSD: posttraumatic stress disorder.

ePCL-5: Posttraumatic Stress Disorder Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition.

Participants were initially scheduled for a 1-hour virtual onboarding session and were shipped a research kit containing an actigraphy watch. During the onboarding session, participants received instructions on how to wear the actigraphy watches and complete daily sleep logs during the 1-week study period. At the end of the onboarding session, participants were emailed self-report assessments for them to complete. Throughout the week, participants received reminders from study staff to complete the self-report assessments. Participants were also scheduled for a diagnostic interview consisting of the Structured Clinical Interview for DSM-5 Disorders–Research Version (SCID-5-RV) [30] with a trained graduate student assessor during the 1-week study period. At the end of the 1-week period, participants mailed the actigraphy watches back to the study team. The actigraphy data were processed by the study team using the Philips Actiware 6.1.0 program (Philips Respironics).

Wrist actigraphy data and sleep logs from participants were used to derive daily features related to individual sleep (eg, sleep efficiency, wake after sleep onset [WASO], subjective sleep quality, and subjective restfulness), activity (eg, mean and SD of activity), and rest-activity patterns (eg, acrophase time and intradaily stability). Given the small sample size, candidate features were identified through univariate feature selection using aggregated features from the full sample. In turn, extreme gradient boosting (XGBoost) models used daily features to try to differentiate individuals with and without diagnosed PTSD and probable PTSD based on self-report cutoffs. Model performance was then evaluated at the person-level across models. The most important predictive features for each model were identified using mean Shapley Additive Explanations (SHAP) values and mean information gain values. To determine the discriminant validity of the model in predicting PTSD diagnosis rather than mental health problems more generally, linear regression analysis was used to assess the relationship between PTSD diagnosis (ie, based on clinical diagnosis or probable diagnosis via self-report) and model-predicted scores while accounting for other non-PTSD mental health diagnoses and other relevant covariates (ie, age).

Ethical Considerations

Informed consent was obtained from all participants during their onboarding session. Participants were compensated up to US $220 for their participation in this study. Procedures for this study were approved by the institutional review board at the University of California, Irvine (20184726). Additional approval was not required for secondary data analysis. Participant identifiers were stored on encrypted servers, separate from deidentified research data.

Measures

PTSD Checklist for DSM-5

The PCL-5 assesses the severity of PTSD symptoms in the past month based on an individual’s self-reported index trauma [31]. Although all participants experienced trauma within the context of their military service, participants were permitted to identify an index trauma related or unrelated to their military service. The PCL-5 was used to evaluate an individual’s PTSD symptom severity in the past month at baseline. PCL-5 scores greater than or equal to 31 and 38 were evaluated as probable PTSD diagnosis cutoff. Past research has identified various optimal PCL-5 cutoff scores for identifying individuals who may meet the criteria for PTSD, including PCL-5≥31 and PCL-5≥38 in military populations [29].

Structured Clinical Interview for DSM-5 Disorders–Research Version

The SCID-5-RV was conducted by trained graduate-level clinical assessors to assess lifetime and current mental health diagnoses [30]. All graduate students conducting assessments received training from the lead author, a clinical psychologist. The assessors conducted a mock screening that was reviewed by the lead author and rated for fidelity prior to conducting study assessments. The lead author also watched and rated recordings of the first study assessment for all assessors to ensure fidelity. Subsequently, the assessors received regular supervision to ensure consistency across raters. Although uncommon, participants were permitted to identify an index trauma through the SCID-5-RV assessment that differed from the index trauma reported through the PCL-5. SCID-5-RV ratings were used to establish a current PTSD diagnosis. These ratings were also used to establish any current diagnosis other than PTSD, including major depressive disorder, persistent depressive disorder, hypomanic episode, manic episode, cyclothymic disorder, generalized anxiety disorder, panic disorder, agoraphobia, social anxiety disorder, specific phobia, obsessive-compulsive disorder, and substance use disorders (coded as 1 for any current non-PTSD diagnosis or 0 for no current non-PTSD diagnosis). Additionally, the SCID-5-RV was used to identify and exclude individuals with a lifetime history of psychosis-spectrum symptoms.

Wrist Actigraphy

Wrist actigraphy data were captured for a period of seven days through wrist actigraphy watches (Actiwatch Spectrum Plus, Philips Respironics) worn by participants. Wrist actigraphy watches measured activity and sleep data in 30-second intervals across the study period. Epoch-level data for each participant were exported from the Actiware program. The following features were derived from wrist actigraphy data: mesor, acrophase time, amplitude, relative amplitude, mean activity, SD of activity, root mean square successive difference of activity, intradaily variability of activity, interdaily stability of activity, mean activity during the 10 most active hours of the day (M10), mean activity during the 5 least active hours of the day (L5), circadian rhythm strength, total sleep time, sleep efficiency, WASO, and fragmentation. Total sleep time, sleep efficiency, WASO, and fragmentation were calculated through the Philips Actiware 6.1.0 program, while relative amplitude, mean activity, SD of activity, root mean square successive difference of activity, intradaily variability of activity, interdaily stability of activity, mean activity during the 10 most active hours of the day (M10), mean activity during the 5 least active hours of the day (L5), and circadian rhythm strength were calculated through R version 4.3.1 (R Foundation for Statistical Computing) [32]. Mesor, acrophase time, and amplitude were calculated using the CosinorPy package in Python (Python Software Foundation) [33]. Definitions of each feature can be found in Table S1 in Multimedia Appendix 1.

Sleep Logs

Participants rated their subjective sleep quality and restfulness each day during the study using a 1-10 scale (1=“worst” to 10=“best”). Sleep logs were completed each morning after waking.

Analysis

All preprocessing and analysis were conducted in Python [34] and R statistical software [32]. Wrist actigraphy data were used to compute daily features for each individual. Features derived from cosinor-based rhythmometry and the engineering of rest-activity data were used as predictors in each model (see “Measures” section). One participant was excluded from all analyses due to night shift work (n=1), 2 participants were excluded based on a lifetime history of psychosis-spectrum symptoms (n=2), and 1 participant was excluded from diagnostic-specific analysis as they did not complete a SCID-5-RV assessment (n=1). Two participants completed the study during a time change (ie, daylight savings time; n=2). As such, days within 2 days after the initial time change were excluded [35]. Actigraphy data from days with increased off-wrist time (ie, ≥30% off-wrist time) were excluded from analyses. Days with missing sleep log data (n=1) were excluded from analyses. Furthermore, days in which circadian rhythm measures (eg, acrophase time, mesor, and amplitude) were calculated over a period of less than 24 hours were excluded to ensure the reliability of feature estimation. Because of this, the first study day (day 1) and the last study day (day 7) were removed, resulting in 5 days of available data per person. This resulted in approximately 165 days of data for each model. Correlation analysis between variables is presented in Table 2.

Table 2. Correlations between rest-activity features and posttraumatic stress disorder (PTSD) outcomesa.
VariablesPTSD diagnosisPCL-5b totalPCL-5≥ 31PCL-5≥ 38Non-PTSD diagnosis
PTSD diagnosis
PCL-5 total.50c
PCL-5≥31.39d.87e
PCL-5≥38.46c.81e.73e
Non-PTSD diagnosis.28.24.14.18
Mesor.04.15.18–.02.18
Acrophase time.33–.04–.23–.08.13
Amplitude–.01.05.08–.12.10
Relative amplitude.07–.13–.16–.22–.27
Mean activity.07.20.26.02–.01
SD activity.04.10.17–.02–.05
CRSf.12–.06–.06–.16–.14
M10g.10.18.19–.01.04
L5h–.20.07.13.06.22
RMSSDi.03.07.18–.02–.02
TSTj.14–.18–.19–.12–.22
IVk.06.17.11.34.14
ISl–.18–.18–.03–.32–.10
Sleep quality rating–.38d–.29–.33–.21–.05
Restfulness rating–.43d–.44d–.46c–.40d–.11
WASOm.26.13.15.10.06
Efficiency–.04–.36d–.35d–.13–.31
Fragmentation.21.35d.28.23.34

aCorrelations were calculated among averaged daily features and mental health variables.

bPCL-5: Posttraumatic Stress Disorder Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition.

cP<.01.

dP<.05.

eP<.001.

fCRS: circadian rhythm strength.

gM10: mean activity during the 10 most active hours of the day.

hL5: mean activity during the 5 least active hours of the day.

iRMSSD: root mean square of successive differences.

jTST: total sleep time.

k IV: intradaily variability.

lIS: interdaily stability.

mWASO: wake after sleep onset.

Due to the limited sample size, univariate feature selection using the F-score was conducted on aggregated features for the full sample. Features with higher average F-scores were considered to have stronger discriminatory power in predicting each outcome. Given the small sample size, the top 3 features identified through this procedure, with no elevated correlation with another selected feature, were then used across all XGBoost model training folds for each outcome. (ie, r≥0.70; Table S2 in Multimedia Appendix 2). For example, this led to the exclusion of the subjective sleep quality rating when the restfulness rating was included, as these features were highly correlated with each other (r=0.91). Univariate relationships between each outcome and aggregated features across the full sample are presented in Multimedia Appendix 3. Given the small sample size, a total of 3 features were selected per model to maximize stability in model performance and mitigate model overfitting (Table S3 in Multimedia Appendix 3).

XGBoost models used selected day-level features from wrist actigraphy data and sleep logs to predict PTSD diagnosis and probable PTSD diagnosis via validated PCL-5 cutoffs (ie, PCL-5≥31 and PCL-5≥38). XGBoost is a machine learning algorithm that uses an ensemble learning method by leveraging the performance of individual decision trees to make predictions [36]. In comparison to deep learning models, XGBoost is often more parsimonious and computationally efficient while providing robust and accurate predictions in a variety of contexts, including high-dimensional data and data containing nonlinear relationships [36]. Prior work has used XGBoost to predict mental and physical health outcomes based on passively sensed data [37,38]. Furthermore, past research by Choi et al [22] demonstrated that in comparison to other machine learning and statistical models tested, XGBoost provided the most accurate performance in predicting depression diagnosis based on actigraphy data.

Training and testing data were transformed using a Yeo-Johnson transformation prior to being used to train and test each XGBoost model. Hyperparameters were optimized using randomized grid search. Information about hyperparameters for each model can be found in Table S4 in Multimedia Appendix 4. XGBoost models were trained using leave-one-subject-out cross-validation. In each fold, 1 participant’s day-level data served as the test set, while day-level data from the remaining participants was used for model training. This process was repeated until each participant’s data was used in testing, ensuring that no participant’s data appeared in both training and testing within any fold. Model-predicted scores were then averaged within-person for each XGBoost model and performance statistics for each model were calculated, including area under the receiver operating characteristic curve (accuracy, sensitivity, specificity, precision, and F1-score). Mean SHAP and information gain values were computed and presented to evaluate the importance of each feature in predicting the outcome. It was hypothesized that the prediction of PTSD diagnosis and probable PTSD diagnosis based on wrist actigraphy and sleep log variables would be fair in this study (AUC≥0.70). SHAP values correspond to the marginal importance of each feature, in context with all other features, in establishing each model prediction [39]. Information gain values reflect the value of each feature in reducing model entropy. Bootstrap tests (using 2000 iterations) were used to compare the AUC values for each model.

Discriminative performance of model predictions in specifically predicting diagnosed and probable PTSD, rather than other mental health outcomes, was assessed using linear regression. For each outcome, a linear regression was conducted using each PTSD-specific outcome (ie, diagnosed PTSD and probable PTSD) to predict XGBoost model-predicted probability scores per person, while covarying for other non-PTSD mental health diagnoses (1=“present” or 0=“absent”). It was hypothesized that there would be adequate discriminant validity, demonstrated through stronger relationships between PTSD diagnosis and probable PTSD diagnosis based on self-report and mean model-predicted probability scores, compared to the relationship between non-PTSD mental health diagnoses and mean model-predicted probability scores for each model. As past work has demonstrated a significant association between PTSD risk and age among male-identifying veterans [40], age was additionally included as a covariate in each linear regression model. It was hypothesized that there would be adequate discriminant validity, demonstrated through stronger relationships between PTSD diagnosis and probable PTSD diagnosis based on self-report and model-predicted scores, compared to the relationship between non-PTSD mental health diagnoses and model-predicted scores for each model.

Logistic regression was additionally used to further evaluate the relationships between the identified features and each outcome. The identified features were averaged within-person and then entered into a logistic regression model for each outcome. Model fit was evaluated using the Nagelkerke R².


Overview

Correlations between features and outcomes can be found in Table 2. Feature selection results are presented in Table S3 in Multimedia Appendix 3. PTSD diagnosis was strongly correlated with total PCL-5 scores (r=0.50; P=.003). There was a marginally significant association between current PTSD diagnosis and the PCL-5≥31 cutoff (χ²1=3.10; P=.08). There was a significant relationship between current PTSD diagnosis and the PCL-5≥38 cutoff (χ²1=4.38; P=.04). Logistic regression results are summarized in Table S5 in Multimedia Appendix 5.

Diagnosed PTSD

Three features were selected for the PTSD diagnosis model: restfulness rating, WASO, and acrophase time (Table S3 in Multimedia Appendix 3). The model performance metrics are presented in Table 3.

The diagnostic model achieved a satisfactory AUC of 0.83 (95% CI 0.61‐1.00), an accuracy of 88%, a sensitivity of 63% (5/8), a specificity of 96% (24/25), a precision of 83%, and an F1-score of 0.71. As performance metrics were calculated per person, the precision of these metrics (ie, demonstrated by broad CIs) was impacted by the smaller sample size. Restfulness rating was the most influential predictor (mean SHAP=0.90; information gain=0.39), followed by WASO (SHAP=0.46; information gain=0.32) and acrophase time (SHAP=0.49; information gain=0.30).

Table 3. Extreme gradient boosting (XGBoost) model results.
Metric or featurePTSD Diagnosis ModelPCL-5a≥31 ModelPCL-5≥38 Model
AUCb (95% CI)0.83 (0.61‐1.00)0.84 (0.71‐0.98)0.47 (0.24‐0.69)
Accuracy0.880.790.61
Sensitivityc0.630.730.43
Specificityd0.960.820.65
F1-scoree0.710.700.32
Precisionf0.830.670.25
Mean SHAPgh
 Feature 1Rest Rating=0.90Efficiency=1.38Rest Rating=0.42
 Feature 2Acrophase=0.49Fragmentation=1.35Fragmentation=0.26
 Feature 3WASOi=0.46Rest Rating=0.92IVj=0.20
Mean information gaink
 Feature 1Rest Rating=0.39Efficiency=0.48Rest Rating=0.60
 Feature 2WASO=0.32Fragmentation=0.38IV=0.20
 Feature 3Acrophase=0.30Rest Rating=0.14Fragmentation=0.20

aPCL-5: Posttraumatic Stress Disorder Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition.

bAUC: area under the curve.

cSensitivity is the proportion of true positives correctly identified.

dSpecificity is the proportion of true negatives correctly identified.

eF1-score is the harmonic mean of precision and recall.

fPrecision is the proportion of predicted positives that are true positives.

gSHAP: Shapley Additive Explanations.

hMean SHAP quantifies the average marginal contribution of each feature to individual predictions. Mean SHAP for each feature is organized in rank from largest to smallest.

iWASO: wake after sleep onset.

jIV: intradaily variability.

kMean information gain reflects how much a feature reduces model entropy when used to split the data during training. Mean information gain for each feature is organized in rank from largest to smallest.

Linear regression analysis revealed a significant positive association between model-predicted scores and clinician-rated PTSD diagnosis (B=0.19, P.002), such that individuals with PTSD had predicted probability scores 0.19 points higher on average than those without (Table 4).

However, the model explained only 21% of the variance in predicted scores (adjusted =0.21). Age and current non-PTSD mental health diagnosis were not significant predictors (P values>.05; Table 4).

Table 4. Relationship between model-predicted score and posttraumatic stress disorder (PTSD) outcomes.
VariableaBbSEtP valueR²c
PTSD diagnosis Model0.21d
Intercept0.420.192.24.03d
PTSD diagnosis0.190.063.40.002e
Non-PTSD mental health diagnosis−0.050.050.91.37
Age (years)<−0.01<0.010.35.73
Probable PTSD (PCL-5f≥31) Model0.21d
Intercept0.430.281.55.13
Probable PTSD (PCL-5≥31)0.240.073.30.003e
Non-PTSD mental health diagnosis−0.030.070.44.66
Age (years)<−0.01<0.01−0.47.64
Probable PTSD (PCL-5≥38) Model<0.01
Intercept0.360.142.53.02d
Probable PTSD (PCL-5≥38)<0.010.040.14.89
Non-PTSD mental health diagnosis0.020.040.46.65
Age (years)<0.01<0.010.67.51

aLinear regression was used to assess the relationship between mean predicted scores from XGBoost models and each outcome (ie, PTSD diagnosis, probable PTSD diagnosis [PCL-5≥31], and probable PTSD diagnosis [PCL-5≥38]), while accounting for age and current non-PTSD mental health diagnosis.

bB values reflect the change in average XGBoost predicted score.

cR2 values presented are adjusted.

dP<.05.

eP<.01.

fPCL-5: Posttraumatic Stress Disorder Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition.

Probable PTSD Diagnosis (PCL-5≥31)

Three features were selected for the PCL-5≥31 model: efficiency, fragmentation, and restfulness rating (Table S3 in Multimedia Appendix 3). Model performance was satisfactory (Table 3), with an AUC of 0.84 (95% CI 0.71‐0.98), accuracy of 79%, sensitivity of 73% (8/11), specificity of 82% (18/22), precision of 67%, and an F1-score of 0.70. A bootstrap test indicated that the AUC of the PCL-5≥31 model was not significantly different from that achieved by the PTSD diagnosis model (P=.89). Efficiency was the strongest predictor (SHAP=1.38; information gain=0.48), followed by fragmentation (SHAP=1.35; information gain=0.38) and restfulness rating (SHAP=0.92; information gain=0.14).

Linear regression results indicated a significant positive relationship between predicted probability scores and probable PTSD status (B=0.24; P.003), with individuals meeting the PCL-5≥31 threshold showing higher predicted scores, around .24 points on average, than those who did not meet this threshold (Table 4). The model accounted for approximately 21% of the variance in predicted scores (adjusted =0.21). Similar to the diagnosis model, age and non-PTSD diagnoses were not significant predictors (P values>.05).

Probable PTSD Diagnosis (PCL-5≥38)

Three features were selected for the PCL-5≥38 model: restfulness rating, intradaily variability, and fragmentation. As shown in Table 3, this model exhibited the poorest performance, with an AUC of 0.47 (95% CI 0.24‐0.69), accuracy of 61%, sensitivity of 43% (3/7), specificity of 65% (17/26), precision of 25%, and an F1-score of 0.32. Bootstrap tests demonstrated that the AUC of the PCL-5≥38 model was significantly lower than the AUC of the PCL-5≥31 model (P=.004) and the PTSD diagnosis model (P=.02). Restfulness rating remained the top predictor (SHAP=0.42; information gain=0.60), followed by intradaily variability (SHAP=0.20; information gain=0.20) and fragmentation (SHAP=0.26; information gain=0.20).

Linear regression revealed no significant association between predicted probability scores and probable PTSD status at the PCL-5≥38 threshold (B<0.01; P=.89), with the model explaining a negligible proportion of variance in predicted scores (<0.01; Table 4). Age and non-PTSD mental health diagnoses were also not significant predictors (P values>.05).


This is the first study to explore whether rest, activity, and rest-activity rhythm data, derived from actigraphy and sleep logs, could be used to predict diagnosed PTSD and probable PTSD based on commonly used cutoffs on the PCL-5. The PTSD diagnosis model and the PCL-5≥31 model showed satisfactory performance across model metrics, including accuracy, sensitivity, specificity, and F1-scores. Bootstrapped tests revealed no significant differences in AUC values across the diagnosis and PCL-5≥31 models. However, performance estimates should be interpreted with caution, due to the nature of the study and the impact of the smaller sample size on estimate precision. Moreover, predicted scores from both models showed a significant positive association with PTSD diagnosis, while accounting for non-PTSD mental health diagnoses. This indicates that average model-predicted probability scores were significantly associated with PTSD diagnosis or probable diagnosis and were not merely predictive of the presence of mental health problems more broadly. However, adjusted R2 values reflect a large portion of unexplained variance in model-predicted probability scores. The findings align with results from a recent meta-analysis examining the performance of tree-based machine learning models in predicting PTSD diagnosis in US military samples (pooled AUC: 0.75; range=0.57-0.92) [41]. Notably, results from these past studies were based on the use of different markers to predict PTSD, namely self-report assessments [42], speech [43], and heart rate data [44]. Prior studies using passively sensed rest-activity and heart rate data to predict probable PTSD (PCL-5≥31) in a national sample of trauma-exposed adults have shown relatively lower model performance (AUCs of 0.64 and 0.56) [21,24]. Greater predictive performance has been achieved in past work using both wrist actigraphy and self-report mental health surveys to predict probable PTSD (AUC=0.73) [21]. In line with this, this exploratory study leveraged both actigraphy data and sleep log data, potentially contributing to improved predictive performance. Notably, the integration of sleep logs may provide a more efficient way to capture important information for predicting PTSD compared to longer self-report batteries. It may be that a combination of both self-report data and wrist actigraphy data can provide greater predictive performance in identifying PTSD through clinical diagnosis and probable PTSD through self-report.

The features that were used as predictors varied across the models, with the top 3 features retained in each to ensure model stability and mitigate overfitting. In the PTSD diagnosis model, subjective restfulness was the most influential predictor. WASO and acrophase time also contributed, suggesting that both sleep quality and circadian rhythm markers are relevant to predicting PTSD diagnosis. In the PCL-5≥31 model, sleep efficiency was the strongest predictor, followed closely by fragmentation, and subjective restfulness. In the PTSD diagnosis model and the PCL-5≥31 model, measures of both subjective and objective features contributed to the overall model prediction. Additionally, among the objective markers, features associated with sleep quality (WASO, sleep efficiency, and fragmentation) were consistently identified as top features. This is consistent with past research, which has demonstrated relationships between PTSD symptoms and the features retained across models in this study, including relationships between PTSD symptoms and markers of sleep fragmentation [18], WASO [19], intradaily variability [18], and acrophase time [45]. However, in this study, feature selection was performed using the entire sample to ensure stability in features. As a result, model performance is likely overly optimistic.

The results of this preliminary study suggest that combining passively sensed sleep data and sleep log data may enhance PTSD symptom assessment and monitoring. In line with this, past research has demonstrated different relationships between PTSD symptoms and objective and subjective measures of rest-activity rhythms [13,14,16,18,19]. Findings from this study may support the idea that subjective and objective rest-activity markers capture distinct phenomena and differ in their use for predicting PTSD symptoms. Furthermore, the results may suggest that both subjective and objective data can offer greater benefits than relying on either in isolation. For example, combining subjective and objective data may help identify markers of paradoxical insomnia, which is often observed in individuals with clinically significant PTSD symptoms [46]. However, given the sample size and scope of this study, further research is needed to validate these findings beyond this small study.

The model predicting probable PTSD at the more conservative cutoff (PCL-5≥38) demonstrated relatively poor performance in this sample. Bootstrap test results reflected that the AUC value for the PCL-5≥38 model was significantly lower than the AUC value achieved by the diagnosis and the PCL-5≥31 models. Importantly, fewer individuals met the more conservative PCL-5 cutoff in this study (n=7; approximately 35 days of data), resulting in potentially greater difficulty in distinguishing individuals based on this outcome, compared to the more liberal cutoff (n=11). Interestingly, this model shared overlapping features with the other 2 models, including restfulness and fragmentation. However, SHAP values were much lower for these features, indicating that they were less meaningful in the overall prediction of a more conservative PCL-5 cutoff. Unlike the other 2 models, the PCL-5≥38 model also included intradaily variability, which captures variability in rest-activity rhythms (ie, how often transitions occur between rest and activity within a day). Potentially, those reporting more severe symptoms, but not meeting criteria for a clinical diagnosis, may have more erratic rest-activity patterns that make predictions from rest-activity metrics more difficult. Future work should consider using longitudinal data to capture the relationship between symptom trajectories and rest-activity rhythm data over time rather than relying on predicting symptoms at a single time point. In particular, research may benefit from more dynamic prediction methods (eg, deep learning using rest-activity data) to effectively predict PTSD symptoms. By capturing these data, it may become evident that the predictive use of different markers varies for individuals across time. For example, the predictive use of subjective restfulness may be greater for individuals who experience more persistent circadian rhythm disruption across time.

There are several important limitations to consider when interpreting these findings. First, the sample size was relatively small and consisted exclusively of male-identifying service members and veterans who had experienced trauma in the context of military service. In contrast, prior studies have developed predictive models using larger and more heterogeneous samples [21]. Only 2 cutoffs (PCL-5≥31 and ≥38) were used in this study; however, a range of potential cutoffs has been identified [29]. This limits the understanding of the predictive performance of these models across different potential outcomes, which may also vary based on context, population, and goal. Notably, rest-activity disturbances are evaluated through both self-report and clinician-administered assessments for PTSD. As this study used the subjective and objective rest-activity data to predict PTSD-specific outcomes, there may be an elevated risk of circularity in the relationships between predictors and each outcome.

This study used a 2-stage modeling approach in which feature selection was conducted on aggregated person-level data across the full sample prior to model training and testing using a leave-one-subject-out cross-validation procedure on day-level data. This approach was chosen to ensure the stability of feature selection in light of the small sample size. However, this results in data leakage, resulting in upwardly-biased model performance estimates. While univariate feature selection was used to reduce the risk of overfitting, these results provide less understanding of multivariate interactions. Future research with larger and more diverse samples is necessary to validate these findings and further explore complex interactions between features. Model performance may be biased due to the sample size and feature selection methods. In addition, validating model performance in independent clinical samples is specifically needed. Including other objective rest-activity metrics (eg, mean activity during the 5 least active hours of the day and mean activity during the 10 most active hours of the day) may enhance model performance and provide a more comprehensive understanding of the relationship between rest-activity patterns and PTSD symptoms. Furthermore, future research may benefit from using individualized modeling approaches to assess the use of intraindividual prediction and the relative importance of features on an individual basis. This may offer more clinically relevant insights into how specific patterns of circadian rhythm disruption relate to PTSD symptoms at the individual level, which in turn can create individualized forms of assessment, monitoring, and intervention. For example, this may support the implementation of tailored chronotherapy (eg, just-in-time interventions focused on regulating one’s circadian rhythm and time-based reminders for certain medications based on individual rest-activity rhythms) to improve clinical outcomes for individuals with PTSD symptoms. Moreover, actigraphy data used in this study captured only a single week of time. Some evidence suggests that differing periods of monitoring may significantly improve the reliability and predictive power of rest-activity models for other mental health concerns (eg, depression) [22]. PTSD symptoms are dynamic across time, and thus, greater periods of time may be needed to capture symptom trends [27,28]. As such, future work would benefit from investigating the optimal window of time needed to capture rest-activity patterns most predictive of PTSD symptoms. This is particularly important for developing scalable and clinically meaningful forms of assessment, monitoring, and intervention that integrate objective and subjective rest-activity data.

Ultimately, the machine learning classifier used in this study demonstrated satisfactory performance in predicting both clinician-diagnosed PTSD and probable PTSD using the more liberal self-report threshold (PCL-5≥31) in this small sample, leveraging a combination of subjective and objective rest-activity data. In contrast, the model predicting probable PTSD based on the more conservative cutoff (PCL-5≥38) demonstrated poor performance. These results highlight differences in the rest-activity markers most relevant to clinically-diagnosed PTSD versus probable PTSD based on self-report. Future research should aim to examine the integration of wearable technology and real-time naturalistic assessment to evaluate dynamic methods of PTSD assessment involving rest-activity rhythm data. Ultimately, integrating both subjective and objective measures of rest-activity may offer a more comprehensive and accurate approach to predicting PTSD.

Acknowledgments

Generative artificial intelligence was not used in the preparation of this manuscript.

Funding

This study was supported by a Young Investigator Grant from the Brain & Behavior Research Foundation.

Data Availability

Data are available upon reasonable request to the corresponding author.

Conflicts of Interest

HJB is a consultant for Natrol, LLC. MP is employed by and retains equity in Reunion Neuroscience. MP also retains equity in Argus Cognitive. DAG is the founder and Chief Scientific and Strategy Advisor at Salimetrics LLC and SalivaBio LLC; these relationships are managed by the policies on conflict of interest at the Johns Hopkins University School of Medicine and the University of California at Irvine.

Multimedia Appendix 1

Wrist actigraphy features.

DOCX File, 8 KB

Multimedia Appendix 2

Correlations between rest-activity features.

DOCX File, 11 KB

Multimedia Appendix 3

F1-scores of aggregated features with each outcome.

DOCX File, 15 KB

Multimedia Appendix 4

Extreme gradient boosting (XGBoost) model hyperparameters.

DOCX File, 14 KB

Multimedia Appendix 5

Logistic regression results.

DOCX File, 4210 KB

  1. Agorastos A, Olff M. Traumatic stress and the circadian system: neurobiology, timing and treatment of posttraumatic chronodisruption. Eur J Psychotraumatol. Nov 27, 2020;11(1):1833644. [CrossRef] [Medline]
  2. Walker WH, Walton JC, DeVries AC, Nelson RJ. Circadian rhythm disruption and mental health. Transl Psychiatry. Jan 23, 2020;10(1):28. [CrossRef] [Medline]
  3. Melo MCA, Abreu RLC, Linhares Neto VB, de Bruin PFC, de Bruin VMS. Chronotype and circadian rhythm in bipolar disorder: a systematic review. Sleep Med Rev. Aug 2017;34:46-58. [CrossRef] [Medline]
  4. Bromundt V, Köster M, Georgiev-Kill A, et al. Sleep-wake cycles and cognitive functioning in schizophrenia. Br J Psychiatry. Apr 2011;198(4):269-276. [CrossRef] [Medline]
  5. Mascaro L, Phillips AJK, Clark JW, Straus LD, Drummond SPA. Diurnal rhythm robustness in individuals with PTSD and insomnia and the association with sleep. J Biol Rhythms. Apr 2021;36(2):185-195. [CrossRef] [Medline]
  6. Sandahl H, Baandrup L, Vindbjerg E, Jennum P, Carlsson J. Social zeitgebers and circadian dysrhythmia are associated with severity of symptoms of PTSD and depression in trauma-affected refugees. Eur Arch Psychiatry Clin Neurosci. Oct 2021;271(7):1319-1329. [CrossRef] [Medline]
  7. Agorastos A, Kellner M, Baker DG, Otte C. When time stands still: an integrative review on the role of chronodisruption in posttraumatic stress disorder. Curr Opin Psychiatry. Sep 2014;27(5):385-392. [CrossRef] [Medline]
  8. Agorastos A, Pervanidou P, Chrousos GP, Baker DG. Developmental trajectories of early life stress and trauma: a narrative review on neurobiological aspects beyond stress system dysregulation. Front Psychiatry. 2019;10:118. [CrossRef] [Medline]
  9. Germain A. Sleep disturbances as the hallmark of PTSD: Where are we now? Am J Psychiatry. Apr 2013;170(4):372-382. [CrossRef] [Medline]
  10. Moon E, Yang M, Seon Q, Linnaranta O. Relevance of objective measures in psychiatric disorders-rest-activity rhythm and psychophysiological measures. Curr Psychiatry Rep. Oct 29, 2021;23(12):85. [CrossRef] [Medline]
  11. Lam L, Ho FYY, Wong VWH, et al. Actigraphic sleep monitoring in patients with post-traumatic stress disorder (PTSD): a meta-analysis. J Affect Disord. Jan 1, 2023;320:450-460. [CrossRef] [Medline]
  12. Kahawage P, Jumabhoy R, Hamill K, de Zambotti M, Drummond SPA. Validity, potential clinical utility, and comparison of consumer and research-grade activity trackers in insomnia disorder I: in-lab validation against polysomnography. J Sleep Res. Feb 2020;29(1):e12931. [CrossRef] [Medline]
  13. Westermeyer J, Sutherland RJ, Freerks M, et al. Reliability of sleep log data versus actigraphy in veterans with sleep disturbance and PTSD. J Anxiety Disord. 2007;21(7):966-975. [CrossRef] [Medline]
  14. Zalta AK, Vanderboll K, Dent AL, et al. Sleep timing, chronotype, and posttraumatic stress disorder: an individual participant data meta-analysis. Psychiatry Res. Mar 2023;321:115061. [CrossRef] [Medline]
  15. Klein E, Koren D, Arnon I, Lavie P. Sleep complaints are not corroborated by objective sleep measures in post-traumatic stress disorder: a 1-year prospective study in survivors of motor vehicle crashes. J Sleep Res. Mar 2003;12(1):35-41. [CrossRef] [Medline]
  16. Slightam C, Petrowski K, Jamison AL, et al. Assessing sleep quality using self-report and actigraphy in PTSD. J Sleep Res. Jun 2018;27(3):e12632. [CrossRef] [Medline]
  17. Lewis C, Lewis K, Kitchiner N, Isaac S, Jones I, Bisson JI. Sleep disturbance in post-traumatic stress disorder (PTSD): a systematic review and meta-analysis of actigraphy studies. Eur J Psychotraumatol. Jul 9, 2020;11(1):1767349. [CrossRef] [Medline]
  18. Tsanas A, Woodward E, Ehlers A. Objective characterization of activity, sleep, and circadian rhythm patterns using a wrist-worn actigraphy sensor: insights into posttraumatic stress disorder. JMIR mHealth uHealth. Apr 20, 2020;8(4):e14306. [CrossRef] [Medline]
  19. Zhu N, Sarawgi A, Bühner M, et al. The relation between passively collected data and PTSD: a systematic review and meta-analysis. NPJ Digit Med. Jul 5, 2025;8(1):413. [CrossRef] [Medline]
  20. Sadeh A, Acebo C. The role of actigraphy in sleep medicine. Sleep Med Rev. Apr 2002;6(2):113-124. [CrossRef] [Medline]
  21. Cakmak AS, Alday EAP, Da Poian G, et al. Classification and prediction of post-trauma outcomes related to PTSD using circadian rhythm changes measured via wrist-worn research watch in a large longitudinal cohort. IEEE J Biomed Health Inform. Aug 2021;25(8):2866-2876. [CrossRef] [Medline]
  22. Choi JG, Ko I, Han S. Depression level classification using machine learning classifiers based on actigraphy data. IEEE Access. 2021;9:116622-116646. [CrossRef]
  23. Logacjov A, Skarpsno ES, Kongsvold A, Bach K, Mork PJ. A machine learning model for predicting sleep and wakefulness based on accelerometry, skin temperature and contextual information. Nat Sci Sleep. 2024;16:699-710. [CrossRef] [Medline]
  24. Cakmak AS, Thigpen N, Honke G, et al. Using convolutional variational autoencoders to predict post-trauma health outcomes from actigraphy data. arXiv. Preprint posted online on Nov 14, 2020. [CrossRef]
  25. Goldstein RB, Smith SM, Chou SP, et al. The epidemiology of DSM-5 posttraumatic stress disorder in the United States: results from the national epidemiologic survey on alcohol and related conditions-III. Soc Psychiatry Psychiatr Epidemiol. Aug 2016;51(8):1137-1148. [CrossRef] [Medline]
  26. Resick PA, Straud CL, Wachen JS, et al. A comparison of the CAPS-5 and PCL-5 to assess PTSD in military and veteran treatment-seeking samples. Eur J Psychotraumatol. 2023;14(2):2222608. [CrossRef] [Medline]
  27. Biggs QM, Ursano RJ, Wang J, Wynn GH, Amin R, Fullerton CS. Daily variation in sleep characteristics in individuals with and without post traumatic stress disorder. BMC Psychiatry. Jun 5, 2021;21(1):292. [CrossRef] [Medline]
  28. Schuler K, Ruggero CJ, Mahaffey B, et al. When hindsight is not 20/20: ecological momentary assessment of PTSD symptoms versus retrospective report. Assessment. Jan 2021;28(1):238-247. [CrossRef] [Medline]
  29. Forkus SR, Raudales AM, Rafiuddin HS, Weiss NH, Messman BA, Contractor AA. The Posttraumatic Stress Disorder (PTSD) Checklist for DSM-5: a systematic review of existing psychometric evidence. Clin Psychol (New York). Mar 2023;30(1):110-121. [CrossRef] [Medline]
  30. First MB, Williams JBW, Karg RS, Spitzer RL. Structured Clinical Interview for DSM-5—Research Version (SCID-5-RV). American Psychiatric Association; 2015. ISBN: 9781585624614
  31. Weathers FW, Litz BT, Keane TM, Palmieri PA, Marx BP, Schnurr PP. PTSD checklist for DSM-5 (PCL-5). PTSD: National Center for PTSD. URL: https://www.ptsd.va.gov/professional/assessment/adult-sr/ptsd-checklist.asp [Accessed 2026-04-02]
  32. The R Project for Statistical Computing. 2021. URL: https://www.R-project.org/ [Accessed 2025-10-10]
  33. Moškon M. CosinorPy: A python package for cosinor-based rhythmometry. BMC Bioinformatics. Oct 29, 2020;21(1):485. [CrossRef] [Medline]
  34. Python 3.9.0. Python. 2020. URL: https://www.python.org/downloads/release/python-390/ [Accessed 2025-10-10]
  35. Lahti TA, Leppämäki S, Lönnqvist J, Partonen T. Transitions into and out of daylight saving time compromise sleep and the rest-activity cycles. BMC Physiol. Feb 2008;8(1). [CrossRef]
  36. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Presented at: KDD ’16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Aug 13-17, 2016. [CrossRef]
  37. Jacobson NC, Chung YJ. Passive sensing of prediction of moment-to-moment depressed mood among undergraduates with clinical levels of depression sample using smartphones. Sensors (Basel). Jun 24, 2020;20(12):3572. [CrossRef] [Medline]
  38. Perski O, Kale D, Leppin C, et al. Supervised machine learning to predict smoking lapses from ecological momentary assessments and sensor data: Implications for just-in-time adaptive intervention development. PLOS Digit Health. Aug 2024;3(8):e0000594. [CrossRef] [Medline]
  39. Lundberg S, Lee SI. A unified approach to interpreting model predictions. Presented at: NIPS’17: Proceedings of the 31st International Conference on Neural Information Processing Systems; Dec 4-9, 2017. URL: https:/​/proceedings.​neurips.cc/​paper_files/​paper/​2017/​file/​8a20a8621978632d76c43dfd28b67767-Paper.​pdf [Accessed 2026-04-02]
  40. Lehavot K, Katon JG, Chen JA, Fortney JC, Simpson TL. Post-traumatic stress disorder by gender and veteran status. Am J Prev Med. Jan 2018;54(1):e1-e9. [CrossRef] [Medline]
  41. Vali M, Nezhad HM, Kovacs L, Gandomi AH. Machine learning algorithms for predicting PTSD: a systematic review and meta-analysis. BMC Med Inform Decis Mak. Jan 21, 2025;25(1):34. [CrossRef] [Medline]
  42. Papini S, Norman SB, Campbell-Sills L, et al. Development and validation of a machine learning prediction model of posttraumatic stress disorder after military deployment. JAMA Netw Open. Jun 1, 2023;6(6):e2321273. [CrossRef] [Medline]
  43. Marmar CR, Brown AD, Qian M, et al. Speech-based markers for posttraumatic stress disorder in US veterans. Depress Anxiety. Jul 2019;36(7):607-616. [CrossRef] [Medline]
  44. McDonald AD, Sasangohar F, Jatav A, Rao AH. Continuous monitoring and detection of post-traumatic stress disorder (PTSD) triggers among veterans: a supervised machine learning approach. IISE Trans Healthc Syst Eng. Jul 3, 2019;9(3):201-211. [CrossRef]
  45. Magal N, Netzer O, Eldar E, et al. Circadian instability predicts PTSD symptom severity following mass trauma. medRxiv. Preprint posted online on Mar 20, 2025. [CrossRef]
  46. Rezaie L, Fobian AD, McCall WV, Khazaie H. Paradoxical insomnia and subjective-objective sleep discrepancy: a review. Sleep Med Rev. Aug 2018;40:196-202. [CrossRef] [Medline]


AUC: area under the curve
DSM-5: Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition
PCL-5: Posttraumatic Stress Disorder Checklist for Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition
PTSD: posttraumatic stress disorder
SCID-5-RV: Structured Clinical Interview for DSM-5 Disorders–Research Version
SHAP: Shapley Additive Explanations
WASO: wake after sleep onset
XGBoost: extreme gradient boosting


Edited by Amy Schwartz, Matthew Balcarras; submitted 16.Oct.2025; peer-reviewed by Juan M Santos-Gago, Ningzhe Zhu; final revised version received 04.Mar.2026; accepted 05.Mar.2026; published 19.May.2026.

Copyright

© Katherine Wislocki, Ghazal Naderi, Jessica L Borelli, Mark Pollack, Douglas A Granger, David P Cenkner, Margaret Canady, Helen J Burgess, Alyson K Zalta. Originally published in JMIR Formative Research (https://formative.jmir.org), 19.May.2026.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.