Skip to main content

Twenty-four-hour physical activity patterns associated with depressive symptoms: a cross-sectional study using big data-machine learning approach

Abstract

Background

Depression is a global burden with profound personal and economic consequences. Previous studies have reported that the amount of physical activity is associated with depression. However, the relationship between the temporal patterns of physical activity and depressive symptoms is poorly understood. In this exploratory study, we hypothesize that a particular temporal pattern of daily physical activity could be associated with depressive symptoms and might be a better marker than the total amount of physical activity.

Methods

To address the hypothesis, we investigated the association between depressive symptoms and daily dominant activity behaviors based on 24-h temporal patterns of physical activity. We conducted a cross-sectional study on NHANES 2011–2012 data collected from the noninstitutionalized civilian resident population of the United States. The number of participants that had the whole set of physical activity data collected by the accelerometer is 6613. Among 6613 participants, 4242 participants had complete demography and Patient Health Questionnaire-9 (PHQ-9) questionnaire, a tool to quantify depressive symptoms. The association between activity-count behaviors and depressive symptoms was analyzed using multivariable logistic regression to adjust for confounding factors in sequential models.

Results

We identified four physical activity-count behaviors based on five physical activity-counting patterns classified by unsupervised machine learning. Regarding PHQ-9 scores, we found that evening dominant behavior was positively associated with depressive symptoms compared to morning dominant behavior as the control group.

Conclusions

Our results might contribute to monitoring and identifying individuals with latent depressive symptoms, emphasizing the importance of nuanced activity patterns and their probability of assessing depressive symptoms effectively.

Peer Review reports

Background

Depression is a mood disorder that causes persistent sadness and loss of interest in enjoyable activities [1,2,3,4]. Depression is a prevalent mental health condition affecting about 280 million people worldwide and responsible for more than 47 million disability-adjusted life years in 2019 [5, 6]. It is also a leading cause of disability globally and is associated with premature mortality from other illnesses [7] and suicide [8]. Depression also imposes substantial economic burdens. In 2010, it was estimated that approximately $US210.5 billion was spent annually in the US alone on lost work productivity and medical treatment associated with depression [9,10,11]. The manifestation of depressive symptoms is an important sign of the onset of depression [2, 4]. Therefore, understanding the prevalence of depressive symptoms within the population is vital for effective public health planning [12].

Physical activity is a factor associated with depression [13]. Recent observational studies found that every additional hour of light-intensity physical activity decreased the chances of being depressed [14, 15]. Similarly, higher physical activity levels, especially among women, are associated with a reduced likelihood of having depression [16]. Consistent with these observations, studies involving interventions have demonstrated that all modes of physical activity effectively reduce depressive symptoms, although high-intensity physical activity shows more significant improvements in reducing mild to moderate symptoms of depression, anxiety, and psychological distress in diverse adult populations [17].

While recent investigations challenge the notion that a higher volume or intensity of physical activity is beneficial for depressive symptoms [18, 19], some studies have indicated a potential association between the timing of physical activity and depressive symptoms [20, 21], possibly due to the shift in the circadian rhythm experienced by individuals with depression [22]. The shift in the circadian rhythm could result in changes in physical, mental, and behavioral patterns throughout the 24-h cycle, leading to a change in preferred times for engaging in activities among depressive populations [23, 24]. Therefore, the physical activity pattern in terms of timing could be a helpful and vital parameter in understanding the association between physical activity and depressive symptoms.

Several studies have explored the relationship between physical activity patterns, specifically in timing, and the association with depressive symptoms [20, 25,26,27]. By contrast, a previous study has reported that individuals with depression and anxiety exhibited lower overall activity levels but did not show a specific association between the timing of activity and the presence or severity of depressive symptoms [28]. No differences in the timing of physical activity between depressed and non-depressed individuals were found, even when participants receiving antidepressant treatment were excluded from the analysis [29]. Considering these reports, the association between physical activity patterns and depressive symptoms remains inconsistent.

The inconsistency of the association between depressive symptoms and physical activity patterns, including timing or phase, might be due to the difficulty of establishing a standardized procedure to categorize physical activity patterns. Previous studies can be attributed to the limitations in capturing the behavior of physical activity based on the temporal shape of the activity for each day [20, 26,27,28,29]. By aggregating data from multiple days, the unique temporal shape of each day's activity is diluted, potentially masking the associations with depressive symptoms [30]. These temporal shapes could be critical because depressive individuals have more fragmented physical activity throughout the day because of distorted circadian rhythm [30, 31]. Thus, the temporal shape of physical activity could be a valuable marker for detecting depressive symptoms [32].

Recently, machine learning (ML) has gained increasing prominence in the field of epidemiology due to its potential for effective disease prediction and patient care [32,33,34,35]. ML is also applied to detecting physical activities in daily behaviors such as sleeping, sitting, walking, ascending stairs, descending stairs, and running from big data obtained from wearable devices or smartphones [36]. Raw physical activity data is often complex and multidimensional. Therefore, ML analyses are beneficial even when relationships or interactions between variables are nonlinear [37,38,39]. In addition, another advantage of the ML approach is to enable us to measure accurate physical activity over a long period, possibly leading to figuring out physical activity patterns, which could be challenging to handle and unrecognized by manual assessments [32,33,34,35].

In this study, we investigated the relationship between activity behaviors, defined as the relative frequency of temporal physical activity patterns, and depressive symptoms using big data of the accelerometer from the National Health and Nutrition Examination Survey (NHANES) 2011–2012 [40] by time-series analysis with ML [41]. Here, we identified five 24-h activity-counting patterns and four physical activity-count behaviors. The 24-h activity-counting patterns were classified by temporal physical activity patterns for 24 h, reflecting a distinguishable activity pattern throughout the day. On the other hand, the physical activity-count behaviors were determined by the relative frequencies of the physical activity-counting patterns, and they would indicate a major physical activity pattern that each individual engages in throughout the week. Individuals with the Evening dominant behavior had a higher prevalence of depressive symptoms than those with the Morning dominant behavior (as a reference), even though they showed similar total activity count. These findings could highlight the importance of distribution in activity behaviors for understanding the relationship between physical activity and depressive symptoms. Recognizing the patterns and variability of physical activity might contribute to finding a behavioral context for manifesting depressive symptoms.

Methods

In our cross-sectional study, we used data from NHANES 2011–2012, focusing on the noninstitutionalized civilian resident population of the United States. The study involved 6,917 participants who were given accelerometers to measure physical activity. Of these, 6,613 participants provided a complete 7-day data set of physical activity, resulting in 46,291 days (7 days for each participant: 7 × 6,613 = 46,291). We employed unsupervised machine learning by Kernel K-means to cluster the twenty-four-hour activity counting pattern, and later, based on the dominant activity counting pattern, we clustered the activity counting behavior by k-means clustering.

In this study, the physical activity-count behaviors were determined by the relative frequency of the physical activity-counting patterns, which are classified as temporal shapes based on physical activity-counting within 24 h. The physical activity-count behaviors would provide us with insights into the specific type of physical activity pattern that each individual engages through a week, while the physical activity-counting patterns reflect how an individual's activity is distributed throughout the day, based on rhythms or lifestyle. We classified four distinct activity-count behaviors based on the dominant activity pattern. Individuals with the Evening dominant behavior had a higher prevalence of depressive symptoms than those with the Morning dominant behavior (as a reference), even though they showed similar total activity count.

4,242 of 6,613 participants had complete demographic information and depressive symptoms data using the Patient Health Questionnaire-9 (PHQ-9), a tool used to assess depressive symptoms. Assessment using the PHQ-9 was performed once during data collection. Our subsequent analysis focused on these 4,242 participants. To explore the relationship between activity-count behavior and depressive symptoms, we conducted a multivariable logistic regression analysis on this subset of participants. We used cluster labels obtained from 6,613 participants, including those without a complete set of the demography and the PHQ-9, to generalize the dominant activity counting pattern and to minimize the selection bias depending on the response rate. The detailed participant flow and analysis overview are explained in a flow chart in Supplementary Figure S1.

Activity counting data

National Health and Nutrition Examination Survey (NHANES) 2011–2012 data was used for the analysis [40]. The NHANES 2011–2012 received approval from the National Center for Health Statistics (NCHS) Ethics Review Board (Protocol #2011–2017). Participants were asked to wear a wrist-worn ActiGraph model GT3X + accelerometer for seven consecutive days, 24 h a day, starting on the day of their examination [42]. The wrist-worn ActiGraph model GT3X + accelerometer captured acceleration along three axes (x, y, and z) with 80 Hz sampling intervals as raw data in the original NHANES survey (PAX80_G Doc in [40]). The raw data was used to calculate the count data per minute of the physical activity monitor on the same website (Physical Activity Monitor—Minute in [40]). We used the count data per minute provided on the NHANS website as a starting data set to produce “counting data” per hour. Based on a previous study, the ActiGraph model GT3X + accelerometer has been reported as a reliable tool for capturing information on physical activity in free-living conditions over an extended duration. The study indicates that the accuracy of data obtained is higher when the accelerometer is worn continuously for 7 days, as opposed to a single-day wear scenario [43]. In this study, the device was worn on the non-dominant wrist, and participants were instructed to wear it continuously except when they needed to temporarily remove it. In our study, we exclusively included participants who provided complete data for the entire 7-day duration without resorting to any imputation techniques. This approach ensured that only individuals with a full set of data for each day were considered in our analysis.

Depression data

The NHANES 2011–2012 uses the PHQ-9 questionnaire to measure depression. The Patient Health Questionnaire (PHQ-9) is a commonly used screening tool to assess the presence and severity of depression. It consists of nine questions about the frequency of symptoms of depression experienced over the past 2 weeks. Each question has four response categories: “not at all,” “several days,” “more than half the days,” and “nearly every day.” Each response category is assigned a point ranging from 0 to 3, with 0 indicating that the symptom was not present and 3 indicating that the symptom was present nearly every day. The scores for each item are then summed to give a total score ranging from 0 to 27. The final follow-up question in the PHQ-9 assesses the overall impairment caused by depressive symptoms. This question asks the patient to rate the degree to which their symptoms have caused problems in their daily life, including work, school, or relationships. Overall, the PHQ-9 provides a simple and reliable method for identifying individuals experiencing depression and assessing the severity of their symptoms. It is widely used in clinical practice and research to screen for depression and monitor the response to treatment. PHQ-9 score > or = 10 had a sensitivity of 88% and a specificity of 88% for major depression [44,45,46].

Software and data processing

MATLAB (2021a, MathWorks, MA) and Python (3.11.0) were used for data processing and visualization.

Twenty-four-hour activity counting pattern and daily activity counting behaviors

In this analysis, the minute-level physical activity data from the NHANES 2011–2012 dataset is utilized. Only participants with a whole day of data for 7 complete days (totaling 6613 participants) are included to ensure data completeness. The data is then converted into a standardized format (mean = 0 and SD = 1) for each 24 h, resulting in a dataset comprising 46,291 days of 24-h standardized data. To cluster the 24-h physical activity patterns, unsupervised ML such as Kernel K-means using the tslearn Python package is employed [47]. The physical activity-count behaviors were obtained as follows: 1) for each participant, the relative frequency of each 24-h physical activity-counting pattern (AD, M, E, BP, and IM) was calculated from the proportion of each pattern for 7 days. 2) To obtain the activity-count behaviors, vectors of the relative frequencies of each pattern (five variables for each participant) were clustered by unsupervised ML using a K-means clustering algorithm with MATLAB. The cluster names were determined according to the major patterns with higher relative frequency contained in each cluster. Figure 1 displays five clusters representing different 24-h physical activity patterns: all day (AD), morning (M), evening (E), bi-phasic (BP), and irregular morning (IM). Figure 2 illustrates four clusters of daily activity-count behaviors, categorized as AD dominant, M dominant, AD + M dominant, and E dominant.

Fig. 1
figure 1

Five 24-h physical activity patterns. A Individual plots of standardized activity count for each pattern. B Plots of mean values with standard deviation (SD) for each pattern. AD, All-day; M, Morning; E, Evening; BP, Bi-phasic; IM, Irregular morning

Fig. 2
figure 2

Four daily behavioral patterns. A Individual spider plots of the relative frequency of 24-h physical activity patterns. B Spider plots of mean values for each pattern. AD (All day) dominant; M (Morning) dominant; AD + M (All day + Morning) dominant; E (Evening) dominant

Depression measurement

The study used the PHQ-9 questionnaire to measure depression. Out of the total 6613 participants, 4,242 participants provided complete answers to all nine questions, and they have the complete dataset for Age, Gender, BMI, and Work situation. The questionnaire provides a total score ranging from 0 to 27, with a threshold score of 10 being used to determine depression [48]. Participants who scored less than 10 were labeled as not having depression, while those who scored 10 or more were labeled as having depression. Out of 4242 participants, 390 participants have depressive symptoms, and 3852 participants have no depressive symptoms.

Statistical analysis

The statistical analyses were performed using JMP software (version 16.2.0). For the demographic analysis, we reported descriptive statistics such as means and standard deviations (SD) for continuous variables and percentages for categorical variables stratified by activity-count behavior. To compare differences in continuous variables, we used ANOVA, while chi-square tests were employed for categorical variables across different activity-count behaviors.

To investigate the relationship between activity-count behavior and depressive symptoms, we conducted a multivariable logistic regression analysis. Depressive symptoms were the dependent variable, and activity-count behavior was the independent variable. We calculated odds ratios (ORs) and 95% confidence intervals (CIs). Initially, we examined the crude model (Model 1), which assessed the association between activity-count behavior and depressive symptoms without considering any confounding factors. Subsequently, we introduced one confounding factor at a time, progressing from Model 2 to Model 6.

In order to assess the impact of residual confounding, we conducted a sensitivity analysis. This analysis aimed to determine the influence of age, BMI, gender, work situation, and total activity on the association between activity-count behavior and depressive symptoms. We performed logistic regression analyses similar to the previous models, adding only one individual confounding factor in each model. We used the Wald test to assess the significance of the association in both cases. We also checked the correlations between each individual variable. We considered statistical significance as p < 0.05 for all analyses.

Results

Demographic characteristics of activity count behavior

Twenty-four-hour activity-counting patterns and daily activity-count behaviors were determined using the ML procedure reported previously [41]. Five 24-h activity-counting patterns all day (AD), morning (M), evening (E), bi-phasic (BP), and irregular morning (IM) and 4 clusters of daily activity-count behaviors (AD dominant, M dominant, AD + M dominant, and E dominant) were identified (Figs. 1 and 2 and Supplementary Tables S1-S3). The daily activity-count behaviors were named based on the higher relative frequency of specific 24-h activity-counting patterns after clustering using unsupervised ML (Supplementary Table S3). For instance, AD dominant showed a 0.7900 ± 0.1281 of the relative frequency of AD pattern. M and E showed 0.8341 ± 0.1303 and 0.4916 ± 0.2396, respectively. AD + M dominant contains AD and M in similar proportions (0.4342 ± 0.1340 and 0.4396 ± 0.1168). Participants with a complete set of demography and PHQ-9 data (n = 4242) were extracted from the clustered data (n = 6613) and used in subsequent analyses. Table 1 presents the demographic characteristics of each activity-count behavior. Out of 4242 participants, M dominant behavior held the maximum number of participants (n = 1444, 34.04%), followed by AD + M dominant (n = 1075, 25.34%), AD dominant (n = 1012, 23.86%), and E dominant (n = 711, 16.76%) behaviors. For age, M dominant is the oldest adult (55.82 ± 16.43 years), followed by AD + M dominant (47.47 ± 17.60 years), AD dominant (44.68 ± 18.52 years), and E dominant (34.75 ± 15.76 years) (p < 0.0001). BMI does not show a significant difference among the activity-count behavior (p = 0.1284), and the BMI is highest for the AD + M dominant (29.18 ± 7.03) and lowest for the AD dominant (28.49 ± 7.09). E dominant showed the highest percentage of male participants (55.27%), followed by M dominant (52.22%) and AD + M dominant (49.12%). AD dominant has the lowest percentage of male participants (44.47%). The work situation also differs between activity-count behaviors (p < 0.0001). The total activity is highest in AD dominant (13114.39 ± 3606.71), followed by M dominant (12844.19 ± 4140.83), AD + M dominant (11960.01 ± 4292.37), and E dominant (11544.57 ± 4926.13). E dominant showed the highest prevalence of depressive symptoms (15.75%), followed by AD + M dominant (8.56%), AD dominant (8.40%), and M dominant (6.99%) (p < 0.0001).

Table 1 Demographic characteristics of the clusters

Relationship between activity-count behavior and depressive symptoms

Table 2 represents the odds ratios (ORs) and 95% confidence intervals (CIs) for the depressive symptoms based on activity-count behaviors. The M dominant behavior was chosen as a reference since the cluster had the largest population with the lowest prevalence of depressive symptoms and the most representative activity-counting behavior in this population (Table 1 and Supplementary Table S2). In the crude model (Model 1), the OR for depressive symptoms was 2.48 (95% CI: 1.86–3.30) for the E dominant behavior, 1.24 (95% CI: 0.92–1.67) for the AD + M dominant behavior, and 1.21 (95% CI: 0.90–1.64) for the AD dominant behavior. A significant association between E dominant behavior and depressive symptoms was observed after adjusting for covariates, including age, BMI, gender, work situation, and total activity (Model 2 to 6). No strong correlation was observed among these covariates (Supplementary Figure S2). A significant difference was observed after adjusting for each confounding factor (age, BMI, gender, work situation, and total activity). E dominant behavior remains associated with a higher prevalence of depressive symptoms compared to M dominant behavior (p < 0.0001), as shown in Supplementary Table S4. These results suggest that the association between E dominant behavior and depressive symptoms is robust against these confounding factors.

Table 2 Association between activity count behaviors and the prevalence of depressive symptoms among 4242 participants

Discussion

In this study, we first characterized physical activity-counting patterns by examining the temporal shape of physical activity that reflects how an individual's activity is distributed throughout 24 h. Then, we defined physical activity-count behaviors by identifying the dominant physical activity-counting patterns, showing the repetitive features of the physical activity that individuals engage in throughout the week. This procedure was achieved using unsupervised ML techniques. The current study identified five physical activity-counting patterns and four physical activity-count behaviors. Based on the identified physical activity-count behaviors, we revealed that individuals exhibiting E dominant (Evening dominant) behavior had a higher prevalence of depressive symptoms compared to those with M dominant (Morning dominant) behavior, irrespective of total activity count. This association was confirmed after adjusting for potential confounding factors. We also found that individuals with a higher prevalence of depressive symptoms have higher day-to-day variability.

Recently, studies using ML to analyze a huge data set of physical movements appeared with potential applications in epidemiology, public health, and human behavioral science [32,33,34,35]. The ML is applied to identify or categorize behaviors such as sleeping, lying, sitting, standing, walking, ascending or stairs, running, and other routine activities like biking, household chores, and yoga [36]. However, few studies have directly extracted or categorized temporal physical activity patterns by ML but could not differentiate between the temporal physical activity pattern and the intensity [49] or the total amount of physical activity [50]. Our study focused on categorizing each 24-h activity-count into several clusters directly and identifying the behavioral tendencies of each individual based on the relative frequency of dominant activity patterns. Our approach based on data-driven analysis could be more straightforward, insightful, and beneficial in handling temporal patterns and dominant behaviors of human physical activity directly, although a relatively large complete data set is required.

Previous studies showed various results regarding the association between depressive symptoms and physical activity patterns, such as the timing of activity [26, 27], consistent with our result that a specific temporal behavioral pattern (E-dominant) is associated with depressive symptoms. In these reports, the higher depression severity was linked to increased activity late at night in the calculated 24-h activity pattern [26], and the depressed participants had significantly lower physical activity counts during the daytime compared to healthy controls [27]. In contrast, the other observational and case–control studies have indicated no association between depressive symptoms and activity timing [28, 29]. Different classifications of the activity patterns due to the distinct procedures might result in inconsistent conclusions since these observational and case–control studies did not observe or detect the evening or nighttime activity patterns, possibly leading to no association of physical activity patterns with depressive symptoms.

In our results, the E pattern showed the highest activity between 6 p.m. and 9 p.m., and the E dominant behavior showed a higher prevalence of depressive symptoms. The association between E dominant behavior and the depressive symptom might reflect a disruption of circadian rhythm, which could play a crucial role in regulating mood and sleep–wake cycles since disruptions in the circadian rhythms, such as delayed sleep–wake timing, have been associated with increased risk of depressive symptoms and mood disorders [51]. Alternatively, individuals with depression may reduce daytime activity and compensate for total physical activity by increasing their evening activity when they feel energetic [52, 53]. The latter possibility might explain our result in which E dominant (risk group) and M dominant (control group) behavior showed a similar total count activity.

Many previous studies reported that there is a connection between depressive symptoms and low physical activity levels [20, 27, 28]. These findings suggested that total physical activity would be significant for the association with depressive symptoms. For instance, individuals with depression had lower 24-h physical activity levels compared to controls without depression, and those with acute depression exhibited a marginal significance towards later timing of daily activity peaks, particularly in the evening [20]. Similarly, individuals with depression displayed significantly lower physical activity levels between 7 a.m. and 10 p.m. compared to healthy controls, considering both timing and activity levels [27]. Additionally, the presence and severity of depressive and anxiety disorders were associated with reduced overall daily activity levels, but no significant association was found with the timing of activity [28]. In our study, E-dominant behavior exhibited a total activity count similar to AD + M-dominant behavior (Table 1). However, only E-dominant behavior was found to be associated with depressive symptoms compared to M-dominant behavior (Table 2). On the other hand, the AD dominant behavior has a higher amount of activity count compared to the M dominant behavior (Table 1), but the AD dominant behavior is not associated with depressive symptoms (Table 2). These results suggest that the physical activity behavior based on the dominant physical activity pattern might be a more significant parameter than the total amount of physical activity to explain the association with depressive symptoms in a particular procedure.

We also examined the day-to-day variability of activity patterns within different activity behaviors (Supplementary Table S3) and their relationship with depressive symptoms. We found that AD dominant, M dominant, and AD + M dominant behaviors exhibit the least day-to-day variability, with nearly 80% of their dominant activity patterns corresponding to their respective behaviors. However, the E dominant behavior shows considerably more variability, as only 49% of E patterns align with E dominant behavior, while the rest are distributed among other patterns (Supplementary Table S3). In this analysis, we observed that individuals with higher day-to-day variability (E dominant behavior) had a higher prevalence of depressive symptoms compared to those with less day-to-day variability in their activity behavior (AD dominant, M dominant, and AD + M dominant behavior) (Table 1). These findings indicate that the variability and temporal shape of activity patterns may play a significant role in the relationship between physical activity and depressive symptoms. This underscores the importance of considering these factors when seeking to understand the connection between physical activity and depressive symptoms.

Recent research has yielded valuable insights into the multifaceted influences on depressive symptoms across different life stages and factors, such as age, BMI, gender, and employment. Age was associated with a gradual increase in depressive symptoms, especially among older individuals [54]. Furthermore, a study on the relationship between depressive symptoms and BMI anticipated that overweight or obese individuals would experience more severe depressive symptoms, regardless of their racial background [55]. There are gender differences in depressive disorder, where women are more likely to have a higher prevalence of depression than men [56]. Unemployment also could be associated with the onset of clinical major depression [57]. In our study, however, the male ratio was the highest (male: female = 55.27:44.73 in %) in the E dominant cluster among other clusters (Table 1). On the other hand, the unemployment was similar between AD dominant and E dominant (7.02% and 9.42%, respectively), but the OR for depressive symptoms was not significant in the AD dominant. In addition, the OR for depressive symptoms was significantly higher even after gender and work situation were adjusted in the E-dominant cluster (Table 2). Therefore, gender and unemployment could have a minor contribution to depressive symptoms in our study. Our study uncovered a higher prevalence of depressive symptoms among individuals engaged in E-dominant behavior compared to those with M-dominant behavior, even after adjusting for confounding factors above. Our finding provides insight into the contribution of the physical activity pattern to understanding depressive syndrome, emphasizing the importance of individualized assessment of physical activity behavior to understand depressive symptoms.

The extent to which our machine-learning procedure could be generalized remains in future studies. However, our approach in this study could be applicable to other diseases or disorders in which physical activity could be closely related to the diagnosis. Temporal physical activity patterns might also indicate a meaningful link to cardiovascular disease (CVD) and obesity [49, 50]. Clustering temporal physical activity patterns by machine learning could enable comprehensive description or analysis at a higher resolution.

A major limitation of the current study is that the causal relationship between activity-count behavior and depressive symptoms could not be established because of the cross-sectional design. In addition, self-reported questionnaires were used to assess depressive symptoms. Although PHQ-9 used in this study has high sensitivity and specificity [44,45,46], we did not precisely diagnose the depression. A major limitation of this study is that about one-third (2,371) of the 6,613 participants used for the clustering had to be excluded from the analysis of depressive symptoms due to lacking complete demographic and PHQ-9 questionnaire data, possibly leading to selection bias and less generalizability. Moreover, participants were conscious that their physical activity was being monitored, possibly leading to the Hawthorne effect by which their behavior may change due to awareness [58]. Ubiquitous physical activity monitoring using devices such as smartphones daily would potentially minimize this bias and improve the study's reliability.

However, our study would provide a possible impact of physical activity patterns rather than the total amount of physical activity on the relationship with depressive symptoms. Evaluating the physical activity/behavioral patterns, such as temporal changes or relative frequency, may lead to valuable insights into the association between physical activity and depressive symptoms. Our findings also have a potential benefit to developing software for wearable devices alerting or reminding a risk to prevent the progression of depressive symptoms. These findings might have implications for improving the management and prevention of depressive symptoms in clinical and community settings.

Conclusions

This research illuminates the intricate link between the temporal patterns of physical activity and the prevalence of depressive symptoms. Utilizing unsupervised machine learning for the categorization of 24-h activity patterns, it uncovers distinct daily behavioral patterns related to physical activity. A pivotal discovery of this study is the positive correlation between physical activity patterns predominantly in the evening and depressive symptoms, in contrast to morning-centric activities. This underscores the necessity of considering not only the aggregate amount but also the temporal distribution of physical activity in the context of mental health.

The methodology of this study, which incorporates objective data from accelerometers and sophisticated clustering techniques, brings a detailed perspective to the exploration of physical activity's role in depression. The correlation identified between the evening-dominant pattern and depressive symptoms, irrespective of the total amount of activity, highlights the importance of considering the timing and pattern of physical activity. These findings pave the way for personalized interventions, such as custom-tailored activity recommendations or notifications from wearable devices, aiming to reduce the likelihood of depressive symptoms. In addition, the application of these findings in both clinical and community environments could significantly improve strategies for the prevention and management of depressive symptoms.

Availability of data and materials

The dataset supporting the conclusions of this article is available in the NHANES repository.

https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2011

Abbreviations

AD:

All day

M:

Morning

E:

Evening

BP:

Bi-phasic

IM:

Irregular morning

AD + M:

All day + Morning

NHANES:

National Health and Nutrition Examination Survey

PHQ-9:

Patient Health Questionnaire-9

BMI:

Body Mass Index

MATLAB:

Matrix Laboratory

ML:

Machine learning

ANOVA:

Analysis of Variance

SD:

Standard Deviation

OR:

Odd Ratio

CI:

Confidence Interval

References

  1. Chand SP, Arif H. Depression. StatPearls Publishing; 2023.

  2. Psychiatry.org - What Is Depression? https://www.psychiatry.org/patients-families/depression/what-is-depression. Accessed 9 Oct 2023.

  3. What is Depression? | SAMHSA. https://www.samhsa.gov/mental-health/depression. Accessed 9 Oct 2023.

  4. Depressive disorder (depression). https://www.who.int/news-room/fact-sheets/detail/depression. Accessed 9 Oct 2023.

  5. Abbafati C, Abbas KM, Abbasi-Kangevari M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. 2020;396:1204–22.

    Article  Google Scholar 

  6. Depressive disorder (depression). https://www.who.int/news-room/fact-sheets/detail/depression.

  7. Walker ER, McGee RE, Druss BG. Mortality in mental disorders and global disease burden implications a systematic review and meta-analysis. JAMA Psychiat. 2015;72:334–41.

    Article  Google Scholar 

  8. Patel V, Chisholm D, Parikh R, et al. Addressing the burden of mental, neurological, and substance use disorders: Key messages from disease control priorities, 3rd ed. Lancet. 2016;387:1672–85.

    Article  PubMed  Google Scholar 

  9. Broadhead WE, Blazer DG, George LK, Tse CK. Depression, disability days, and days lost from work in a prospective epidemiologic survey. JAMA. 1990;264:2524–8.

    Article  CAS  PubMed  Google Scholar 

  10. Lopez CJLM and AD. It (1996) Evidence-Based Health Policy-Lessons from the Global Burden of Disease Study. Science. 1979;274:1–4.

    Google Scholar 

  11. Greenberg PE, Fournier AA, Sisitsky T, Simes M, Berman R, Koenigsberg SH, Kessler RC. The Economic Burden of Adults with Major Depressive Disorder in the United States (2010 and 2018). Pharmacoeconomics. 2021;39:653–65.

    Article  PubMed  PubMed Central  Google Scholar 

  12. CDC CDC Promotes public health approach to address depression among older adults.

  13. DIshman RK, McDowell CP, Herring MP,. Customary physical activity and odds of depression: A systematic review and meta-analysis of 111 prospective cohort studies. Br J Sports Med. 2021;55:926–34.

    Article  PubMed  Google Scholar 

  14. Loprinzi PD. Objectively measured light and moderate-to-vigorous physical activity is associated with lower depression levels among older US adults. Aging Ment Health. 2013;17:801–5.

    Article  PubMed  Google Scholar 

  15. Kandola A, Lewis G, Osborn DPJ, Stubbs B, Hayes JF. Depressive symptoms and objectively measured physical activity and sedentary behaviour throughout adolescence: a prospective cohort study. Lancet Psychiatry. 2020;7:262–71.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Loprinzi PD, Cardinal BJ. Interrelationships among physical activity, depression, homocysteine, and metabolic syndrome with special considerations by sex. Prev Med (Baltim). 2012;54:388–92.

    Article  CAS  Google Scholar 

  17. Singh B, Olds T, Curtis R, et al. Effectiveness of physical activity interventions for improving depression, anxiety and distress: an overview of systematic reviews. Br J Sports Med. 2023;57(18):1203–9 bjsports-2022–106195.

    Article  PubMed  Google Scholar 

  18. Shimamoto H, Suwa M, Mizuno K. Relationships between depression, daily physical activity, physical fitness, and daytime sleepiness among japanese university students. Int J Environ Res Public Health. 2021. https://doi.org/10.3390/ijerph18158036.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Guo F, Tian Y, Zhong F, Wu C, Cui Y, Huang C. Intensity of physical activity and depressive symptoms in college students: Fitness improvement tactics in youth (fityou) project. Psychol Res Behav Manag. 2020;13:797–811.

    Google Scholar 

  20. Minaeva O, Booij SH, Lamers F, Antypa N, Schoevers RA, Wichers M, Riese H. Level and timing of physical activity during normal daily life in depressed and non-depressed individuals. Transl Psychiatry. 2020. https://doi.org/10.1038/S41398-020-00952-W.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Wirz-Justice A. Biological rhythm disturbances in mood disorders. Int Clin Psychopharmacol. 2006;21:11–5.

    Article  Google Scholar 

  22. Quera Salva MA, Hartley S, Barbot F, Alvarez JC, Lofaso F, Guilleminault C. Circadian rhythms, melatonin and depression. Curr Pharm Des. 2011;17:1459–70.

    Article  CAS  PubMed  Google Scholar 

  23. Germain A, Kupfer DJ. Circadian rhythm disturbances in depression. Hum Psychopharmacol Clin Exp. 2008;23:571–85.

    Article  Google Scholar 

  24. Bechtel W. Circadian rhythms and mood disorders: Are the phenomena and mechanisms causally related? Front Psychiatry. 2015;6: 153068.

    Article  Google Scholar 

  25. Gianfredi V, Ferrara P, Pennisi F, Casu G, Amerio A, Odone A, Nucci D, Dinu M. Association between daily pattern of physical activity and depression: A systematic review. Int J Environ Res Public Health. 2022;19:6505.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Banihashemi N, Robillard R, Yang J, Carpenter JS, Hermens DF, Naismith SL, Terpening Z, White D, Scott EM, Hickie IB. Quantifying the effect of body mass index, age, and depression severity on 24-h activity patterns in persons with a lifetime history of affective disorders. BMC Psychiatry. 2016;16:1–11.

    Article  Google Scholar 

  27. Wolff EA, Putnam FW, Post RM. Motor activity and affective illness. The relationship of amplitude and temporal distribution to changes in affective state. Arch Gen Psychiatry. 1985;42:288–94.

    Article  PubMed  Google Scholar 

  28. Difrancesco S, Riese H, Merikangas KR, Shou H, Zipunnikov V, Antypa N, van Hemert AM, Schoevers RA, Penninx BWJH, Lamers F. Sociodemographic, health and lifestyle, sampling, and mental health determinants of 24-hour motor activity patterns: Observational study. J Med Internet Res. 2021;23:1–14.

    Article  Google Scholar 

  29. Lorenz N, Spada J, Sander C, Riedel-Heller SG, Hegerl U. Circadian skin temperature rhythms, circadian activity rhythms and sleep in individuals with self-reported depressive symptoms. J Psychiatr Res. 2019;117:38–44.

    Article  PubMed  Google Scholar 

  30. Brady R, Brown WJ, Hillsdon M, Mielke GI. Patterns of accelerometer-measured physical activity and health outcomes in adults: A systematic review. Med Sci Sports Exerc. 2022;54:1155–66.

    Article  PubMed  Google Scholar 

  31. De Feijter M, Kocevska D, Ikram MA, Luik AI. The bidirectional association of 24-h activity rhythms and sleep with depressive symptoms in middle-aged and elderly persons. 2023.

    Book  Google Scholar 

  32. Wiemken TL, Kelley RR. Machine Learning in Epidemiology and Health Outcomes Research. Annu Rev Public Health. 2020;41:21–36.

    Article  PubMed  Google Scholar 

  33. Rose S. Intersections of machine learning and epidemiological methods for health services research. Int J Epidemiol. 2021;49:1763–70.

    Article  PubMed  Google Scholar 

  34. Javaid M, Haleem A, Pratap Singh R, Suman R, Rab S. Significance of machine learning in healthcare: Features, pillars and applications. International Journal of Intelligent Networks. 2022;3:58–73.

    Article  Google Scholar 

  35. Hamilton AJ, Strauss AT, Martinez DA, Hinson JS, Levin S, Lin G, Klein EY. Machine learning and artificial intelligence: applications in healthcare epidemiology. Antimicrobial Stewardship & Healthcare Epidemiology : ASHE. 2021;1:1–6.

    Article  Google Scholar 

  36. Jones PJ, Catt M, Davies MJ, Edwardson CL, Mirkes EM, Khunti K, Yates T, Rowlands AV. Feature selection for unsupervised machine learning of accelerometer data physical activity clusters – A systematic review. Gait Posture. 2021;90:120–8.

    Article  PubMed  Google Scholar 

  37. Bzdok D, Altman N, Krzywinski M. Points of Significance: Statistics versus machine learning. Nat Methods. 2018;15:233–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Rajula HSR, Verlato G, Manchia M, Antonucci N, Fanos V. Comparison of Conventional Statistical Methods with Machine Learning in Medicine: Diagnosis, Drug Development, and Treatment. Medicina (B Aires). 2020;56:1–10.

    Google Scholar 

  39. Ley C, Martin RK, Pareek A, Groll A, Seil R, Tischer T. Machine learning and conventional statistics: making sense of the differences. Knee Surg Sports Traumatol Arthrosc. 2022;30:753–7.

    Article  PubMed  Google Scholar 

  40. Centers for Disease Control and Prevention (CDC). National Center for Health Statistics (NCHS). National Health and Nutrition Examination Survey Data. Hyattsville: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention, [2011–2012]. https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2011. Accessed 15 Jul 2023.

  41. Nawrin SS, Inada H, Momma H, Nagatomi R. Clustering temporal step-counting patterns for 24 hours with machine learning revealed potential heterogeneity in the categorization by a traditional tertile procedure. bioRxiv. 2023. https://doi.org/10.1101/2023.06.04.543652

  42. Center for Health Statistics N. National health and nutrition examination survey (NHANES). Physical activity monitor (PAM) procedures manual. 2011. https://www.cdc.gov/nchs/data/nhanes/nhanes_11_12/Physical_Activity_Monitor_Manual.pdf. Accessed 5 Jan 2024.

  43. Aadland E, Ylvisåker E. Reliability of the Actigraph GT3X+ Accelerometer in Adults under Free-Living Conditions. PLoS ONE. 2015;10: e0134606.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Spitzer RL, Kroenke K, Williams JBW. Validation and utility of a self-report version of PRIME-MD: The PHQ Primary Care Study. J Am Med Assoc. 1999;282:1737–44.

    Article  CAS  Google Scholar 

  45. Kroenke K, Spitzer RL. The PHQ-9: A new depression diagnostic and severity measure. Psychiatr Ann. 2002;32:509–15.

    Article  Google Scholar 

  46. Kroenke K, Spitzer RL, Williams JBW. The PHQ-9: Validity of a brief depression severity measure. J Gen Intern Med. 2001;16:606–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Tavenard R, Faouzi J, Vandewiele G, Divo F, Androz G, Holtz C, Payne M, Yurchak R, Rußwurm M. Tslearn, A Machine Learning Toolkit for Time Series Data. J Mach Learn Res. 2020;21:1–6.

    Google Scholar 

  48. Smagula SF, Zhang G, Gujral S, Covassin N, Li J, Taylor WD, Reynolds CF, Krafty RT. Association of 24-Hour Activity Pattern Phenotypes with Depression Symptoms and Cognitive Performance in Aging. JAMA Psychiat. 2022;79:1023–31.

    Article  Google Scholar 

  49. Niemelä M, Kangas M, Farrahi V, Kiviniemi A, Leinonen A-M, Ahola R, Puukka K, Auvinen J, Korpelainen R, Jämsä T. Intensity and temporal patterns of physical activity and cardiovascular disease risk in midlife. 2019. https://doi.org/10.1016/j.ypmed.2019.04.023.

    Article  Google Scholar 

  50. Aqeel M, Guo J, Lin L, Gelfand S, Delp E, Bhadra A, Richards EA, Hennessy E, Eicher-Miller HA. Temporal physical activity patterns are associated with obesity in U.S. adults. Prev Med (Baltim). 2021;148:106538.

    Article  Google Scholar 

  51. Wirz-Justice A, Van Den Hoofdakker RH. Sleep deprivation in depression: What do we know, where do we go? Biol Psychiatry. 1999;46:445–53.

    Article  CAS  PubMed  Google Scholar 

  52. Trotti LM. Waking up is the hardest thing I do all day: Sleep inertia and sleep drunkenness. Sleep Med Rev. 2017;35:76.

    Article  PubMed  Google Scholar 

  53. Smagula SF, Capps CS, Krafty RT. Evaluating the timing of differences in activity related to depression symptoms across adulthood in the United States. J Affect Disord. 2021;284:64.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Fiske A, Gatz M, Pedersen NL. Depressive Symptoms and Aging: The Effects of Illness and Non-Health-Related Events. The Journals of Gerontology: Series B. 2003;58:P320–8.

    Article  Google Scholar 

  55. Badillo N, Khatib M, Kahar P, Khanna D. Correlation Between Body Mass Index and Depression/Depression-Like Symptoms Among Different Genders and Races. Cureus. 2022. https://doi.org/10.7759/CUREUS.21841.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Piccinelli M, Wilkinson G. Gender differences in depression: Critical review. Br J Psychiatry. 2000;177:486–92.

    Article  CAS  PubMed  Google Scholar 

  57. Jefferis BJ, Nazareth I, Marston L, et al. Associations between unemployment and major depressive disorder: evidence from an international, prospective study (the predict cohort). Soc Sci Med. 2011;73:1627–34.

    Article  PubMed  Google Scholar 

  58. McCambridge J, Witton J, Elbourne DR. Systematic review of the Hawthorne effect: New concepts are needed to study research participation effects. J Clin Epidemiol. 2014;67:267.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

A pioneering research support grant, JST SPRING (Grant number: JPMJSP2114) from Tohoku University partially supported this project. The funding body has no role in the study design, data analysis, or manuscript submission.

Author information

Authors and Affiliations

Authors

Contributions

SSN and HI analyzed the data. SSN, HI, and HM wrote the manuscript. HI and RN designed and supervised the research. All authors revised the manuscript and approved the final manuscript.

Corresponding authors

Correspondence to Hitoshi Inada or Ryoichi Nagatomi.

Ethics declarations

Ethics approval and consent to participate

The NHANES 2011–2012 obtained approval from the Ethics Review Board of the National Center for Health Statistics (NCHS) under Protocol #2011–2017. The methods employed in this study adhered to relevant guidelines and regulations. Written informed consent was secured from all participants.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nawrin, S.S., Inada, H., Momma, H. et al. Twenty-four-hour physical activity patterns associated with depressive symptoms: a cross-sectional study using big data-machine learning approach. BMC Public Health 24, 1254 (2024). https://doi.org/10.1186/s12889-024-18759-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12889-024-18759-5

Keywords