Skip to main content

Short-term exposure sequences and anxiety symptoms: a time series clustering of smartphone-based mobility trajectories

Abstract

Background

Short-term environmental exposures, including green space, air pollution, and noise, have been suggested to affect health. However, the evidence is limited to aggregated exposure estimates which do not allow the capture of daily spatiotemporal exposure sequences. We aimed to (1) determine individuals’ sequential exposure patterns along their daily mobility paths and (2) examine whether and to what extent these exposure patterns were associated with anxiety symptoms.

Methods

We cross-sectionally tracked 141 participants aged 18–65 using their global positioning system (GPS) enabled smartphones for up to 7 days in the Netherlands. We estimated their location-dependent exposures for green space, fine particulate matter, and noise along their moving trajectories at 10-min intervals. The resulting time-resolved exposure sequences were then partitioned using multivariate time series clustering with dynamic time warping as the similarity measure. Respondents’ anxiety symptoms were assessed with the Generalized Anxiety Disorders-7 questionnaire. We fitted linear regressions to assess the associations between sequential exposure patterns and anxiety symptoms.

Results

We found four distinctive daily sequential exposure patterns across the participants. Exposure patterns differed in terms of exposure levels and daily variations. Regression results revealed that participants with a “moderately health-threatening” exposure pattern were significantly associated with fewer anxiety symptoms than participants with a “strongly health-threatening” exposure pattern.

Conclusions

Our findings support that environmental exposures’ daily sequence and short-term magnitudes may be associated with mental health. We urge more time-resolved mobility-based assessments in future analyses of environmental health effects in daily life.

Background

Evidence steadily consolidates that accounting for dynamic environmental exposures is likely more accurate than residence-based approaches, not factoring in exposures incurred during people’s day-to-day mobility [1, 2]. Only a few studies have relied on global positioning system (GPS)-enabled sensing technologies to measure people’s exposures at out-of-home activity locations and along their mobility paths [3,4,5,6]. Given the high spatiotemporal granularity of tracking data, location-dependent exposure sequences can be derived [7]. However, to date, no studies have appeared to have done so.

State-of-the-art mobility-based exposure assessments focus primarily on the health effects of aggregated exposures (e.g., daily or weekly exposure averages per person) [8,9,10,11,12]. However, such aggregation might average out short-term spatiotemporal variabilities in exposures, which likely have different health effects than exposure averages [13]. A US study supports this speculation showing that asthma symptoms could be associated with 1-h maximum exposure to ambient particulate matter (PM10); however, null associations were found using 24-h average PM10 exposures [14]. Another study also observed a similar phenomenon in which the daily mean PM2.5 concentrations were thought to underestimate the cardiovascular mortality burden due to hourly PM2.5 variations being ignored [15]. Therefore, solely using mean values of spatiotemporal exposures along individuals’ daily mobility paths to characterize their exposures is possibly an oversimplified approach that overlooks temporal variations.

Based on GPS tracking data, the measurement of short-term daily exposure sequences allows to consider both magnitude and temporal fluctuations of exposures [2]. Further, exposures captured at different daily time points may have differential impacts on health [16,17,18]. Thus, there is a significant need for dynamic exposure conceptualizations that incorporate spatiotemporal exposure sequences along people’s daily mobility paths [19].

Our exploratory study had two aims for bridging this research gap. First, to characterize individuals’ daily sequential exposure patterns of green space, noise, and PM2.5 along their mobility paths based on time series clustering. Second, using anxiety symptoms as an example, we aim to examine how different daily sequential exposure patterns were associated mental health.

Methods

Mobile phone-based GPS data collection

We acquired GPS data from the NEEDS (‘Dynamic Urban Environmental Exposures on Depression and Suicide’) study collected between September and November 2018 in the Netherlands [20]. Survey respondents aged 18 to 65 (N = 11,505) who agreed to be re-contacted were invited via email to download our Android “Your Living Environment” mobile phone application to collect movement data. We sent 8869 invitation emails within 2 days of survey completion. To increase participation in the mobile phone-based GPS data collection, we raffled off 400 gift vouchers valued at €22 each. In total, 821 participants downloaded the app (i.e., 7.1% of the survey respondents), and 629 permitted tracking and recording.

The locational information was recorded every 20 s for subjects in motion. The recording frequency decreased to 1 min if the mobile phone showed no movement (i.e., displacement of the phone < 20 m) after 30 min. If the mobile phone was stationary for over 1 h, the location was recorded every 2 min to conserve battery power. After a cumulative total of 7 days of data collection, the app stopped recording. Data, initially stored locally, were then uploaded daily to a secure server at Utrecht University.

Preprocessing of the GPS data

The GPS data were cleaned following established practice [3, 4]. Data cleaning entailed the removal of inaccurate GPS points and participants whose records were not representative of a typical week (e.g., leaving the country) (Additional file 1: Table S1). This cleaning process preserved 419 participants with 685,971 GPS points provided.

Recording a sufficient quantity of GPS locations was necessary to derive meaningful tracks capturing people’s daily movement and their related spatiotemporal environmental exposures. However, locational sampling coverage was likely to be sparse due to technological constraints (e.g., signal loss in urban areas), battery depletion, or phones temporarily being turned off. To evaluate the quality of the daily GPS data, for each participant, we enumerated how many daytime hours (6:00 am to 10:00 pm, 16 h in total) included at least one recorded location (Additional file 1: Table S2). We assumed people remained home for 8 nighttime hours, from 10:00 pm to 6:00 am. According to Additional file 1: Table S2, only a limited number of participants had complete GPS records for all 7 days (i.e., every day included GPS-based data points for all 16 daytime hours).

Given the limited number of complete GPS records, our study defined days with 4 h or less of missing GPS data (i.e., a minimum of 12 daytime hours with GPS records) as valid days providing sufficient daily locational data. Those with at least 4 valid days were eligible for inclusion to represent participants’ typical weekly mobility. Considering the possibility of different travel behaviors on weekends versus weekdays [21], we included participants with a minimum of 3 valid weekdays and 1 valid weekend day. In cases in which participants had more valid weekdays or weekend days than the minimum required, we randomly selected 3 weekdays and 1 weekend day to ensure each participant had the same number of days of data included. Our final sample included 141 participants.

Environmental exposure data

Green space

We used the Normalized Difference Vegetation Index (NDVI) as our green space metric [22]. The NDVI was derived from Landsat 8 imagery with a cloud cover of < 40% for 2018 obtained through Google Earth Engine [23]. Scenes had a resolution of 30 m × 30 m. We only included atmospherically corrected images collected between May and September, when vegetation is at its greenest. We removed pixels with a cloud score of > 25 before determining the median NDVI per pixel. Negative NDVI values were masked to avoid distortion. Higher positive NDVI values represented higher levels of vegetation.

Noise

Average day–night–evening (Lden [dB]) noise levels were calculated according to the Standard Model Instrumentation for Noise Assessments (STAMINA). This model considered noise sources from roads, rails, air traffic, industry, and wind turbines for 2016 [24]. The noise data resolution varied depending on the distance between the noise source and the observation point, with resolution values increasing from 10 to 80 m based on increasing distance [25].

Air pollution

We acquired yearly averaged concentrations of fine (≤ 2.5 µm) particulate matter (PM2.5) (µgm−3) from a nationwide land-use regression model [26]. The model regressed monitored PM2.5 concentrations on land use, traffic infrastructure, traffic intensity, and population density for 2009. The calibrated model was then used to predict PM2.5 concentrations at unsampled locations. We resampled the data from 5 to 25 m to reduce program run time. The air pollution data set we used showed that air pollution values had remained stable for nearly a decade prior to our survey and were, thus, applicable for use in our analysis [27].

Determining space-time exposure series

Following a previous GPS study [28], environmental exposures were assessed every 10 min along a respondent’s moving trajectory, resulting in 144 daily 10-min segments (Fig. 1a). We created line-based buffers of 100 m for each trajectory segment (Fig. 1b), a typical buffer width used elsewhere [3, 4, 29]. Mean NDVI, noise, and PM2.5 exposure estimates were determined per segment within a buffer (Fig. 1c). The 48 nighttime exposure segments (10:00 pm to 6:00 am, 8 h) were assessed using home-based buffers (i.e., 100 m) (Fig. 1d). Home addresses were geocoded by matching the population register with the cadaster. For some of the daytime segments without GPS records (Fig. 1e), interpolation was conducted by copying the exposures from the previous segment (Fig. 1f), assuming people were static during this period of missing data. Consequently, for each participant, we received three (i.e., NDVI, noise, and PM2.5) time series of environmental exposures with a length of 576 short-term exposure assessments (144 × 4 days) (Fig. 1g). All participants’ exposures for their 3 weekdays were placed before their weekend exposures to ensure consistent weekday-weekend ordering of data. For our sensitivity analyses, we repeated the procedure using 50 m buffers and 30-min time windows (48 segments per day). The latter resulted in exposure series with a length of 192 exposure assessments (48 × 4 days).

Fig. 1
figure 1

Assessment of exposure time series based on the GPS tracking data

Multivariate time series clustering

Based on the space-time exposure series obtained from “Determining space-time exposure series” section, we employed multivariate time series clustering with constrained dynamic time warping as a distance metric to determine environmental exposure patterns [30]. Due to differences in exposure units, we normalized each exposure time series using z-scores. To achieve clustering results with optimal performance, we tested a varying number of clusters (k) from 2 to 8 (Additional file 1: Fig. S1) and the window size of constraints (i.e., the maximum amount of warping) from 0.5 to 2.5 h (Additional file 1: Fig. S2). For each combination of the two parameters, we repeated the clustering procedure 100 times as the algorithm required a random starting value, which could potentially affect the partition results [31]. We used the Davies–Bouldin index to identify an optimal number of clusters (k) and the window size of constraints [32]. Smaller index values indicate better clustering performance [33]. The results with the smallest DB value were used. Separate time series clustering runs were conducted for different buffer sizes (50 and 100 m) and time windows (10 and 30 min). We visualized the diurnal multi-exposure patterns of each cluster using hourly line charts, where the exposure values of each hour referred to the averages of the corresponding hour over 4-day spans.

Statistical analyses

We developed regression models to test the associations between anxiety symptoms and sequential exposure patterns as a pilot study. Anxiety symptoms over the last 2 weeks were measured during the survey using the Generalized Anxiety Disorder-7 (GAD-7) questionnaire [34]. Participants reported on seven questions, such as “feeling nervous, anxious or on edge” or “not being able to stop or control worrying.“ Response options to each question ranged from 0 (“Not at all”) to 3 (“Nearly every day”). We summed the individual item scores resulting in GAD-7 scores ranging from 0 to 21. A higher overall score indicated more severe anxiety symptoms. The Cronbach’s alpha was 0.90 between individual items.

Models were adjusted for age, sex, income (quintiles treated continuously), employment status (employed, unemployed), marital status (married, unmarried), and educational background. Educational background was scored as high (undergraduate or graduate university education), medium (upper secondary education), and low (up to lower secondary education). We fitted separate models for different buffer sizes (50 and 100 m) and time windows (10 and 30 min) for sensitivity tests. All analyses were conducted using R software, version 4.1.3 [35].

Results

Descriptive statistics

Table 1 shows the descriptive statistics for our sample. The average GAD-7 score was 4.10, with a standard deviation (SD) of ± 4.34. The mean age of the participants was 43.66, and the majority were employed (72.3%). The gender and marital status divisions were approximately equal. Most respondents were middle or highly educated with high or very high incomes. The demographic and socioeconomic characteristics of the analytical sample closely mirrored those before the preprocessing of the GPS data (Additional file 1: Table S3).

Table 1 Summary statistics of the sample

Sequential environmental exposure patterns

The Davies–Bouldin index suggested that four clusters were optimal for the multivariate time series clustering (Additional file 1: Fig. S1). Figure 2 shows each cluster’s sequential environmental exposure patterns using buffers of 100 m and time windows of 10 min. Participants in each cluster were exposed to different ambient environments, which revealed distinctive daily exposure variations. Based on the exposure characteristics, we labeled the clusters as “strongly health-threatening,” “moderately health-threatening,” “moderately health-supportive,” and “strongly health-supportive.” Those in the “strongly health-threatening” cluster (N = 38) were constantly exposed to high noise and high PM2.5 concentrations in conjunction with lesser green space exposure. Participants in the “moderately health-threatening” cluster (N = 23) constantly experienced lesser green space exposure and relatively low noise at night but striking noise increases during the daytime, with PM2.5 exposure being relatively high at night and increasing further during the day. Participants allocated to the “moderately health-supportive” cluster (N = 40) were exposed to relatively high amounts of green space with a decrease in green space exposure during the daytime and moderately high noise and PM2.5 levels with a slight increase during the daytime. The “strongly health-supportive” cluster (N = 40) included participants exposed to high amounts of green space but with a decrease during the daytime, along with low noise and PM2.5 exposures at nighttime but which increased during the day.

Fig. 2
figure 2

Sequential environmental exposure patterns of the four clusters based on 100 m buffers and 10-min time windows. Exposure means and related SDs are shown for each exposure in each cluster. The data partition was based on multivariate time series clustering with constrained dynamic time warping

Associations between anxiety symptoms and exposure patterns

Figure 3 illustrates the regression results between anxiety symptoms and the exposure patterns for the 100 m buffers and 10-min time windows. The signs of the coefficients indicated that, on average, participants in the “moderately health-threatening,“ “moderately health-supportive,“ and “strongly health-supportive” clusters tended to be associated with lower GAD-7 scores than participants in the reference cluster (i.e., “strongly health-threatening”). However, only the “moderately health-threatening” cluster reached statistical significance (p < 0.05); the negative coefficient of the “moderately health-supportive” cluster was marginally significant (Additional file 1: Table S4). Unexpectedly, the negative coefficient of the “strongly health-supportive” cluster was insignificant.

Fig. 3
figure 3

Regression coefficients and 95% CIs for the associations between the GAD-7 scores and the exposure patterns. Cluster 1 (“strongly health-threatening”) served as the reference category. The model was adjusted for age, sex, income, employment status, marital status, and educational background. Exposures were based on 100 m buffers and 10 min time windows

Sensitivity analyses

Our sensitivity analyses revealed that the sequential environmental exposure patterns of the 50 m buffers (Additional file 1: Fig. S3) and the 30-min time windows (Additional file 1: Figs. S4 and S5) resulted in four clusters largely congruent with our main results. Only minor differences were noticeable. For example, for the 50 m buffers (Additional file 1: Fig. S3), the “moderately health-supportive” cluster had lower overall PM2.5 exposure, and the “moderately health-threatening” cluster had higher nighttime noise exposure compared with the 100 m buffers (Fig. 2).

The regression results were broadly consistent across the 50 and 100 m buffers (Additional file 1: Fig. S6). In the model with 50 m buffers, the “moderately health-supportive” cluster was statistically significant (p < 0.05) instead of the “moderately health-threatening” cluster. Sensitivity tests with the 30-min time windows resulted in similar negative, but non-significant, coefficients of the three clusters (i.e., “moderately health-threatening,“ “moderately health-supportive,“ and “strongly health-supportive”) compared to the “strongly health-threatening” cluster (Additional file 1: Fig. S6).

Discussion

Main findings

The few available mobility-based studies on environmental-health associations typically have assessed aggregated exposures over time (e.g., daily or weekly). Our study broke new ground by proposing a novel methodology to determine temporally disaggregated sequential exposure patterns along individuals’ GPS-tracked mobility paths. We identified four sequential exposure patterns among our participants based on the magnitude of the experienced environmental exposures (i.e., green space, noise, and PM2.5) along the daily moving trajectories. Participants in different clusters differed in their overall exposure levels and daily exposure variation, which traditional exposure averages have difficulty capturing. The regression results of our pilot study revealed that the varying sequential exposure patterns were differently associated with anxiety symptoms. Particularly, those participants with a “moderately health-threatening” exposure pattern were associated with significantly fewer anxiety symptoms than participants with a “strongly health-threatening” exposure pattern. However, the associations between anxiety symptoms and the sequential exposure patterns did not pass all sensitivity tests, especially when using 30-min exposure time windows.

Characterizing sequential exposure patterns using time series clustering approach

Our study demonstrated the usefulness of time series clustering to determine individuals’ sequential spatiotemporal exposure patterns. This dynamic exposure conceptualization considers the co-occurrence of disaggregated exposures over space-time. Characterizing exposures in this manner will likely provide exposure summarizations that differ from past studies’ aggregated exposure assessments [36]. Unlike traditional sequence analysis, constrained to categorical data [37, 38], time series clustering enables continuous exposure metrics, which are more desirable because they require neither exposure categorization nor arbitrarily defined distance metrics between different categories [28].

The temporal granularity (i.e., the time interval of the exposure windows) is an important parameter that could affect the clustering results [39]. A too-fine temporal resolution may introduce much, probably less meaningful, fluctuation, while a too-coarse temporal resolution may average the exposure variations [40]. We observed significant anxiety-related associations using 10-min exposure windows, but these results were not replicated when using 30-min time windows. Our study uncovered granularity-related temporal uncertainties that parallel other well-established issues with defining health-influencing geographic contexts [41]. Future studies are encouraged to carefully specify both the temporal and spatial exposure windows, which may depend on whether the analyses rely on short-term exposures along people’s daily mobility or long-term over their life course.

Sequential exposure patterns and anxiety symptoms

Since our pilot study is the first we are aware of that deals with daily sequential exposure patterns, we embed our findings in the broader debate over the impact of the environment on mental health [42,43,44]. Our regression results suggested that participants exposed to a severely compromised ambient environment (i.e., “strongly health-threatening” cluster) tended to be associated with more anxiety symptoms. This finding is in line with previous studies that had reported positive associations between anxiety and exposure to noise [45, 46] and PM2.5 [8, 45], especially when the exposure levels were high [3, 47]. Moreover, we also found that participants with a “moderately health-threatening” exposure pattern were associated with significantly fewer anxiety symptoms than those with a “strongly health-threatening” exposure pattern. The most apparent difference between the two clusters was that participants in the “moderately health-threatening” cluster had significantly less nighttime noise exposure. This lesser exposure may have significantly contributed to their relative paucity of anxiety symptoms, as nighttime noise has been recognized as a more pronounced risk factor than daytime noise [18, 48].

Participants in the “moderately health-supportive” and “strongly health-supportive” clusters had moderate or high green space exposure. However, their negative coefficients did not reach statistical significance, implying that participants exposed to more green space did not necessarily to be associated with significantly fewer anxiety symptoms. A possible explanation is that, despite high overall green space exposures, these participants tended to experience significantly less green space during the daytime when they were most active. Another possible explanation is that individuals with more anxiety symptoms may spend more time in green and quiet places for self-regulation purposes. The insignificant result differs from previous studies reporting significant negative associations between green space exposure and anxiety [49,50,51,52]. However, in line with our results, others have also reported null associations [53,54,55].

Our study indicated that the exposure magnitude and daily variations of multiple exposures might jointly affect anxiety symptoms, although rarely previously recognized. We speculate that individuals’ daily mood fluctuations are influenced by their biological clocks [56, 57] and their daily dynamic exposures [58]. Supported by prior studies [59, 60], these fluctuations in immediate moods can shape individuals’ mental well-being over time. This rationale may explain why daily sequential exposure patterns could be related to people’s mental health status.

Strengths and limitations

This mobility-based study went beyond exposure averages to daily sequential exposure patterns. Such sequential exposure patterns consider both the magnitude and variation of exposures, which is impossible with traditional aggregation-based approaches. Additional file 1: Figures S7 and S8 show the distinction between our time-series approach and the aggregation-based approach when preprocessing mobility-based exposure data. The multivariate time series clustering enabled us to simultaneously include multiple exposures over space and time. This dynamic exposure assessment took advantage of the spatiotemporal exposure measures obtained from GPS tracking and has the potential to contribute new insights into exposure-health associations. Finally, while many previous GPS-based studies had study domains limited to a single city [9, 61,62,63], we tested our approach based on a nationwide sample from the Netherlands, providing data from diverse environmental settings.

The study also had several limitations. First, the PM2.5 and noise data, derived for 2009 and 2016, may not entirely reflect the air pollution and noise levels in 2018 when the GPS data were collected. Second, the exposure assessment was based on exposure maps representing annual averages, which neglect temporal exposure fluctuations and, in turn, may lead to some exposure misclassification, especially for air pollution, which faces diurnal variations [64]. To realize the full advantage of our time series approach, spatiotemporally resolved exposure maps (e.g., on an hourly level) should be incorporated in the future to explore the full potential of our approach. Third, our pilot study only measured anxiety symptoms at baseline. Our analysis was conducted under the assumption that individuals’ anxiety symptoms and their daily sequential exposure patterns remained stable throughout our tracking period of 1 week. It was reported that most people with high GAD-7 scores have chronic symptoms for a month or more [34]. Elsewhere, it was also shown that individuals’ day-to-day mobility patterns are relatively stable [65]. Our cross-sectional design limited our ability to establish causal relationships. We advice future studies to record mental health responses repeatedly using geographically-explicit ecological momentary assessments [66]. Fourth, as with most GPS-based studies [11, 61], we cannot rule out individuals’ selective daily mobility bias [67, 68]. Fifth, our sample was not representative of the Dutch population. Most included participants were highly educated with relatively higher incomes, which would challenge the generalization of the results, but this study introduced a methodological innovation. Finally, obtaining enough hourly GPS tracks for each participant was challenging and resulted in a relatively small final sample. We also cannot guarantee that our sample has represented all possible exposure patterns and using a different sample could yield slightly different sequential exposure patterns. Thus, future studies with more extensive, high-quality samples are needed to examine how exposure sequences are possibly associated with health outcomes.

Conclusions

We proposed a novel GPS-based methodology to determine individuals’ sequential environmental exposure patterns along their daily mobility. Our findings support the previously neglected notion that people’s daily sequential exposure patterns may play a role in mental health. Our data on GPS-tracked Dutch adults showed four distinctive daily sequential exposure patterns based on people’s daily mobility paths. Each pattern was composed of multiple exposures, which differed in magnitudes and daily exposure variations. The regression analyses provided suggestive evidence that some daily exposure patterns were associated with anxiety symptoms. We advise future studies using more extensive tracking data to replicate our approach using time-resolved exposure assessments to uncover spatiotemporally based environmental impacts on mental health.

Availability of data and materials

The data used in this analysis cannot be shared with third parties due to privacy restrictions.

References

  1. Desjardins MR, et al. Improving longitudinal research in geospatial health: an agenda. Health Place. 2023;80: 102994.

    PubMed  Google Scholar 

  2. Helbich M. Toward dynamic urban environmental exposure assessments in mental health research. Environ Res. 2018;161:129–35.

    CAS  PubMed Central  PubMed  Google Scholar 

  3. Lan Y, et al. Daily space-time activities, multiple environmental exposures, and anxiety symptoms: a cross-sectional mobile phone-based sensing study. Sci Total Environ. 2022;834: 155276.

    CAS  PubMed  Google Scholar 

  4. Roberts H, Helbich M. Multiple environmental exposures along daily mobility paths and depressive symptoms: a smartphone-based tracking study. Environ Int. 2021;156:106635.

    CAS  PubMed  Google Scholar 

  5. Liu D, et al. Examining individual-level tri-exposure to greenspace and air/noise pollution using individual-level GPS-based real-time sensing data. Soc Sci Med. 2023;329: 116040.

    PubMed  Google Scholar 

  6. Wei L, et al. Measuring environmental exposures in people’s activity space the need to account for travel modes and exposure decay. J Expo Sci Environ Epidemiol. 2023. https://0-doi-org.brum.beds.ac.uk/10.1038/s41370-023-00527-z.

    Article  PubMed  Google Scholar 

  7. Campbell M, Marek L, Hobbs M. Reconsidering movement and exposure: towards a more dynamic health geography. Geogr Compass. 2020;15: e12566.

    Google Scholar 

  8. Azhdari, et al. Associations of combined short-term exposures to ambient PM2.5 air pollution and noise annoyance on mental health disorders: a panel study of healthy college students in Tehran. Air Qual Atmos Health. 2022;15(8):1497–505.

    CAS  Google Scholar 

  9. Letellier N, et al. Air pollution and metabolic disorders: dynamic versus static measures of exposure among Hispanics/Latinos and non-Hispanics. Environ Res. 2022;209: 112846.

    CAS  PubMed Central  PubMed  Google Scholar 

  10. Li D, et al. Moving beyond the neighborhood: daily exposure to nature and adolescents’ mood. Landsc Urban Plan. 2018;173:33–43.

    Google Scholar 

  11. Marquet O, et al. GPS-based activity space exposure to greenness and walkability is associated with increased accelerometer-based physical activity. Environ Int. 2022;165: 107317.

    PubMed Central  PubMed  Google Scholar 

  12. Yoo E-h, Roberts JE. Static home-based versus dynamic mobility-based assessments of exposure to urban green space. Urban For Urban Green. 2022;70: 127528.

    Google Scholar 

  13. Park YM, Kwan M-P. Individual exposure estimates may be erroneous when spatiotemporal variability of air pollution and human mobility are ignored. Health Place. 2017;43:85–94.

    PubMed  Google Scholar 

  14. Delfino Ralph J, et al. Association of asthma symptoms with peak particulate air pollution and effect modification by anti-inflammatory medication use. Environ Health Perspect. 2002;110(10):A607-617.

    CAS  PubMed Central  PubMed  Google Scholar 

  15. Lin HL, et al. Daily exceedance concentration hours: a novel indicator to measure acute cardiovascular effects of PM2.5 in six Chinese subtropical cities. Environ Int. 2018;111:117–23.

    CAS  PubMed  Google Scholar 

  16. Tao Y, et al. Associations of co-exposures to air pollution and noise with psychological stress in space and time: a case study in Beijing, China. Environ Res. 2020;196: 110399.

    PubMed  Google Scholar 

  17. Heritier H, et al. Diurnal variability of transportation noise exposure and cardiovascular mortality: a nationwide cohort study from Switzerland. Int J Hyg Environ Health. 2018;221(3):556–63.

    PubMed  Google Scholar 

  18. Munzel T, et al. Adverse cardiovascular effects of traffic noise with a focus on nighttime noise and the new WHO noise guidelines. Annu Rev Public Health. 2020;41:309–28.

    PubMed  Google Scholar 

  19. Li K, et al. Using dynamic time warping self-organizing maps to characterize diurnal patterns in environmental exposures. Sci Rep. 2021;11(1):24052.

    CAS  PubMed Central  PubMed  Google Scholar 

  20. Helbich M. Dynamic urban environmental exposures on depression and suicide (NEEDS) in the Netherlands: a protocol for a cross-sectional smartphone tracking study and a longitudinal population register study. BMJ Open. 2019;9(8): e030075.

    PubMed Central  PubMed  Google Scholar 

  21. Astroza S, et al. Understanding activity engagement across weekdays and weekend days: a multivariate multiple discrete-continuous modeling approach. J Choice Model. 2018;28:56–70.

    Google Scholar 

  22. Tucker CJ. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens Environ. 1979;8(2):127–50.

    Google Scholar 

  23. Gorelick N, et al. Google earth engine: planetary-scale geospatial analysis for everyone. Remote Sens Environ. 2017;202:18–27.

    Google Scholar 

  24. National Institute for Health and, Environment. Noise in the Netherlands (Lden). National georegister; 2019.

  25. Schreurs E, Jabben J, Verheijen E. STAMINA—model description. Standard model instrumentation for noise assessments. 2010.

  26. Schmitz O, et al. High resolution annual average air pollution concentration maps for the Netherlands. Sci Data. 2019;6(1):190035.

    PubMed Central  PubMed  Google Scholar 

  27. de Hoogh K, et al. Spatial PM2.5, NO2, O3 and BC models for western Europe—evaluation of spatiotemporal stability. Environ Int. 2018;120:81–92.

    PubMed  Google Scholar 

  28. Liu B, et al. Disentangling time use, food environment, and food behaviors using multi-channel sequence analysis. Geogr Anal. 2021;54(4):881–917.

    Google Scholar 

  29. Mueller W, et al. Urban greenspace and the indoor environment: pathways to health via indoor particulate matter, noise, and road noise annoyance. Environ Res. 2020;180: 108850.

    CAS  PubMed  Google Scholar 

  30. Sardá-Espinosa A. Time-series clustering in R using the dtwclust package. R J. 2019;11:22.

    Google Scholar 

  31. Fränti P, Sieranoja S. How much can k-means be improved by using better initialization and repeats? Pattern Recogn. 2019;93:95–112.

    Google Scholar 

  32. Davies DL, Bouldin DW. A cluster separation measure. IEEE Trans Pattern Anal Mach Intell. 1979;PAMI-1(2):224–7.

    Google Scholar 

  33. Wijaya YA, et al. Davies Bouldin index algorithm for optimizing clustering case studies mapping school facilities. TEM J Tech Educ Manag Inform. 2021;10(3):1099–103.

    Google Scholar 

  34. Spitzer RL, et al. A brief measure for assessing generalized anxiety disorder: the GAD-7. JAMA Intern Med. 2006;166(10):1092–7.

    Google Scholar 

  35. R Core Team. R: a language and environment for statistical computing. 2022. https://www.R-project.org/.

  36. Pelgrims I, et al. Association between urban environment and mental health in Brussels, Belgium. BMC Public Health. 2021;21(1):635.

    PubMed Central  PubMed  Google Scholar 

  37. Liao TF, et al. Sequence analysis: its past, present, and future. Soc Sci Res. 2022;107: 102772.

    PubMed  Google Scholar 

  38. Piccarreta R, Studer M. Holistic analysis of the life course: methodological challenges and new perspectives. Adv Life Course Res. 2019;41:100251.

    PubMed  Google Scholar 

  39. Guo HY, et al. Information granulation-based fuzzy clustering of time series. IEEE Trans Cybern. 2021;51(12):6253–61.

    PubMed  Google Scholar 

  40. Burkhard O, et al. On the requirements on spatial accuracy and sampling rate for transport mode detection in view of a shift to passive signalling data. Transp Res Part C Emerg Technol. 2020;114:99–117.

    Google Scholar 

  41. Li J, et al. Comparing effects of euclidean buffers and network buffers on associations between built environment and transport walking: the multi-ethnic study of atherosclerosis. Int J Health Geogr. 2022;21(1):12.

    PubMed Central  PubMed  Google Scholar 

  42. Tortorella A, et al. New determinants of mental health: the role of noise pollution. A narrative review. Int Rev Psychiatry. 2022;34:1–14.

    Google Scholar 

  43. King JD, Zhang S, Cohen A. Air pollution and mental health: associations, mechanisms and methods. Curr Opin Psychiatry. 2022;35(3):192–9.

    PubMed  Google Scholar 

  44. Zhang R, Zhang C-Q, Rhodes RE. The pathways linking objectively-measured greenspace exposure and mental health: a systematic review of observational studies. Environ Res. 2021;198: 111233.

    CAS  PubMed  Google Scholar 

  45. Hao G, et al. Associations of PM2.5 and road traffic noise with mental health: evidence from UK Biobank. Environ Res. 2022;207:112221.

    CAS  PubMed  Google Scholar 

  46. Jigeer G et al. Association of residential noise exposure with maternal anxiety and depression in late pregnancy. Environ Int, 2022. 168:107473

    PubMed  Google Scholar 

  47. Zhang X, et al. Geographic ecological momentary assessment (GEMA) of environmental noise annoyance: the influence of activity context and the daily acoustic environment. Int J Health Geogr. 2020;19(1):50.

    PubMed Central  PubMed  Google Scholar 

  48. Smith MG, Cordoza M, Basner M. Environmental noise and effects on sleep: an update to the WHO systematic review and meta-analysis. Environ Health Perspect. 2022;130(7):076001.

    PubMed Central  PubMed  Google Scholar 

  49. Beyer KMM, et al. Exposure to neighborhood green space and mental health: evidence from the survey of the health of wisconsin. Int J Environ Res Public Health. 2014;11(3):3453–72.

    PubMed Central  PubMed  Google Scholar 

  50. Dzhambov AM, et al. Residential greenspace is associated with mental health via intertwined capacity-building and capacity-restoring pathways. Environ Res. 2019;178: 108708.

    CAS  PubMed  Google Scholar 

  51. Moreira TCL, et al. Assessing the impact of urban environment and green infrastructure on mental health: results from the Sao Paulo megacity mental health survey. J Expo Sci Environ Epidemiol. 2021;32(2):205–12.

    PubMed  Google Scholar 

  52. Aerts R, et al. Residential green space and mental health-related prescription medication sales: an ecological study in Belgium. Environ Res. 2022;211: 113056.

    CAS  PubMed  Google Scholar 

  53. Helbich M, et al. Can’t see the wood for the trees? An assessment of street view- and satellite-derived greenness measures in relation to mental health. Landsc Urban Plan. 2021;214: 104181.

    Google Scholar 

  54. Generaal E, et al. Not urbanization level but socioeconomic, physical and social neighbourhood characteristics are associated with presence and severity of depressive and anxiety disorders. Psychol Med. 2019;49(1):149–61.

    PubMed  Google Scholar 

  55. Hartley K, et al. Residential surrounding greenness and self-reported symptoms of anxiety and depression in adolescents. Environ Res. 2021;194: 110628.

    CAS  PubMed  Google Scholar 

  56. Beute F, De Kort Y, Ijsselsteijn W. Restoration in its natural context: how ecological momentary assessment can advance restoration research. Int J Environ Res Public Health. 2016;13(4):420.

    PubMed Central  PubMed  Google Scholar 

  57. Roenneberg T, Merrow M. Circadian clocks—the fall and rise of physiology. Nat Rev Mol Cell Biol. 2005;6(12):965–71.

    CAS  PubMed  Google Scholar 

  58. Kondo MC, et al. Momentary mood response to natural outdoor environments in four European cities. Environ Int. 2020;134: 105237.

    PubMed  Google Scholar 

  59. Nahum M, et al. Immediate mood scaler: tracking symptoms of depression and anxiety using a novel mobile mood scale. JMIR mHealth uHealth. 2017;5(4): e44.

    PubMed Central  PubMed  Google Scholar 

  60. Shin KE, Newman MG, Jacobson NC. Emotion network density is a potential clinical marker for anxiety and depression: comparison of ecological momentary assessment and daily diary. Br J Clin Psychol. 2022;61(S1):31–50.

    PubMed  Google Scholar 

  61. Vich G, Marquet O, Miralles-Guasch C. Green exposure of walking routes and residential areas using smartphone tracking data and GIS in a Mediterranean city. Urban For Urban Green. 2019;40:275–85.

    Google Scholar 

  62. Xu Y, et al. The impact of GPS-derived activity spaces on personal PM2.5 exposures in the MADRES cohort. Environ Res. 2022;214: 114029.

    CAS  PubMed  Google Scholar 

  63. He MZ, et al. Intermediate- and long-term associations between air pollution and ambient temperature and glycated hemoglobin levels in women of child bearing age. Environ Int. 2022;165: 107298.

    CAS  PubMed Central  PubMed  Google Scholar 

  64. Guo H, et al. Coupling mobile phone data with machine learning: how misclassification errors in ambient PM2.5 exposure estimates are produced? Sci Total Environ. 2020;745: 141034.

    CAS  PubMed  Google Scholar 

  65. Olsen JR, et al. Day-to-day intrapersonal variability in mobility patterns and association with perceived stress: a cross-sectional study using GPS from 122 individuals in three european cities. SSM Popul Health. 2022;19: 101172.

    PubMed Central  PubMed  Google Scholar 

  66. Shiffman S, Stone AA, Hufford MR. Ecological momentary assessment. Ann Rev Clin Psychol. 2008;4:1–32.

    PubMed  Google Scholar 

  67. Chaix B, et al. GPS tracking in neighborhood and health studies: a step forward for environmental exposure assessment, a step backward for causal inference? Health Place. 2013;21:46–51.

    PubMed  Google Scholar 

  68. Shrestha S, et al. Spatial access to sport facilities from the multiple places visited and sport practice: assessing and correcting biases related to selective daily mobility. Soc Sci Med. 2019;236: 112406.

    PubMed  Google Scholar 

Download references

Acknowledgements

We sincerely thank Mei-Po Kwan for her valuable comments on YL’s proposal. We also thank the reviewer and editor for their constructive feedback, which improved the paper.

Funding

YL was funded by the the China Scholarship Council (CSC) (No. 201908440272). This study was part of the NEEDS project. The research leading to this paper received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant Agreement Number 714993). The funders had no role concerning the study design, data collection and analysis, interpretation, or dissemination.

Author information

Authors and Affiliations

Authors

Contributions

YL: conceptualization, methodology, formal analysis, visualization, writing—original draft, funding acquisition. MH: conceptualization, data curation, writing—review and editing, project administration, funding acquisition, supervision.

Corresponding author

Correspondence to Yuliang Lan.

Ethics declarations

Ethics approval and consent to participate

The study obtained ethical approval (FETC17-060) from the Ethics Review Board of the Faculty of Social and Behavioral Sciences of Utrecht University. All participants consented before inclusion. All procedures have been performed in accordance with the Declaration of Helsinki. The study complies with the General Data Protection Regulation of the European Union.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

GPS data cleaning. Table S2. The number of participants with different GPS data quality. Table S3. Sample characteristics before and after GPS cleaning process. Table S4. Associations between the GAD-7 scores and the exposure patterns. Exposures were based on 100 m buffers and 10 min time window. The model was adjusted for age, sex, income, employment status, marital status, and educational background. Figure S1. Davies Bouldin index for different numbers of clusters and across various model settings (i.e., time windows and buffer sizes). Figure S2. Davies Bouldin index for different window size of constraints and across various model settings (i.e., time windows and buffer sizes). Figure S3. Sequential environmental exposure patterns of the four clusters based on 50 m buffers and 10-min time windows. The data partition was conducted with multivariate time series clustering and constrained dynamic time warping. Figure S4. Sequential environmental exposure patterns of the four clusters based on 100 m buffers and 30-min time windows. The data partition was conducted with multivariate time series clustering and constrained dynamic time warping. Figure S5. Sequential environmental exposure patterns of the four clusters based on 50 m buffers and 30-min time windows. The data partition was conducted with multivariate time series clustering and constrained dynamic time warping. Figure S6. Regression coefficients for the associations between the GAD-7 scores and the exposure patterns across different buffers (50 m and 100 m) and time windows (10 min and 30 min). Cluster 1 (“strongly health-threatening”) served as the reference category. The model was adjusted for age, sex, income, employment status, marital status, and educational background. Figure S7. Preprocessing of mobility-based exposure data using aggregation-based approach. Figure S8. Preprocessing of mobility-based exposure data in our study using the time-series approach. Individuals of the same color were assigned in the same cluster.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lan, Y., Helbich, M. Short-term exposure sequences and anxiety symptoms: a time series clustering of smartphone-based mobility trajectories. Int J Health Geogr 22, 27 (2023). https://0-doi-org.brum.beds.ac.uk/10.1186/s12942-023-00348-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12942-023-00348-1

Keywords