Forecasting call and chat volumes at online helplines for mental health

de Boer, Tim Rens; Mérelle, Saskia; Bhulai, Sandjai; Gilissen, Renske; van der Mei, Rob

doi:10.1186/s12889-023-15887-2

Research
Open access
Published: 27 May 2023

Forecasting call and chat volumes at online helplines for mental health

Tim Rens de Boer¹,
Saskia Mérelle²,
Sandjai Bhulai³,
Renske Gilissen² &
…
Rob van der Mei^1,3

BMC Public Health volume 23, Article number: 984 (2023) Cite this article

1402 Accesses
Metrics details

Abstract

Background

Each year, many help seekers in need contact health helplines for mental support. It is crucial that they receive support immediately, and that waiting times are minimal. In order to minimize delay, helplines must have adequate staffing levels, especially during peak hours. This has raised the need for means to predict the call and chat volumes ahead of time accurately. Motivated by this, in this paper, we analyze real-life data to develop models for accurately forecasting call volumes, for both phone and chat conversations for online mental health support.

Methods

This research was conducted on real call and chat data (adequately anonymized) provided by 113 Suicide Prevention (Over ons | 113 Zelfmoordpreventie) (throughout referred to as ‘113’), the online helpline for suicide prevention in the Netherlands. Chat and phone call data were analyzed to better understand the important factors that influence the call arrival process. These factors were then used as input to several Machine Learning (ML) models to forecast the number of call and chat arrivals. Next to that, senior counselors of the helpline completed a web-based questionnaire after each shift to assess their perception of the workload.

Results

This study has led to several remarkable and key insights. First, the most important factors that determine the call volumes for the helpline are the trend, and weekly and daily cyclic patterns (cycles), while monthly and yearly cycles were found to be non-significant predictors for the number of phone and chat conversations. Second, media events that were included in this study only have limited—and only short-term—impact on the call volumes. Third, so-called (S)ARIMA models are shown to lead to the most accurate prediction in the case of short-term forecasting, while simple linear models work best for long-term forecasting. Fourth, questionnaires filled in by senior counselors show that the experienced workload is mainly correlated to the number of chat conversations compared to phone calls.

Conclusion

(S)ARIMA models can best be used to forecast the number of daily chats and phone calls with a MAPE of less than 10 in short-term forecasting. These models perform better than other models showing that the number of arrivals depends on historical data. These forecasts can be used as support for planning the number of counselors needed. Furthermore, the questionnaire data show that the workload experienced by senior counselors is more dependent on the number of chat arrivals and less on the number of available agents, showing the value of insight into the arrival process of conversations.

Peer Review reports

Background

Many countries have helplines to support people struggling with mental health problems, such as suicidal thoughts [2]. These helplines provide immediate and anonymous support, often free of cost, to improve mental health and prevent suicides [3]. The Netherlands has multiple helplines, like the listen helpline (Dutch: de Luisterlijn) [4], the helpline for children (Dutch: de Kindertelefoon) [5], and 113 Suicide Prevention, which is the helpline for suicide prevention in the Netherlands, providing telephone-call as well as chat support [6]. This paper focuses on the helpline of 113 Suicide Prevention, but the methodology and results also provide important insights for other helplines. In the Netherlands alone, on average five persons die each day by suicide [7], and worldwide more than 700,000 people annually [8]. Suicide is a global mental health phenomenon. 113 Suicide Prevention has the mission that no one should die alone and in despair of suicide and to break the taboo around suicide. This national suicide prevention center started as 113Online founded by Jan Mokkenstorm on 7 October 2009. Help seekers with suicidal thoughts or their family and friends can contact 113 round-the-clock anonymously, either by telephone or chat. Besides mental health services, 113 also provides training services, leads the National Suicide Prevention Agenda [9] and has a research department. The organization is subsidized by the Ministry of Health, Welfare and Sport. During and after COVID-19, 113 saw increased chat and phone call arrivals, showing the importance of helplines during crisis situations [10]. Unpaid volunteers and paid professionals assist help seekers at 113. It is crucial to gain insight into the arrival process of these help seekers to help these people as well as possible because these insights can contribute to good predictions of call volumes, and hence adequate staffing levels, resulting in lower waiting times and a higher number of help seekers helped.

In our study, various factors were considered vital for the number of help seekers per day, such as the historic trend, daily, weekly, monthly, and yearly patterns, and the effect of large news items or events discussed in various media forms. Whitley et al.[11] and Niederkrotenthaler et al. [12] researched the influence of media events on suicides and found that the number of suicides increased after the suicide of a well-known celebrity.

Research has been done on forecasting call volumes at helplines (e.g., [13]). Gijo et al. showed the added benefit of using (S)ARIMA models to forecast call volumes in emergency services [14]. (S)ARIMA models can identify possible trends and cycles, Gijo et al. show that forecasting can be done effectively using historical data only. In contrast, research on helplines for mental health is often focused on the conversation topics or the types of callers, rather than on call volumes and waiting times. For example, Salmi et al. showed the change in conversation topics during COVID-19 [15]. Grigorash et al. have studied the caller type of mental health helplines [16]. In this context, the present paper aims to fill the gap between these studies by combining a forecasting approach mostly seen in general call centers on the one hand with the specifics of the mental health helpline context where media events might affect the demand on the other hand.

This study aims to test and compare different forecasting models on the anonymous call-volume data provided by 113. Assumptions about the helpline are validated using data analysis, especially a possible trend, cycle effects (over different time scales), and exogenous factors, such as the effect of media events. A cycle is defined as a seasonal effect, which repeats over time on a yearly, monthly, weekly, or even daily basis. Conversely, the trend shows the general tendency of the data to increase or decrease during a longer period. More specifically, we address the following hypotheses:

1. Weekly, daily, and yearly cycles are important for predicting the number of arrivals.
2. The number of incoming phone calls and chats increases during and after a large media event (such as the suicide of a celebrity).
3. Accurately forecasting the call volumes is possible using models that use historical data with possible exogenous factors. The historical data is used to incorporate possible cycles and trends, while exogenous factors are added to use the possible effect of media events.
4. The workload that counselors experience increases when the average waiting time for phone or chat is higher than usual. This applies to situations where there are more help seekers than counselors can help or more help seekers than expected.

Methods

Dataset

The data provided by 113 consists of (anonymized) call and chat conversations ranging from 2017 until 2021 and contains around 250,000 chats and 175,000 telephone calls. This dataset is time-stamped per conversation and can therefore be used separately to test hourly and/or daily forecasting for chat and telephone. The size of the dataset also makes it possible to determine effect cycles as well as identify a possible trend. The details of the number of chat and telephone conversations can be found in Table 1. Each call or chat record contains the following fields: the contact id, an initial contact id, (the channel (telephone or chat), the arrival time, the time entering the queue, the accept time, the disconnect time, the completion time, the switch count, and finally, the agent that handled the call or chat. Here, the contact id is used to identify the conversation, where contact id is the same as the initial contact id if the conversation is not forwarded, agents can forward conversations if the help seeker requires more or other help. The switch count also identifies the number of times a call has been forwarded, so the switch count would be one in the case of contact id equal to initial contact id. This paper focuses on conversations where the contact id is equal to the initial contact id. The switch count increases with one, with each conversation sent through. The data contains four timestamps, the previously mentioned arrival, queue enter, accept, and disconnect time. Help seekers arrive at 113 at the arrival time. Help seekers can be classified into two groups based on their means of communication, help seekers that call 113 (the so-called phone callers) and help seekers that use the chat function of 113 to communicate with 113 (the so-called chatters). Phone callers first have to listen to a phone tape and fill in a questionnaire, chatters are also required first to fill in a questionnaire. After the help seeker has filled in all questions, he or she enters the queue. The time it takes to fill in the questions is the so-called pre-queue duration. The chatter/caller waits until a counselor is available, and then the conversation is accepted. After finishing the conversation, the help seeker and counselor are disconnected, and finally, the agents have to fill in a wrap-up form. The wrap-up form is a form for the agent to evaluate the conversation and the help seeker, recording the conversation topic, for example, and if this person has called or chatted with 113 before; the time it takes to fill in this form is called the wrap-up duration.

Table 1 Number of chat and telephone arrivals per year

Full size table

Data preprocessing

Not all data were useful in its original form. Therefore, we identified missing values in the data and determined whether any imputations were required. Missing values are handled differently based on context: a missing value in waiting time often meant that the help seeker abandoned the queue. In some cases could also have been due to the help seeker being accepted before being queued. These values were filled in based on these conditions. Finally, the data were aggregated to obtain call and chat volumes per day and hour. We had two days for which the call and chat volumes were both zero, probably due to a technical issue. These two volumes were estimated using linear interpolation.

Forecasting models

The following models were used to forecast day volumes: ARIMA, SARIMA, Linear Regression, LSTM, and various baseline models. These models were chosen to represent different approaches for forecasting and could incorporate the different aspects we hypothesized as important.

(Seasonal) Autoregressive Integrated Moving Average (shortly, (S)ARIMA) models are well-known time series models [14] used for forecasting. ARIMA uses previously measured values for forecasting future values. SARIMA is similar to ARIMA, but here a seasonal component is added; see [17] for an overview of (S)ARIMA models. The parameters for ARIMA and SARIMA are both determined using AutoARIMA [18]. AutoARIMA is an R-method that determines the best parameters based on the Akaike Information Criterion (AIC), which is an estimator of the prediction error. Linear regression is a simple machine learning (ML) approach and was used to fit a linear trend with a weekly effect on the data. These models fit a linear relation between various factors and the outcome, in this case, the number of arrivals. In formula form, this model looks as follows:

$${\varvec{F}}={\varvec{X}}{\varvec{\beta}}+{\varvec{\varepsilon}}$$

where F is a vector containing the forecasts, X is a vector containing the input variables, β is the vector containing parameters, and finally, ε is noise.

The Long Short Term Memory (LSTM) model is a more sophisticated ML model used for forecasting in time series and is a special kind of Recurrent Neural Network (RNN). Lastly, these models are compared to various baseline models: the forecast of day i is the measurement of day i-7 or i-56, calculated as follows:

$${F}_{i}={A}_{i-7}$$

For Baselines 1 and 2, the following is used:

$${F}_{i}={A}_{i-56}$$

where ${F}_{i}$ is the forecast for day i and ${A}_{i}$ is the actual value of day i. These baselines correspond to using the actual number of phone calls and chats from one week ago (7 days) or 8 weeks (56 days). The models are compared based on the Mean Absolute Percentage Error (MAPE), defined as follows:

$${\text{MAPE}}=\frac{1}{n}\sum_{i=1}^{n}\left|\frac{{F}_{i}-{A}_{i}}{{A}_{i}}\right| \times 100\%$$

where n is the number of forecasts.

Questionnaire

Next, a questionnaire was given to senior counselors to determine when and why they experienced a high workload. This process consists of a questionnaire based on a quick scan for workload and adapted to the situation of the helpline. Senior counselors were asked to fill in a questionnaire after each shift [19]. Here, we examine whether the workload of the senior counselors is related to the number of calls or chats, and are so able, together with the forecasts, to understand adequate staffing levels. Data collection took place from 16 February to 30 April 2022. Table 2 gives an overview of the 10-item questionnaire that uses a 5-point Likert scale [20]. We transformed all questions such that a score of “5” indicates a high workload and a score of “1” indicates a low workload. Questionnaires that reported technical problems were excluded from analyses since high experienced workload can then be related to the technical issues. We can identify whether there were any technical issues based on positive answers to Question 7.

Table 2 Questions of the questionnaire

Full size table

The sum scale that represents the counselor’s experienced workload is calculated by summing all questions, except Question 7, since this question is used to filter out questionnaires with technical issues.^{Footnote 1} This sum scale was then compared to features of the objective workload captured by the data. These variables are: the number of chats and phone calls during the shift, mean waiting time of chats and phone calls, missed percentage of chats, and phone calls. The Cronbach’s alpha of the questionnaire was 0.72 [21], which tells us that using the questionnaire in its current form is acceptable. Pearson’s correlation coefficient was then used to measure the strength of the relationship, where 0 means no relation and 1 or -1 means a perfect correlation [22]. P-values < 0.05 were considered statistically significant.

Results

Trend

The number of telephone and chat conversations shows an increasing trend over the years (see Fig. 1 below). This can also be seen in Table 1, where the number of phone and chat conversations more than doubled in the period from 2017 until 2021. In 2017, on average, around 110 chats and 50 telephone calls were arriving daily. In 2021 we observed 224 chats and 184 telephone calls per day.

Weekly and daily patterns

First, the weekly pattern was determined. This was done by determining the distribution of arrivals over the different weekdays. As shown in Fig. 2, the distribution of arrivals over the week from Monday until Friday for the telephone is similar around 15%, with a drop in the weekend to around 12.7%. We see a slightly different cycle for the chats: the number of chat arrivals is similar for days from Monday until Thursday, with a drop on Friday and a larger drop on Saturday, followed by an increase on Sunday.

Next, the daily cycles were examined, similarly to determining the weekly cycles. The daily cycles can be found in Fig. 3, telephone and chat arrivals both show a dip in the early morning from 1 AM until 5 AM. The number of telephone arrivals is similar during the period from 9 AM till 8 PM, while chat arrivals show a clear peak in the evening around 8 PM.

Both daily and weekly cycles show the effects of these cycles for forecasting the number of arrivals. Besides the importance of the different cycles, it also shows the importance of forecasting chats and telephone conversations as two separate arrival processes, since they follow different cycles. Yearly and monthly cycles were also analyzed. However, both were found to not significantly vary over time, possibly due to limited available data in the case of yearly cycles.

Media events

To assess whether celebrity suicides might influence the number of call arrivals at the helpline, the periods before and after celebrity suicides were analyzed, and briefly outlined below. In the period studied, one celebrity in the Netherlands died by suicide. Figure 4 (lower graph) shows the effect of the suicide of a well-known Dutch author on the number of chat arrivals. In this period, only the data of chats are available. It is clear that the news significantly affected the number of chats, especially in the week after the news broke. Figure 4 (upper graph) also shows the absence of the effect the suicide of an internationally well-known artist had on the arrivals at the helpline. Lastly, Fig. 5 shows the effect national political news had on the arrivals, as an example of the effect of media events other than the suicide of well-known persons. We observe that in most cases the effects of these events were limited, or only short-term (one or two days). Figure 4 (lower graph) shows that there are events that have a larger or long-term effect, but most events did not have this large or long-term effect. Together with the fact that this type of event cannot be predicted ahead of time, we chose not to include these events in the forecasting models.

Forecast results

The forecast analyses were done separately for chats and phone calls. The error (in terms of the MAPE) for each model and each time window can be seen in Tables 3 and 4 for chat and telephone, respectively. The lowest MAPE, meaning the most accurate forecast. In both cases, the ARIMA and SARIMA models perform similarly and best in the case of short-term forecasting, five weeks or less for telephone, and seven weeks or less for chats. After these time windows, in both cases, the simple models perform the best for long-term forecasting, which can be seen in Tables 3 and 4, where the MAPE of the simple model (12.80 for chat and 15.01 for phone) is less than that of the (S)ARIMA models. Most remarkably, both (S)ARIMA and the simple models have a lower MAPE than the baseline model and the LSTM model, which performs even worse.

Table 3 Error term of the chat forecast models

Full size table

Table 4 Error term of the telephone forecast models

Full size table

A demo of how one-day ahead predictions using (S)ARIMA work can be seen below. Figure 6 shows that in the case of an event with a large effect the (S)ARIMA model can quickly adapt to the increase in the number of arrivals. The (S)ARIMA quickly adapts without explicitly giving the event or the reason for the increase. We observe that the predictions follow the waves of the arrivals and also adapt in cases of peaks and throughs.

Experienced workload

The senior counselors filled in 88 questionnaires (n = 32 Day, n = 41 Evening, n = 13 Night), two were excluded from analyses since respondents reported technical problems during the shift. Descriptive statistics are given in Table 5. Most questions are filled in with a mean of around 3, except for Questions 5,6, and 10, which all have a mean below 2. These answers indicate that senior counselors, in general, experience fewer problems with the pace, can still show interest in their colleagues, and have enough counselors besides interns in the shift. In contrast, they experience more problems due to the multitude of tasks, which can be seen by the mean of Question 1, the highest mean score of all questions.

Table 5 Descriptive statistics of the useable questionnaires

Full size table

Next, it was checked whether the questionnaire data contained some busy shifts; this was done by checking the number of arrivals of each shift. It is found that the questionnaires filled in by the evening shift contain the most variability of workload. The correlations found in the evening shift are presented in Fig. 7. We found that most correlations were significant, except for the correlation between the sum score and the number of phone calls (see Table 6), albeit moderately correlated (i.e., around or below 0.5). The two strongest relations are between the number of chats and the total sum score. The correlations of the percentage of unanswered chats are omitted since all chats were answered.

Table 6 Correlations between the outcome variables and the questionnaires of the evening shift

Full size table

Discussion

This paper aimed to shed light on the factors that determine call volumes at online mental health helplines. Based on real-life data from the helpline of 113 Suicide Prevention, we found that the following factors are dominant: trend, and weekly and daily cycles. The media events appear only to have a limited—or short-term—effect on the number of arrivals, contrary to the effect these kinds of events have on the number of suicides studied by Whitley et al. [10] and Niederkrotenthaler et al. [11]. To our knowledge, previous work primarily focuses on the different help-seeker types arriving at the helpline [16], but not on the actual arrival process. The insight that (S)ARIMA forecasts are most accurate shows that the arrivals at the helpline are mostly dependent on historical data and can be used at other helplines that handle (mental health) emergencies, which is comparable to what Gijo et al. found [14].

We also found that telephone forecasting can best be done with (S)ARIMA models for short-term forecasting (less than four weeks) and linear regression for long-term forecasting (more than four weeks). Chat forecasting can best be done by (S)ARIMA models for the whole test forecasting period of eight weeks or less ahead. Surprisingly, the (S)ARIMA models performed better than the LSTM models. However, it could be the case that the LSTM model can improve with more time and optimization. However, it is questionable if with more optimization and time the LSTM will perform better than the other models. The low MAPE of the (S)ARIMA models can be attributed to the workings of the models. These models are flexible. In case of an event with a large and long-lasting effect, the (S)ARIMA models can quickly adapt and include this increase or decrease in the forecasts. Overall, the rule holds that the forecasts lose accuracy when forecasting further in the future.

The results of the questionnaire show that the experienced workload of the counselors is mostly related to the number of chats during a shift. Surprisingly, the experienced workload seems to have a weaker relationship with the workload of the phone calls. Both are crucial insights into the causes of experienced workload, which was previously done for only volunteers [23]. However, it should be noted that the results of the questionnaire showed that, on average, senior counselors do not experience a high workload or seem able to work with a high workload. A higher variability in experienced workload during shifts is needed to determine the relationship between call volumes and waiting times more precisely.

Limitations

Most of the limitations encountered with this research can be attributed to the availability or quality of the data. Yearly cycles could not accurately be measured, since the data consisted of five years of chats and phone calls. However, recently 113 has seen enormous growth in the number of chat and phone calls, making it difficult to measure the yearly cycle if one is present accurately.

Media events were considered to influence the number of arrivals. The data shows that suicides of nationally well-known celebrities might have more impact than those of internationally well-known persons. Luckily, the number of Dutch well-known persons that died by suicide is limited. Therefore, it is unknown whether a similar event nowadays would lead to a similar, smaller, or larger effect.

The found correlations with the questionnaire were all significant except for the correlation between the number of phone calls and the sum score. However, more data might be needed to say more about the significance of the correlations, given the sample size of 41 on which the calculations were made.

Implications

One of the key implications considered is that the planning department of 113 Suicide Prevention can use the predictions provided by the forecasting models. The predictions offer the possibility to adjust the staffing and schedule. Staffing better fitting to the number of arrivals can relieve counselors and volunteers from stress and provides them with more time to cool down after a difficult conversation [23].

Possibilities for future research

There are several directions for future research. First, we may extend the model to include different caller types of the helpline and determine the arrival processes per type. This could introduce a larger forecasting error but could provide more information on what type of help-seeker to expect and when and how these callers could best be helped.

The effect of the level of experience of counselors on the duration of the conversation is an interesting issue. Initial results suggest no significant correlation, but this could be investigated in more detail, making distinctions between functions and different levels of experience.

Conclusion

The analysis of real-life data leads to new and important insights in forecasting the demand for online health support for mental health. The (S)ARIMA model forecasts have a MAPE of less than 10 in short-term forecasting, showing that the number of chats and telephones can be forecasted. The fact that (S)ARIMA models perform better than other models shows that the number of call and chat arrivals is more dependent on historical data, without explicitly giving data about media events. These forecasts can then be used in other processes, for example, to support the planning of counselors.

Furthermore, the results of the questionnaire show that the experienced workload of senior counselors is less dependent on the actual staffing and more on the number of chat arrivals. These results again show the importance of insight into the arrival process of the chat (and/or) telephone.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Notes

The questions used in the sum scale do not include questions concerning only chats or phone calls, except for Question 8, which only concerns incoming chats. However, this question is included, since the question is about the process of forwarding chats after triage, chats after triage are handled by the same counselors that handle phone calls. The process of forwarding chats can therefore be obstructed by a large workload of phone calls. The results using the sum score excluding and including Question 8 and the resulting conclusions were similar. Therefore, we chose to use the collected data of this item.

Abbreviations

(S)ARIMA:: (Seasonal) Autoregressive Integrated Moving Average
LSTM:: Long Short-Term Memory model
MAPE:: Mean Absolute Percentage Error
AIC:: Akaike Information Criterion
ML:: Machine Learning

References

Over ons | 113 Zelfmoordpreventie. [cited 2022 Mar 30]. Available from: https://www.113.nl/over-113/over-ons.
Brülhart M, Klotzbücher V, Lalive R, Reich SK. Mental health concerns during the COVID-19 pandemic as revealed by helpline calls. Nature. 2021;600(7887):121–6.
Article PubMed PubMed Central Google Scholar
Gould MS, Kalafat J, HarrisMunfakh JL, Kleinman M. An evaluation of crisis hotline outcomes part 2: suicidal callers. Suicide and Life-Threatening Behavior. 2007;37(3):338–52.
Article PubMed Google Scholar
De Luisterlijn | 24/7 een luisterend oor | 088 0767 000. [cited 2022 Apr 13]. Available from: https://www.deluisterlijn.nl/?gclid=CjwKCAjw6dmSBhBkEiwA_W-EoG0RmjIZxS8kiRz2y2XdVIbbNiy1-z8O3b3eo-TLgqts8nCig20lLRoC6AsQAvD_BwE.
Kindertelefoon Homepage. [cited 2022 Apr 13]. Available from: https://www.kindertelefoon.nl/.
Mokkenstorm JK, Eikelenboom M, Huisman A, Wiebenga J, Gilissen R, Kerkhof AJFM, et al. Evaluation of the 113online suicide prevention crisis chat service: outcomes, helper behaviors and comparison to telephone hotlines. Suicide and Life-Threatening Behavior. 2017;47(3):282–96.
Article PubMed Google Scholar
Statistiek CB voor de. Zelfdoding in Nederland: een overzicht vanaf 1950. Centraal Bureau voor de Statistiek. [cited 2022 Mar 22]. Available from: https://www.cbs.nl/nl-nl/longread/statistische-trends/2021/zelfdoding-in-nederland-een-overzicht-vanaf-1950?onepage=true.
Suicide. [cited 2022 Apr 13]. Available from: https://www.who.int/news-room/fact-sheets/detail/suicide.
Landelijke agenda | 113 Zelfmoordpreventie. [cited 2023 Mar 6]. Available from: https://www.113.nl/over-113/landelijke-agenda.
van der Burgt MCA, Mérelle S, Beekman ATF, Gilissen R. The impact of COVID-19 on the suicide prevention helpline in the Netherlands. Crisis: The Journal of Crisis Intervention and Suicide Prevention. 2022.
Whitley R, Fink DS, Santaella-Tenorio J, Keyes KM. Suicide mortality in Canada after the death of Robin Williams, in the context of high-fidelity to suicide reporting guidelines in the Canadian media. Can J Psychiatry. 2019;64(11):805–12.
Article PubMed PubMed Central Google Scholar
Niederkrotenthaler T, Fu KW, Yip PSF, Fong DYT, Stack S, Cheng Q, et al. Changes in suicide rates following media reports on celebrity suicide: a meta-analysis. J Epidemiol Community Health. 2012;66(11):1037–42.
Article PubMed Google Scholar
Taylor JW. Density forecasting of intraday call center arrivals using models based on exponential smoothing. Manage Sci. 2012;58(3):534–49.
Article Google Scholar
Gijo EV, Balakrishna N. SARIMA models for forecasting call volume in emergency services. International Journal of Business Excellence. 2016 [cited 2022 Mar 25]; Available from: https://doi.org/10.1504/IJBEX.2016.079252.
Salmi S, Mérelle S, Gilissen R, van der Mei R, Bhulai S. Detecting changes in help seeker conversations on a suicide prevention helpline during the COVID− 19 pandemic: in-depth analysis using encoder representations from transformers. BMC Public Health. 2022;22(1):530.
Article CAS PubMed PubMed Central Google Scholar
Grigorash A, O’Neill S, Bond R, Ramsey C, Armour C, Mulvenna MD. Predicting caller type from a mental health and well-being helpline: analysis of call log data. JMIR Ment Health. 2018;5(2): e47.
Article PubMed PubMed Central Google Scholar
Cryer JD, Chan K sik. Time series analysis: with applications in R. 2nd ed. New York: Springer; 2008. 491 p. (Springer texts in statistics).
Hyndman RJ, Khandakar Y. Automatic time series forecasting: the forecast package for R. J Stat Softw. 2008;29(27):1–22.
Google Scholar
Doe de sneltest werkdruk - FNV. [cited 2022 May 31]. Available from: https://www.fnv.nl/werk-inkomen/veilig-gezond-werken/werkdruk/doe-de-sneltest-werkdruk.
Likert R. A technique for the measurement of attitudes. Archives of Psychology. 1932;22(140):55–55.
Google Scholar
Tavakol M, Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;27(2):53–5.
Article Google Scholar
Schober P, Boer C, Schwarte LA. Correlation coefficients: appropriate use and interpretation. Anesth Analg. 2018;126(5):1763–8.
Article PubMed Google Scholar
Willems RCWJ, Drossaert CHC, Vuijk P, Bohlmeijer ET. Mental wellbeing in crisis line volunteers: understanding emotional impact of the work, challenges and resources. a qualitative study. Int J Qual Stud Health Well-being. 2021;16(1):1986920.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank Robin Costers, Sophie de Vries Robles, Menno Zwart and Kim Setkowski from 113 Suicide Prevention for their great support during this project.

Funding

This research was funded by 113 Suicide Prevention which is mainly funded by the Ministry of Health, Welfare & Sports of the Netherlands. All authors report no financial relationships with commercial interests.

Author information

Authors and Affiliations

Centrum Wiskunde & Informatica, Amsterdam, the Netherlands
Tim Rens de Boer & Rob van der Mei
113 Suicide Prevention, Amsterdam, the Netherlands
Saskia Mérelle & Renske Gilissen
Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
Sandjai Bhulai & Rob van der Mei

Authors

Tim Rens de Boer
View author publications
You can also search for this author in PubMed Google Scholar
Saskia Mérelle
View author publications
You can also search for this author in PubMed Google Scholar
Sandjai Bhulai
View author publications
You can also search for this author in PubMed Google Scholar
Renske Gilissen
View author publications
You can also search for this author in PubMed Google Scholar
Rob van der Mei
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

TB had access to the data of the study and analyzed the data. Concept and design of the study: All authors. TB drafted the manuscript. Revision of the manuscript: SM, SB, RM, RG. Study supervision: SM, RM, SB. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Tim Rens de Boer.

Ethics declarations

Ethics approval and consent to participate

Under Dutch law, the Medical Research Involving Human Subjects Act (WMO), ethical approval is not required for this study. Under WMO ethical approval is required for medical scientific research, where persons are subjected to actions or rules of conduct are imposed on them. Our study of anonymous call volumes does not require ethical approval.

Consent for publication

Not applicable.

Competing interests

We declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

de Boer, T.R., Mérelle, S., Bhulai, S. et al. Forecasting call and chat volumes at online helplines for mental health. BMC Public Health 23, 984 (2023). https://doi.org/10.1186/s12889-023-15887-2

Download citation

Received: 04 July 2022
Accepted: 12 May 2023
Published: 27 May 2023
DOI: https://doi.org/10.1186/s12889-023-15887-2

Forecasting call and chat volumes at online helplines for mental health

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Dataset

Data preprocessing

Forecasting models

Questionnaire

Results

Trend

Weekly and daily patterns

Media events

Forecast results

Experienced workload

Discussion

Limitations

Implications

Possibilities for future research

Conclusion

Availability of data and materials

Notes

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Public Health

Contact us