- Research article
- Open Access
The Work Stress Questionnaire (WSQ) – reliability and face validity among male workers
BMC Public Health volume 19, Article number: 1580 (2019)
The Work Stress Questionnaire (WSQ) was developed as a self-administered questionnaire with the purpose of early identification of individuals at risk of being sick-listed due to work-related stress. It has previously been tested for reliability and face validity among women with satisfying results. The aim of the study was to test reliability and face validity of the Work Stress Questionnaire (WSQ) among male workers.
For testing reliability, a test-retest study was performed where 41 male workers filled out the questionnaire on two occasions at 2 weeks intervals. For evaluating face validity, seven male workers filled out the questionnaire and gave their opinions on the questions, scale steps and how the items corresponded to their perception of stress at work.
The WSQ was, for all but one item, found to be stable over time. The item Supervisor considers one’s views showed a systematic disagreement, i.e. there was a change common to the group for this item. Face validity was confirmed by the male pilot group.
Reliability and face validity of the WSQ was found to be satisfying when used on a male population. This indicates that the questionnaire can be used also for a male target group.
Work-related disorders are a common problem in Sweden as well as in Europe [1, 2]. A survey on work-related disorders that was carried out among workers in Sweden in 2016, found that 26% of the female workforce and 19% of the male experienced disorders related to their working situation . The survey also found that up until 2014, physical conditions have been the predominant cause for work-related disorders among men, however, stress and mental strain has now reached the same levels. Estimating the cost of work-related stress to society is complex, depending on definition of work-related stress and costs associated to it. The cost per year to society has been found to range between 221.3 million USD to 187 billion . The cost for sick-listing in Sweden 2018, taking only into account the cost for rehabilitation benefits and sickness compensation, was approximately 4 billion USD, where stress-related and adjustment disorders represented 20% of all the sick-listing cases . Between the years 2012 and 2016, stress as a cause of work-related disorders in Sweden increased from 6 to 8% for men and from 10 to 15% for women . Along with staggering numbers for sick-leave, a large proportion of the workforce continue to go to work despite experiencing work-related problems . Although there are gender differences in sick-listing, where mental disorders are a more common cause for sick-leave in women , sick-listing due to work-related stress is rising among both women and men .
It has long been known that several work-related psychosocial factors such as conflicts at work, low influence at work, low co-worker support, poor organizational structure, low justice in interpersonal treatment and decision latitude is connected with sick leave [7,8,9,10,11,12,13,14] and common mental disorders [14, 15]. Interactive effects of poor organizational climate and high work commitment has been found to be associated with a higher rate of sick-leave among both women and men . The world of work is also changing. Boundaries between work and home are challenged when new technology such as smartphones leads to flexible working places and/or hours [17, 18].
Workers tend to experience ill-health due to work-related stress long before sick-listing [9, 19, 20], and often seek help for these complaints at primary health care centers . GPs, however, have reported not having sufficient knowledge on how to address issues related to the patients working situation . Early interventions to address stress-related disorders are of importance . The Work Stress Questionnaire (WSQ) was developed with the intention to identify individuals at risk of sick-leave due to work-related stress. It has been tested for reliability and face validity among women with satisfying results . In the present study, reliability and face validity was to be tested among male workers.
The WSQ was developed by Holmgren et al. , as a self-administered questionnaire with the purpose of early identification of individuals at risk of being sick-listed due to work-related stress. It consists of only 21 questions, which makes it suitable to use in a clinical setting where time often is sparse. Another advantage is that it is not targeting a specific diagnosis, as other screening tools [25, 26], but can be used to identify work-related stress regardless of the patient’s complaint. The WSQ emanates from the experiences of sick-listed workers and takes into consideration the interaction between personal and environmental factors . It has previously been used in a study to analyze the connection between presence of work-related stress and future work absenteeism in a primary health care setting  and a cohort study investigating the association between work-related stress and ill-health/sick-leave in women . In the first study , a presence of high stress due to poor organizational climate, especially when coexisting with high personal demands, significantly increased the risk of sick-leave a year later. The women in the cohort study experiencing a higher level of overall work-related stress also had higher rates of self-reported ill-health . Women reporting low influence at work and high stress-levels due to indistinct organization also had a higher probability of sick-leave . In a prospective, longitudinal study the WSQ was found to predict sickness absence for as far as up to 8 years [Knapstad M, Lissner L, Björkelund C, Holmgren K. Organizational climate and work commitment as predictor of 10-year registered sickness absence: The Population Study of Women in Gothenburg, in preparation].
Both female and male workers seem to be experiencing ill-health due to work-related stress, and this is often present long before sick-listing [9, 19, 20]. Stress and mental strain as cause of work-related disorders increases, not only among women but also among men . The WSQ was developed using a female reference group (24). Gender differences may influence the psychometric properties of a questionnaire , it is therefore important to test reliability and validity of the WSQ for men.
The aim of this study was to evaluate the reliability and face validity of the Work Stress Questionnaire (WSQ) when used on a male working population.
For testing reliability, a test-retest study was performed. The WSQ was filled out by the same respondent on two occasions at 2 weeks intervals. Face validity was evaluated by using a pilot-group that filled out the questionnaire and were encouraged to give comments, either written or oral, concerning the questionnaire. The target group was non-sick-listed employed men aged 18–64 years. The study took place in Gothenburg, Sweden 2017.
The work stress questionnaire (WSQ)
The WSQ consists of 21 items covering 4 main themes: Indistinct organization and conflicts, Individual demands and commitment, Influence at work and Work to leisure time interference (Additional file 1). The questions of the first two themes can be answered Yes, Partly or No. To determine the level of stressfulness in the items of the first two themes, the questions are followed by the question Do you perceive it as stressful? The respondent grades the level of stressfulness by answering Not stressful, Less stressful, Stressful or Very stressful. The items of the second two themes can be answered Yes, always, Yes, often, No, rarely or No, never. Demographic data concerning employment, age and educational level was also collected. In the follow-up questionnaire for the testing of reliability, a question concerning changes at the workplace during the 2-week period was added: Has anything deviating occurred at your workplace since the first time you filled out the questionnaire that may affect your answers today? If the respondents answered yes to this question, they were excluded from the study.
Procedure and data collection
Respondents were recruited from different areas of the labour-market using the researcher’s own social network. Contact persons, who were not involved in any part of the research, were used to identify and approach eligible recruits. The procedure has similarities to snowball sampling . Snowball sampling uses the researchers´ social network to reach a target group with specific characteristics, in this case non-sick-listed employed men. Written information was given to eligible recruits, containing a short background of the study and information about the procedure. Emphasis was laid on the voluntary nature of participation in the study, and the respondents were informed that they could choose to terminate participation at any point without explanation. It was clearly stated in the written information to the participants that consent to participate in the study was given by filling out the WSQ. The completed questionnaire was then put in a sealed envelope and passed on to the research group. This procedure was then repeated 2 weeks later. The questionnaire did not contain any personal information, only a code for matching with the second questionnaire. For this part of the study, a population of 57 employed men, aged 18–64 years, was included. Sixteen of the respondents were excluded, of which 2 respondents did not fill out the second questionnaire and 14 respondents answered yes to the appended question Has anything deviating occurred at your workplace since the first time you filled out the questionnaire that may affect your answers today? in the second questionnaire. A total of 41 respondents (n = 41) remained for analysis. The demographics of the group are presented in Table 1.
To evaluate face validity of the WSQ, a pilot-group comprising seven employed men were recruited in the same way as for the test-retest part of the study. The group consisted of men working in both public and private sector, at small and large workplaces and in different positions. The respondents were asked to fill out the WSQ and leave notes, either written or oral, concerning the items and scales. Afterwards, the respondents were encouraged to give comments on scale steps and formulation of the questions as well as if the questionnaire corresponded to their understanding of work-related stress.
To analyze the reliability of the questionnaire, a test-retest analysis was performed using a rank-invariant method for analysis of paired ordered categorical data described by Svensson . This method for assessing reliability of a questionnaire has been used previously [24, 31] and is recommended for analysis of ordered categorical data . The method is suitable for analysis of change and is valid regardless of the number of response categories. There is no need for combining or dichotomizing the category distributions. This made it possible to analyze each item of the questionnaire, assessing the occasional and systematic disagreement of each item. As some of the items are divided into two parts, where the first part contains the categories Yes, Partly, No and the second part Not stressful, Less stressful, Stressful, Very stressful, these two parts were analyzed separately. Percentage agreement was calculated for both parts, the second part was then analyzed further for Relative Rank Variance (RV), Relative Position (RP) and Relative Concentration (RC). For all other items PA, RV, RP and RC were calculated. RV ranges from 0 to 1 and indicates individual changes. The lower the RV-number is, the smaller the occasional disagreement. The items were also analyzed for systematic disagreement by plotting each item in a graph where the x-axis represents the cumulated proportions for the marginal distributions at first test and the y-axis represents the cumulated proportion at retest, see Fig. 1. Each axis ranges from 0 to 1. If there is no disagreement between test and retest, the graph will be plotted as a straight line from point (0, 0) to point . If there is a systematic disagreement, the graph will be either concave or convex. This will be expressed as Relative Position (RP). If there has been a systematic change in concentration, it will be expressed as the Relative Concentration (RC). If there is a change in RC it will result in an S-shaped graph.
Both the RP and RC measurements were calculated for each item. RP and RC ranges from − 1 to 1, where a number close to 0 indicates a low disagreement between test and retest. Both RP and RC refer to changes common to the group. The confidence interval (CI) for RV, RP and RC values for each item was calculated using the bootstrap method, based on the jack-knife standard error. If the CI did not include 0, the item was assessed as having significantly changed between test and retest occasion.
The PA of the items ranged from 55 to 98% with a median PA of 77%. To evaluate the stability of the questionnaire, RV, RP and RC were calculated for each item. The result is presented in Table 2. The second parts of two of the items, Knowledge of work assignments and Involved in conflicts at work, were not analyzed due to a low response rate. The respondent is only requested to answer the second part of these items if they answer the first part with “No” and “Yes” respectively, which explains why the response rate was low for these two items. The first parts of these items were still analyzed for PA. All but one item remained stable over time regarding occasional and systematic disagreement, which means that responses from the two measurements did not vary regarding position on the scale or concentration of responses on group level. RV was close to 0 for all items, implying that individual variation between test and retest was low. However, the confidence interval for RV for the item Do you take more responsibility at work than you ought to?/stress was large (RV = 0.14, CI 0.00–0.39). The item Supervisor considers one’s views showed a significant change in systematic disagreement (RP = 0.10, CI 0.02–0.18), shifting towards higher response categories which means grading this item as more stressful on retest occasion. Since the RV value was 0.0 for this item, it cannot be explained by individual variation.
Face validity was confirmed by the pilot-group. The participants all confirmed the relevance of the questions regarding work-related stress and the items were found to be generally easy to answer.
For assessing reliability of the questionnaire, a test-retest design was chosen. This design has been suggested as suitable for testing reliability in an already existing questionnaire . The time between test and retest occasion was set to 2 weeks. This interval was chosen so that the respondents probably would have forgotten their responses from the first questionnaire but not too long so that changes in work environment might have occurred . However, ensuring the two occasions to be exactly the same is not possible. To further decrease the possibility of this affecting the answers, the appended question Has anything deviating occurred at your workplace since the first time you filled out the questionnaire that may affect your answers today? was formulated. This made it possible to identify respondents who felt that there had been a change in their working conditions and exclude them from analysis.
For statistical analysis a rank-invariant non-parametric method was used, which is suitable for analyzing ordinal data. Pairing of data in this method makes it possible to identify both random individual changes and systematic changes common to the group . Compared to for example weighted Kappa, which is a method commonly used for analyzing changes in data, the rank invariant method is not dependent on the number of categories . Kappa statistics also treat data as nominal. A study comparing the rank-invariant method used in this study and Kappa statistics found the rank-invariant method to be more sensitive detecting changes in data .
All but one item showed stability over time. For the item Supervisor considers one’s views there was a change common to the group, grading this item as more stressful on retest occasion. The change was however small, with a RP-value of 0.10 (CI 0.02–0.18). How much influence one experiences at work might be something the respondents have not been reflecting on, and may have been made aware of at base-line. At retest they may therefore grade this item with higher response categories. This item was however not commented by the pilot group.
The questionnaire was developed using a female reference group. Gender has been identified as a factor affecting validity of a questionnaire , and therefore needs to be evaluated for the target group. A report from the Swedish agency for work environment found that when women and men are exposed to the same stressors at work they respond in a similar way . In a recently published review and meta-analysis, no gender differences were found regarding depressive symptoms when exposed to the same psychosocial work factors related to stress . This supports the findings of this study, that the questionnaire is useful also for a male population. The intent of the questionnaire is to be self-administered and therefore needs to be able to complete without reluctance or hesitation. Few signs of this, for example written comments in the questionnaire or ticking in between scale-pace boxes , were found in the test-retest part of the study implying that the attitudes among the participants towards the questionnaire was good.
In the WSQ, four scale-steps are available for determining the level of stressfulness of the items. How many scale-steps that should be used in questionnaires is an issue that is up for debate, where there are both advantages and disadvantages concerning using an even or odd number of scale-steps . Using a scale with four steps forces the respondent to commit him- or herself to either experiencing the item as stressful or not. Since the items remained stable over time, the even number of scale-steps does not seem to affect the reliability of the questionnaire.
The purpose of this study was not to screen for work-related stress but to test the stability of the questionnaire over time. Snowball sampling allowed for recruitment of respondents from the target group.
There are some limitations to the study. In one part of the questionnaire the items are divided into two, where the second part is only answered if the respondent answers positively in the first part of the item. For two of the items, this has resulted in a low response rate to the second part of these items. To increase probability of having enough answers to be able to do a statistical analysis of these items, a larger population sample would have been needed. A larger study population would also have increased the power of the statistical analysis for all items.
Another limitation to the study is that face validity was only confirmed by the pilot group. Face validity may be the weakest form of validation, but has been suggested as a first step in the process of validating a questionnaire . More thorough research to confirm the validity of the questionnaire when used on a male population is however needed.
An increasing level of men on sick leave due to stress-related disorders calls for a valid instrument for early identification of persons at risk of being put out of work due to these factors. Results from the present study indicate that the WSQ is a reliable and valid questionnaire when used on a male target group. Future research on the development of the questionnaire should focus on more extensive evaluation of validity. The predictive value of the questionnaire on sickness absence in a male population was not in the scope of this article and is also an issue that needs further research.
Availability of data and materials
The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.
Relative rank variance
Work Stress Questionnaire
Arbetsmiljöverket (Swedish Work Environment Authority). Work-Related Disorders 2016. Stockholm: Arbetsmiljöverket; 2016. Arbetsmiljöstatistik; rapport 2016–3
Eurofound. Developments in working life in Europe 2015. EurWORK annual review. Luxembourg: Publications Office of the European Union; 2016.
Hassard J, Teoh KRH, Visockaite G, Dewe P, Cox T. The cost of work-related stress to society: a systematic review. J Occup Health Psychol. 2018;23(1):1–17.
Regeringens skrivelse 2018/19:101 Årsredovisning för staten 2018. 2019. (The Governments writ/document 2018/19:101; Annual report of the state 2018. 2019).
Janssens H, Clays E, De Clercq B, De Bacquer D, Casini A, Kittel F, et al. Association between psychosocial characteristics of work and presenteeism: a cross-sectional study. Int J Occup Med Environ Health. 2016;29(2):331–44.
Ferrie JE, Vahtera J, Kivimäki M, Westerlund H, Melchior M, Alexanderson K, et al. Diagnosis-specific sickness absence and all-cause mortality in the GAZEL study. J Epidemiol Community Health. 2009;63:50–5.
Vahtera J, Kivimaki M, Pentti J, Theorell T. Effect of change in the psychosocial work environment on sickness absence: a seven year follow up of initially healthy employees. J Epidemiol Community Health. 2000;54(7):484–93.
Falco A, Girardi D, Marcuzzo G, De Carlo A, Bartolucci GB. Work stress and negative affectivity: a multi-method study. Occup Med (Lond). 2013;63(5):341–7.
Holmgren K, Fjallstrom-Lundgren M, Hensing G. Early identification of work-related stress predicted sickness absence in employed women with musculoskeletal or mental disorders: a prospective, longitudinal study in a primary health care setting. Disabil Rehabil. 2013;35(5):418–26.
Vaananen A, Kalimo R, Toppinen-Tanner S, Mutanen P, Peiro JM, Kivimaki M, et al. Role clarity, fairness, and organizational climate as predictors of sickness absence: a prospective study in the private sector. Scand J Public Health. 2004;32(6):426–34.
Head J, Kivimaki M, Siegrist J, Ferrie JE, Vahtera J, Shipley MJ, et al. Effort-reward imbalance and relational injustice at work predict sickness absence: the Whitehall II study. J Psychosom Res. 2007;63(4):433–40.
Andrea H, Beurskens AJ, Metsemakers JF, van Amelsvoort LG, van den Brandt PA, van Schayck CP. Health problems and psychosocial work environment as predictors of long term sickness absence in employees who visited the occupational physician and/or general practitioner in relation to work: a prospective study. Occup Environ Med. 2003;60(4):295–300.
Voss M, Floderus B, Diderichsen F. Physical, psychosocial, and organisational factors relative to sickness absence: a study based on Sweden post. Occup Environ Med. 2001;58(3):178–84.
Kivimaki M, Elovainio M, Vahtera J, Ferrie JE. Organisational justice and health of employees: prospective cohort study. Occup Environ Med. 2003;60(1):27–34.
Stansfeld S, Candy B. Psychosocial work environment and mental health - a meta-analytic review. Scand J Work Environ Health. 2006;32(6):443–62.
Holmgren K, Hensing G, Dellve L. The association between poor organizational climate and high work commitments, and sickness absence in a general population of women and men. J Occup Environ Med. 2010;52(12):1179–85.
Eurofound and the International Labour Office. Working anytime, anywhere: the effects on the world of work. Luxembourg: Publication Office of the European Union; 2017.
Mellner C. After-hours availability expectations, work-related smartphone use during leisure, and psychological detachment. Int J Workplace Health Manag. 2016;9(2):146–64.
Stansfeld SA, Fuhrer R, Shipley MJ, Marmot MG. Work characteristics predict psychiatric disorder: prospective results from the Whitehall II study. Occup Environ Med. 1999;56(5):302–7.
Niedhammer I, Chastang JF, David S, Barouhiel L, Barrandon G. Psychosocial work environment and mental health: job-strain and effort-reward imbalance models in a context of major organizational changes. Int J Occup Environ Health. 2006;12(2):111–9.
Toft T, Fink P, Oernboel E, Christensen K, Frostholm L, Olesen F. Mental disorders in primary care: prevalence and co-morbidity among disorders. Results from the functional illness in primary care (FIP) study. Psychol Med. 2005;35(8):1175–84.
Nilsing E, Söderberg E, Berterö C, Öberg B. Primary Healthcare Professionals’ Experiences of the Sick Leave Process: A Focus Group Study. J Occup Rehabil. 2013;23(3):450-61.
Virtanen M, Vahtera J, Pentti J, Honkonen T, Elovainio M, Kivimäki M. Job strain and Psychologic distress: influence on sickness absence among Finnish employees. Am J Prev Med. 2007;33(3):182–7.
Holmgren K, Hensing G, Dahlin-Ivanoff S. Development of a questionnaire assessing work-related stress in women - identifying individuals who risk being put on sick leave. Disabil Rehabil. 2009;31(4):284–92.
Shaw W, van der Windt D, Main C, Loisel P, Linton S. Early patient screening and intervention to address individual-level occupational factors (“blue flags”) in Back disability. J Occup Rehabil. 2009;19(1):64–80.
Lexis M, Jansen N, Amelsvoort L, Huibers M, Berkouwer A, Tjin A, Ton G, et al. Prediction of long-term sickness absence among employees with depressive complaints. J Occup Rehabil. 2012;22(2):262–9.
Holmgren K, Dahlin-Ivanoff S, Björkelund C, Hensing G. The prevalence of work-related stress, and its association with self-perceived health and sick-leave, in a population of employed Swedish women. BMC Public Health. 2009;9(73). https://doi.org/10.1186/1471-2458-9-73.
Switzer GE, Wisniewski SR, Belle SH, Dew MA, Schultz R. Selecting, developing, and evaluating research instruments. Soc Psychiatry Psychiatr Epidemiol. 1999;34(8):399–409.
Morgan DL. Snowball sampling. In: Given LM, editor. The SAGE Encyclopedia of Qualitative Research Methods. Thousand Oaks: SAGE publication; 2008. p. 816–7.
Svensson E. Ordinal invariant measures for individual and group changes in ordered categorical data. Stat Med. 1998;17(24):2923–36.
Dahlin-Ivanoff S, Sonn U, Svensson E. Development of an ADL instrument targeting elderly persons with age-related macular degeneration. Disabil Rehabil. 2001;23(2):69–79.
Nunnally JC, Bernstein IH. Psychometric theory. 3rd ed. New York: McGraw-Hill; 1994.
Brenner H, Kliebsch U. Dependence of weighted kappa coefficients on the number of categories. Epidemiology. 1996;7(2):199–202.
Bunketorp L, Carlsson J, Kowalski J, Stener-Victorin E. Evaluating the reliability of multi-item scales: a non-parametric approach to the ordered categorical structure of data collected with the swedish version of the Tampa scale for kinesiophobia and the self-efficacy scale. J Rehabil Med. 2005;37:330–4.
Sverke M, Falkenberg H, Kecklund G, Magnusson Hanson L, Lindfors P. Kvinnors och mäns arbetsvillkor - betydelsen av organisatoriska faktorer och psykosocial arbetsmiljö för arbets- och hälsorelaterade utfall. Stockholm: Arbetsmiljöverket (Swedish Work Environment Authority); 2016.
Theorell T, Hammarström A, Aronsson G, Träskman Bendz L, Grape T, Hogstedt C, et al. A systematic review including meta-analysis of work environment and depressive symptoms. BMC Public Health. 2015;15. https://doi.org/10.1186/s12889-015-1954-4.
Streiner DL, Norman GR, Cairney J. Health measurement scales: a practical guide to their development and use. 5th ed. Incorporated: Oxford University Press; 2015.
The authors would like to thank Robin Fornazar for invaluable help during recruitment of respondents and processing of data.
This study was supported by grants from the local Research Unit (FoU Göteborg Södra Bohuslän). The funding body had no part in the study design, collection, interpretation and analysis of the data, or in writing the manuscript.
Ethics approval and consent to participate
The study was approved by the Regional Ethical Review Board, Gothenburg, Sweden, ref. nr 473–15.
Written information was given to eligible recruits, containing a short background of the study and information about the procedure. Emphasis was laid on the voluntary nature of participation in the study, and the respondents were informed that they could choose to terminate participation at any point without explanation. Consent to participate for test of reliability was given by filling out the questionnaire and for test of face validity by signing written informed consent.
Consent for publication
The authors declare that they have no competing interests
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Frantz, A., Holmgren, K. The Work Stress Questionnaire (WSQ) – reliability and face validity among male workers. BMC Public Health 19, 1580 (2019). https://doi.org/10.1186/s12889-019-7940-5
- Work related stress
- Face validity
- Test retest