Reliability and validity of the Turkish version of the WHO-5, in adults and older adults for its use in primary care settings

Erhan Eser; Celalettin Çevik; Hakan Baydur; Soner Güneş; Tayfun Alperen Esgin; Çağlar Söğüt Öztekin; Esen Eker; Ufuk Gümüşsoy; Gün Baris Eser; Beyhan Özyurt

doi:10.1017/S1463423619000343

Reliability and validity of the Turkish version of the WHO-5, in adults and older adults for its use in primary care settings

Published online by Cambridge University Press: 01 July 2019

Erhan Eser ,

Celalettin Çevik

Hakan Baydur ,

Soner Güneş ,

Tayfun Alperen Esgin ,

Çağlar Söğüt Öztekin ,

Esen Eker ,

Ufuk Gümüşsoy ,

Gün Baris Eser and

Beyhan Özyurt

Show author details

Erhan Eser: Affiliation:
Department of Public Health, Manisa Celal Bayar University School of Medicine, Turkey
Celalettin Çevik*: Affiliation:
Department of Nursing, Balıkesir University Faculty of Health Sciences, Turkey
Hakan Baydur: Affiliation:
Faculty of Health Sciences, Department of Social Work, Manisa Celal Bayar University, Turkey
Soner Güneş: Affiliation:
Department of Public Health, Balıkesir University School of Medicine, Turkey
Tayfun Alperen Esgin: Affiliation:
Department of Public Health, Manisa Celal Bayar University School of Medicine, Turkey
Çağlar Söğüt Öztekin: Affiliation:
Department of Public Health, Manisa Celal Bayar University School of Medicine, Turkey
Esen Eker: Affiliation:
Department of Public Health, Çanakkale 18 Mart University School of Medicine, Turkey
Ufuk Gümüşsoy: Affiliation:
Manisa Province Health Directorate, Turkey
Gün Baris Eser: Affiliation:
School of Social and Behavioral Sciences, Erasmus University, Netherlands
Beyhan Özyurt: Affiliation:
Department of Public Health, Manisa Celal Bayar University School of Medicine, Turkey
*: Author for correspondence: Celalettin Çevik, Department of Nursing, Balıkesir University Faculty of Health Sciences, Turkey. E-mail: [email protected]

Article contents

Abstract
Background:
Methods:
Results:
Conclusion:
Introduction
Methods
Data analysis
Results
Discussion
Strengths and limitations
Conclusion
Author ORCIDs
Financial Support
Conflicts of Interest
Authors’ Contribution
Ethical Standards
References

Rights & Permissions

Abstract

Background:

This study aims to determine the psychometric properties of the World Health Organization Well-Being Index (WHO-5) Turkish version in Turkish adults and older adults.

Methods:

This is a multicenter cultural adaptation study carried out with 1752 participants. Internal consistency (by Cronbach’s alpha); Construct validity (by known groups and confirmatory factor analysis-CFI) and discriminant validity are evaluated stratified by adults and older adults. Cohen’s Effect Size is used in known groups and discriminant validity analyses.

Results:

Distribution properties of the WHO-5 Turkish version are in acceptable limits. Alpha values are 0.81 for adults and 0.86 for older adults. The variances of the 58.5% of the adults sample and 63.9% of the older adults sample are explained in Exploratory FA. Model fits (CFI) are satisfactory ( > 0.95) in both samples; but RMSEA is poor in the older adults sample (0.166) whereas it is acceptable (0.073) in the adults sample. Known groups validity and discriminant analyses are satisfactory in both adults and older adults.

Conclusion:

The WHO-5 Turkish version has a good measurement capacity, internal consistency and good model fits in both samples. The error values in the older adults group suggest that the results when testing older adults should be interpreted with caution.

Keywords

Turkey validity and reliability WHO-5 Wellbeing index

Type: Research
Information: Primary Health Care Research & Development , Volume 20 , 2019 , e100

DOI: https://doi.org/10.1017/S1463423619000343 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s) 2019

Introduction

According to the World Health Organization (WHO), the state of good health is described not only as the absence of any sickness or disability, but also as a complete sense of well-being in all psychological, bodily and social domains (Diener et al., Reference Diener, Scollon and Lucas2009). Well-being describes the state of subjective well-being which consists of positive and negative aspects (Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014; Barden et al., Reference Barden, Conley and Young2015).

The World Health Organization Well-Being Index (WHO-5), introduced by WHO in 1998, is one of the most widely used scales that broadly measure subjective well-being with a limited number of items (Warr et al., Reference Warr, Banks and Ullah1985; Bech et al., Reference Bech, Gudex and Johansen1996; Topp et al., Reference Topp, Østergaard, Søndergaard and Bech2015). The WHO-5 is a generic scale which is used to evaluate the general mental well-being of persons (Hall et al., Reference Hall, Krahn, Horner-Johnson and Lamb2011; Bech, Reference Bech2012) in clinical settings. The WHO-5 was reported to be one of the very frequently used scales that measure mental wellness and quality of life in Primary Care settings in different population groups such as adolescents and students (Yusoff et al., Reference Yusoff, Yaacob and Naing2013; Christensen et al., Reference Christensen, Haugen, Sirpal and Haavet2015; Downs et al., Reference Downs, Boucher, Campbell and Polyakov2017); pregnant women (Mortazavi et al., Reference Mortazavi, Mousavi, Chaman and Khosravi2015); individuals using primary care services (Henkel et al., Reference Henkel, Mergl, Kohnon, Allgaier, Möller and Hegerl2004; Saipanish et al., Reference Saipanish, Lotrakul and Sumrithe2009; Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014; Christensen et al., Reference Christensen, Haugen, Sirpal and Haavet2015); and in population-based studies (Khosravi et al., Reference Khosravi, Mousavi, Chaman, Sepidar-Kish, Ashrafi, Khalili and Holakouie Naieni2015). Two main studies clearly mentioned the use and the superiority of the WHO-5 for screening mental well-being in PHC settings: Henkel et al. (Reference Henkel, Mergl, Kohnen, Maier, Möller and Hegerl2003) observed that, being the briefest screening questionnaire (and therefore the most practical to use), the WHO-5 produced very high sensitivity (93%) and negative predictive values (98%) compared to the other questionnaires with standard cut-off points in their paper (Henkel et al., Reference Henkel, Mergl, Kohnen, Maier, Möller and Hegerl2003), whereas Löwe et al. (Reference Löwe, Spitzer, Gräfe, Kroenke, Quenter, Zipfel, Buchholz, Witte and Herzog2004) concluded that, all three questionnaires (The Hospital Anxiety and Depression Scale, the Patient Health Questionnaire and the WHO-5) performed well in screening of the depressive mood (Löwe et al., Reference Löwe, Spitzer, Gräfe, Kroenke, Quenter, Zipfel, Buchholz, Witte and Herzog2004). Following the presentation of the WHO-5 scale, the original version in English was translated into many other languages by the WHO Regional Office for Europe (Staehr, Reference Staehr1998; Topp et al., Reference Topp, Østergaard, Søndergaard and Bech2015) and by others (Awata et al., Reference Awata, Bech, Koizumi, Seki, Kuriyama, Hozawa, Ohmori, Nakaya, Matsuoka and Tsuji2007a; Newnham et al., Reference Newnham, Hooke and Page2010; Krieger et al., Reference Krieger, Zimmermann, Huffziger, Ubl, Diener, Kuehner and Holtforth2014; Kong et al., Reference Kong, Lee, Ip, Chow, Leung and Lam2016; Halliday et al., Reference Halliday, Hendrieckx, Busija, Browne, Nefs, Pouwer and Speight2017; Bonnín et al., Reference Bonnín, Yatham, Michalak, Martínez-Arán, Dhanoa, Torres, Santos-Pascual, Valls, Carvalho, Sánchez-Moreno, Valenti, Grande, Hidalgo-Mazzei, Vieta and Reinares2018). The clinical validity for WHO-5 was found to be more than sufficient (Awata et al., Reference Awata, Bech, Yoshida, Hirai, Suzuki, Yamashita, Ohara, Hinokio, Matsuoka and Oka2007b; Saipanish et al., Reference Saipanish, Lotrakul and Sumrithe2009; Newnham et al., Reference Newnham, Hooke and Page2010; Hall et al., Reference Hall, Krahn, Horner-Johnson and Lamb2011; Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014; Bech et al., Reference Bech, Lindberg and Moeller2018). Furthermore, it was found that the external (clinical) validity of the WHO-5 was not affected by existing comorbid psychiatric disorders, and that it was a substantial indicator of symptoms of depression (Mergl et al., Reference Mergl, Seidscheck, Allgaier, Möller, Hegerl and Henkel2007).

The WHO-5 index has been validated for different populations (Bech et al., Reference Bech, Gudex and Johansen1996; Heun et al., Reference Heun, Bonsignore, Barkow and Jessen2001; Awata et al., Reference Awata, Bech, Koizumi, Seki, Kuriyama, Hozawa, Ohmori, Nakaya, Matsuoka and Tsuji2007a; Saipanish et al., Reference Saipanish, Lotrakul and Sumrithe2009; Lehmann et al., Reference Lehmann, Makine, Karşıdağ, Kadıoğlu, Karşıdağ and Pouwer2011; Bech, Reference Bech2012; Lucas‐Carrasco, Reference Lucas‐Carrasco2012; Yusoff et al., Reference Yusoff, Yaacob and Naing2013; Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014; Moon et al., Reference Moon, Kim and Kim2014; Christensen et al., Reference Christensen, Haugen, Sirpal and Haavet2015; Khosravi et al., Reference Khosravi, Mousavi, Chaman, Sepidar-Kish, Ashrafi, Khalili and Holakouie Naieni2015; Uludag et al., Reference Uludag, Sahin, Agaoglu, Gungor, Ertekin and Tekin2016) but the psychometric properties of the previously translated (by the leading author of this paper) Turkish version were not investigated, leaving its validity and the reliability unknown.

The mean annual Family Physician Services use has increased from 1.1 to–2.8 per person since the early 2000s in Turkey (Akman, Reference Akman2014; Republic of Turkey Ministry of Health General Directorate of Health Information Systems, 2017). This improvement of the primary care service use in Turkey also increases the necessity of rapid evaluation methods in primary care, focusing not only on physical-related disorders but also on mental well-being in both adults and older adults. Sound and psychometrically valid assessment tools like the WHO-5 would therefore satisfy the unmet need of evaluating mental well-being in primary care in Turkey.

This study aims to determine and explore the psychometric properties of the Turkish version of WHO-5 in both adults and older adults separately.

Methods

Subjects and data collection

Turkish-speaking subjects over 18 years of age, having intellectual competency for answering all items were recruited for the study (n = 1752). Being intellectually competent is defined as providing correct answers to the two simple questions about the age of the respondent, and the date of the interview. This is a multicentre study that covers the secondary analysis of the data from seven unpublished cross-sectional representative studies conducted with different hypotheses in two provinces (Manisa and Balıkesir) of Turkey. The sample sizes were calculated by using the patient data sets of the seven Family Health Centers in both provinces and the data were collected by interviewer-assisted questionnaires at the houses of the respondents in all of the seven cross-sectional studies in 2017. Three of the seven data sets belonged to adults and three belonged to older adults samples and one sample is a mixed sample of adults and older adults. Primary endpoints in three adult samples are low back pain, obesity and occupational health; and falls and PHC services accessibility for older adults samples. Adults and older adults mixed sample explores the hypertension prevalence. The demographic, morbidity and WHO-5 data from these seven studies were used in this study.

These studies are summarized in Table 1.

Table 1. The list of data sources used in this study

Socio-demographic and Morbidity variables

Socio-demographic variables used in the studies were age, gender, level of education, marital status, social security status, and perceived financial status of the household and social class.

The other variables used in the statistical analysis were the ‘perceived change in health status compared to the previous year’ and ‘the presence of any chronic disease’.

WHO-5

The WHO-5 is a short and effective screening measure for detecting mental well-being. The WHO-5 index was translated into Turkish in 1999 by one of the authors of this study, printed in the Turkish official version of the scale, which is included in the website: ‘https://www.psykiatri-regionh.dk/who-5/who-5-questionnaires/Pages/default.aspx’ which refers to various language versions of the WHO-5 including Turkish version.

The translation/Turkish adaptation process followed the forward translations (two independent forward translations and generating consensus forwards version), back translation and the cognitive debriefing interviews on 10 lay subjects. The items of the WHO-5 are: I have felt cheerful and in good spirits (item 1); I have felt calm and relaxed (item 2); I have felt active and vigorous (item 3); I woke up feeling fresh and rested (item 4) and My daily life has been filled with things that interest me (item 5). The participants were asked to specify to what extent each of these five statements was true for them in the last 14 days. The items are scored between 0 (at no time) and 5 (all of the time). Therefore, the overall raw score varies between 0 (the absence of well-being) and 25 (the highest level of well-being). A 100-point scale score may also be calculated by multiplying the crude score by four. According to the scale instructions, Major Depression Inventory (ICD-10) should be applied to the patient if the scale score is less than 13. With this scale, individual change over time can also be monitored. A score change of 10% or more (increase or decrease) indicates a clinically meaningful change (Ware and Davies, Reference Ware and Davies1995).

Data analysis

Psychometric analysis

Reliability and validity analyses of the Turkish version of the WHO-5 were conducted in this study. ‘Confirmatory approach’ was used in both reliability and validity analyses. This approach meant that the Turkish version would be tested against the one-dimensional original (index) scale structure. Analyses were run stratified by adults (18–64 years of age, n = 940) and older adults (65 and over, n = 812).

Descriptive analysis

The descriptive results of the individual items and overall scores are presented by measures of mean and SD; and skewness, kurtosis and ceiling and floor effects. Maximum acceptable ceiling and floor effect was considered as 20% (Andresen, Reference Andresen2000).

Reliability analysis

Reliability analyses were presented by ‘item analyses’ and ‘internal consistency’. In the item analyses correlation coefficients values (corrected for overlap) were obtained between each Item’s score and the total score, demonstrating the contribution of each of the items to the overall scale score. Internal consistency was tested with Cronbach’s alpha value. Cronbach’s alpha were deleted calculated for both the index scale and separately when each of the five items was deleted. If item deleted alpha values are expected to be smaller than the overall alpha value, this means that all five items contribute to the variance of the scale. Any ‘if item deleted’ alpha values greater than overall alpha value may refer to a problematic item. Alpha value 0.7 and over indicates a Satisfactory internal consistency (Nunnally and Bernstein, Reference Nunnally and Bernstein1994).

Validity analysis

Construct validity of the Turkish version of the WHO-5 was evaluated in the validity analyses. The construct validity was tested with known groups validity, discriminant validity, exploratory factor analysis (via Principal components analysis with Varimax rotation) and Confirmatory factor analysis (CFA). Comparative Fit Index (CFI) and Root Mean Square Error of Approximation (RMSEA) values were calculated in CFA. Acceptable fit values are 0.90 for CFI and 0.08 for RMSEA (Hooper et al., Reference Hooper, Coughlan and Mullen2008; Kline, Reference Kline2016). The known groups and discriminant validity of the instrument was tested with the mean difference between subgroups (Student’s t test), while the magnitude of the differences was presented with Cohen’s Effect Size (ES) statistic (Cohen, Reference Cohen1988b). ES value closer to 0.20 indicates a small effect, whereas 0.50 a medium and 0.8 and over a big effect (Cohen, Reference Cohen1988a). One-way ANOVA analysis was used in comparing three or more groups where parametric conditions are satisfied. Post-hoc comparisons were conducted using Tukey’s B test. The upper limit for type 1 error was taken to be 0.05 in the statistical analyses. The analyses were done by using ‘SPSS version 21.0 for Windows’ (SPSS Inc., Chicago, Il, USA) and Lisrel 8.54 (Joreskog & Sorbom, 2003).

Ethical considerations

This study was approved by the Manisa Celal Bayar University Ethics Committee.

Results

Seven different studies’ data were used in the analyses. 53.7% of the study group was between 18 and 64 years of age and the rest were aged 65 and over. This study presents the results of the psychometric analyses stratified for both adults (18–64 years of age) and older adults (over 65). The mean age of the respondents was 40.35 ± 12.43 (range: 18–64) for the adult group and 72.87 ± 6.43 (range: 65–97) for the older adults group. 14.7% of the adult group and 41.7% of the older adults group was male; illiteracy rates were 8.2% for adults and 25.0% for older adults, whereas the percentage of insufficient income was 31.9% and 17.2% for adults and older adults, respectively (Table 3). Overall raw WHO-5 Score was 13.78 ± 4.93 for the adults sample and 14.86 ± 5.17 for the older adults sample. Converted mean 0–100 scale scores were 55.14 ± 19.72 for the adult groups and 59.42 ± 20.70 for the older adults group. Major Depression Inventory (ICD-10) should be applied to 36.7% of the respondents in the adults sample and 30.8% in the older adults sample, since the raw WHO-5 scores were less than 13.

Table 3. Known groups and discriminant validity results

(Cohen, Reference Cohen1988b).

Cronbach’s alpha values were 0.83 for the overall sample; 0.81 for the adults sample and 0.86 for the older adults sample. In both samples, when item five was deleted, Cronbach’s alpha was higher than the general alpha value, but item total correlations (corrected overlap) were greater than 0.35 for all items in both adults and older adults samples, indicating that item five should not be considered a problematic item.

Exploratory Factor Analysis showed Keiser Meier Olkin (KMO) values to be 0.82 in both groups and cumulative exploratory variance percentage to be 58.5% in adults and 63.9% in older adults. This means that the sample size adequacy has been ensured for both groups, that the scale presents a one-dimensional structure and that the explained variance is above 50% (Table 2). Item loadings for each of the individual items were over 0.7 except for item five, which has lower item loadings in both adults (0.50) and older adults groups (0.69). CFA results indicated good fit indices for both the adults and the older adults samples: CFI values were over 0.95 in both samples. However, the RMSEA value was in acceptable limits (under 0.08) in the adults sample, whereas RMSEA was over acceptable limits (0.16) in the older adults sample. Error residuals of the items were in moderate levels and the commonalities with the overall scale in an adequate magnitude (Table 2).

Table 2. Results of WHO-5 item analyses, internal consistency, exploratory and confirmatory factor analyses in adults and older adults

(a) Item total correlations corrected for overlap; (b) If item deleted Alpha values; (c) Factor loadings generated by Principal Components Analyses; KMO: Kaiser-Meyer-Olkin Measure of Sampling Adequacy, RMSEA: Root Mean Square Error of Approximation, CFI: Comparative Fit Index, NFI: Normed Fit Index, GFI: Goodness of Fit Index, Stand.RMR: Standardized Root Mean Square Residual.

Table 3 presents known groups and discriminant validity findings for both age groups. In the adults group, those with insufficient income, with chronic diseases, sleep problems, having a Body Mass Index (BMI) of more than 30 and those with poor psychological status had a significantly lower scale score (P < 0.05). In the older adults group, the WHO-5 scores were worse in women compared to men; illiterate persons compared to educated ones; those having inadequate income compared to adequate income; those with chronic diseases compared to healthy persons; smokers compared to non-smokers; those with sleep problems compared to the people who do not have any sleep problems; those with high BMI values compared to normal weighted persons and those with poor psychological status (P < 0.05). Cohen’s effect size figures indicate that, in the older adults group the WHO-5 score was more sensitive to the known variables than in the adult groups (Table 3).

Discussion

In this study, the psychometric properties of the Turkish version of the WHO-5 were found to be quite satisfactory. The study covered a wide range of psychometric evaluations except for sensitivity to change due to its cross-sectional design. There were two points that distinguish that distinguished our study from many other studies: Firstly, the sample pool consisted of individuals using primary health care services representing the community, and secondly, evaluations were done separately in adults and elderly individuals. The main reason for using stratified analyses was to take into account that self-assessment of mental well-being might differ between between adults and the elderly.

Illiteracy rates were 8.2% for adults and 25.0% for older adults in this study. Illiterate persons (especially those older adults living in the suburbs) are part of the community and should be included in the study for the sake of generalizability of the results to the Turkish population. So, the questionnaires were applied either via interviewer administration or in an interviewer-assisted way.

When we compared the WHO-5 score obtained from various studies and the WHO-5 scores obtained from our study, the average values obtained in various countries of Europe (De Wit et al., Reference De Wit, Pouwer, Gemke, Delemarre-van De Waal and Snoek2007; Klis et al., Reference Klis, Vingerhoets, De Wit, Zandbelt and Snoek2008; Gorter et al., Reference Gorter, Wens, Khunti, Claramunt, Topsever, Drivsholm, Jenum, Berkhout, Khalangot, Goldfracht, Rurik, Lionis and Rutten2010; Lehmann et al., Reference Lehmann, Makine, Karşıdağ, Kadıoğlu, Karşıdağ and Pouwer2011; Nicolucci et al., Reference Nicolucci, Kovacs Burns, Holt, Comaschi, Hermanns, Ishii, Kokoszka, Pouwer, Skovlund, Stuckey, Tarkun, Vallis, Wens and Peyrot2013; Bahrmann et al., Reference Bahrmann, Abel, Zeyfang, Petrak, Kubiak, Hummel, Oster and Bahrmann2014; Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014) were between 62.5 and 66.4, which are higher than our results (WHO-5 mean values in the adults and the elderly group were 55.13 and 59.42, respectively, in our study). Similarly, in a German study (Bahrmann et al., Reference Bahrmann, Abel, Zeyfang, Petrak, Kubiak, Hummel, Oster and Bahrmann2014), the raw WHO-5 score was found to be 17.7, whereas the WHO-5 raw scores were 13.7 for adults and 14.9 in older adults in our study. These score differences between European countries may be explained mainly with the different socioeconomic conditions, which are worse in our study sample compared to the European samples. On the other hand, higher WHO-5 scores were obtained in other studies conducted on higher educated Turkish population compared to our study (Lehmann et al., Reference Lehmann, Makine, Karşıdağ, Kadıoğlu, Karşıdağ and Pouwer2011; Makine et al., Reference Makine, Nouwen, Karşıdağ, Kadıoğlu, Karşıdağ and Pouwer2011). Besides, our results are closer to the results of an Iranian study (Khosravi et al., Reference Khosravi, Mousavi, Chaman, Sepidar-Kish, Ashrafi, Khalili and Holakouie Naieni2015). These results also support our hypothesis of socio-demographic factors effecting mental well-being. By contrast, in a multicenter study (Nicolucci et al., Reference Nicolucci, Kovacs Burns, Holt, Comaschi, Hermanns, Ishii, Kokoszka, Pouwer, Skovlund, Stuckey, Tarkun, Vallis, Wens and Peyrot2013) which consists of 17 countries including Turkey, the poor mental well-being prevalence was found to be 19% if the cut-off point was set as 28 points in a 100-point scale, whereas this prevalence was 12.0% in adults and 10.8% in older adults based on the same cut-off point used in our study. This may be attributed to age, gender and socioeconomic differences between study samples.

Study sample and distribution properties

KMO values obtained in the EFA for both the adult and older adult samples were greater than 0.50, confirming sample size adequacy. Ceiling and floor effects of the scale in both the adults and older adults groups were within acceptable limits (0.4%–2.2%), pointing to a remarkable measuring capacity of the WHO-5 in both samples. Skewness and Kurtosis values were more than sufficient, which indicates a normal distribution, eradicating any distribution- related uncertainty in the reliability and validity analyses (West et al., Reference West, Finch, Curran and Hoyle1995). In both samples, when item five was deleted, Cronbach’s alpha increased over the general alpha value, but item total correlations (corrected overlap) were greater than 0.35 in both samples. This demonstrates that item five could not be considered a problematic item.

Reliability and validity analysis

The alpha coefficients of both adults (0.81) and older adults (0.86) are over 0.70, pointing out a satisfactory internal consistency. These values are consistent with all literature findings (Awata et al., Reference Awata, Bech, Koizumi, Seki, Kuriyama, Hozawa, Ohmori, Nakaya, Matsuoka and Tsuji2007a; De Wit et al., Reference De Wit, Pouwer, Gemke, Delemarre-van De Waal and Snoek2007; Saipanish et al., Reference Saipanish, Lotrakul and Sumrithe2009; Makine et al., Reference Makine, Nouwen, Karşıdağ, Kadıoğlu, Karşıdağ and Pouwer2011; Ramona, Reference Ramona2012; Khosravi et al., Reference Khosravi, Mousavi, Chaman, Sepidar-Kish, Ashrafi, Khalili and Holakouie Naieni2015; Bonnín et al., Reference Bonnín, Yatham, Michalak, Martínez-Arán, Dhanoa, Torres, Santos-Pascual, Valls, Carvalho, Sánchez-Moreno, Valenti, Grande, Hidalgo-Mazzei, Vieta and Reinares2018). If item deleted alpha values did not indicate any problematic items in older adults. In adults’ version, the only potential problematic item is item five, since the alpha value when item five is deleted is greater than overall alpha value (8.1 versus 8.6). Nonetheless, the item total correlation for item five (corrected overlap) was greater than 0.35, which indicates that item five is not a problematic item, and this eliminates any previous doubts. The three methods used to demonstrate the construct validity analyses were: ‘factor analyses’, the ‘known groups analysis’, and the ‘discriminant validity analysis’. Exploratory factor analyses of the Turkish version in both age groups have revealed a one-dimensional structure, as it was suggested in other language versions of the WHO-5 (Bech et al., Reference Bech, Gudex and Johansen1996; Heun et al., Reference Heun, Bonsignore, Barkow and Jessen2001). Factor loadings obtained from exploratory factor analyses and the model’s variance explanation potential were found to be consistent with other studies in the literature (De Wit et al., Reference De Wit, Pouwer, Gemke, Delemarre-van De Waal and Snoek2007; Saipanish et al., Reference Saipanish, Lotrakul and Sumrithe2009; Ramona, Reference Ramona2012; Khosravi et al., Reference Khosravi, Mousavi, Chaman, Sepidar-Kish, Ashrafi, Khalili and Holakouie Naieni2015). Due to some limitations of the exploratory factor analyses, CFA was also suggested in the literature (Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014). The CFI generated by CFA was satisfactory ( > 0.95) in both adults’ and older adults’ samples, but the indices showing error residuals were found weaker in the older adults’ version compared to the adults’ version. RMSEA value of the adults’ version was 0.073, which is lower than the acceptable limit ( < 0.08) whereas the RMSEA and 0.166 in older adults. In other language validation studies, the fit indices were all in acceptable limits as ours, but the RMSEA was satisfactory only in the German diabetics study with 0.062 (De Wit et al., Reference De Wit, Pouwer, Gemke, Delemarre-van De Waal and Snoek2007). The RMSEA was, 0.104 in the Iranian version (Khosravi et al., Reference Khosravi, Mousavi, Chaman, Sepidar-Kish, Ashrafi, Khalili and Holakouie Naieni2015); 0.09 in men and 0.15 in women in the Icelandic version (Guðmundsdóttir et al., Reference Guðmundsdóttir, Ólason, Guðmundsdóttir and Sigurðsson2014); and 0.23 in the Australian English version (Halliday et al., Reference Halliday, Hendrieckx, Busija, Browne, Nefs, Pouwer and Speight2017), which are all over acceptable limits. Looking at these results, we can confidently say that the Turkish version has an acceptable construct validity in adults. Some of the relations of known groups analyses agreed on in the literature have also been tested here as another way of showing the construct validity: The WHO-5 was found to discriminate between categories of age, gender, level of education, income and marital status. According to the effect sizes, the strongest discriminative variables were education and income.

Discriminant validity

As the original structure of the scale showed an index (one-dimensional) feature, this study focused on clinical sensitivity findings in addition to item analyses. A similar approach was followed in the WHO-5 studies in other countries. In most of the studies in the literature, either ROC analyses were conducted through a parallel scale by assuming a cut-off point, or sensitivity and specificity values were calculated using suggested cut-off points for WHO-5, which are 28 and 50 points for 100-point scale; and 12–13 points for raw 25-point scale. Mainly two cut-off points (50 for 100-point scale and 12–13 for 25-point scale) are suggested in the literature (Henkel et al., Reference Henkel, Mergl, Kohnon, Allgaier, Möller and Hegerl2004; Hajos et al., Reference Hajos, Pouwer, Skovlund, Den Oudsten, Geelhoed‐Duijvestijn, Tack and Snoek2013; Firdaus, Reference Firdaus2017; Halliday et al., Reference Halliday, Hendrieckx, Busija, Browne, Nefs, Pouwer and Speight2017). Different cut-off values were also suggested such as 13 in an Australian study (Halliday et al., Reference Halliday, Hendrieckx, Busija, Browne, Nefs, Pouwer and Speight2017) and 10 in a Europe and South Asia comparison study (Aujla et al., Reference Aujla, Skinner, Khunti and Davies2010) (Aujla et al., Reference Aujla, Skinner, Khunti and Davies2010) for raw scores (on 0–25-point scale); and 28 in 100-point scale (Lehmann et al., Reference Lehmann, Makine, Karşıdağ, Kadıoğlu, Karşıdağ and Pouwer2011).

Strengths and limitations

The representative big cross-sectional sample obtained from diverse socioeconomic groups of the community is one of the main strengths of this study. The other one is the stratified analyses done for the adults and older adults that distinguishes our study from the many others.

The main restriction of our paper is the lack of ROC analyses. We did not apply ROC and sensitivity analyses since we could not use a reference test in this study. Instead, we applied discriminant analyses by dichotomizing the WHO-5 raw score by 13 (as suggested in the official instructions of the WHO-5). Existence of any non-communicable disease, exercise frequency, sleep problems, obesity, violence and abuse, the satisfaction with interfamily relations and perceived psychological and general health status were the variables used in these discriminant analyses. The Turkish version of the WHO-5 could discriminate all subcategories of these variables.

Conclusion

To conclude, the distribution properties, measurement capacity and the internal consistency of the Turkish WHO-5 were sufficient and satisfactory in both the adults and older adults samples. Although the fit indices are acceptable in both samples in CFA, error residuals were out of acceptable limits in the older adults sample. So, we suggest that, the Turkish version of the WHO-5 can confidently be used in adults (18–64 years of age) whereas the results should be interpreted with caution for older adults.

Author ORCIDs

Celalettin Çevik 0000-0002-1123-6196

Financial Support

This research received no specific grant from any funding agency, commercial, or not-for-profit sectors.

Conflicts of Interest

None.

Authors’ Contribution

All authors contributed during every step of the research until the writing of the manuscript. All authors read and approved the final manuscript.

Ethical Standards

The authors assert that all procedures contributing to this work comply with the ethical standards of relevant national and institutional guidelines and with the Helsinki Declaration of 1975, as revised 2008.

References

Akman, M (2014) Strength of primary care in Turkey. Turkish Journal of Family Practice 18, 70–8.Google Scholar

Andresen, EM (2000) Criteria for assessing the tools of disability outcomes research. Archives of Physical Medicine and Rehabilitation 81, S15–20.CrossRef Google Scholar PubMed

Aujla, N, Skinner, T, Khunti, K and Davies, M (2010) The prevalence of depressive symptoms in a white European and South Asian population with impaired glucose regulation and screen‐detected Type 2 diabetes mellitus: a comparison of two screening tools. Diabetic Medicine 27, 896–905.CrossRef Google Scholar

Awata, S, Bech, P, Koizumi, Y, Seki, T, Kuriyama, S, Hozawa, A, Ohmori, K, Nakaya, N, Matsuoka, H and Tsuji, I (2007a) Validity and utility of the Japanese version of the WHO-Five Well-Being Index in the context of detecting suicidal ideation in elderly community residents. International Psychogeriatrics 19, 77–88.CrossRef Google Scholar PubMed

Awata, S, Bech, P, Yoshida, S, Hirai, M, Suzuki, S, Yamashita, M, Ohara, A, Hinokio, Y, Matsuoka, H and Oka, Y (2007b) Reliability and validity of the Japanese version of the world health organization‐five well‐being index in the context of detecting depression in diabetic patients. Psychiatry and Clinical Neurosciences 61, 112–9.CrossRef Google Scholar PubMed

Bahrmann, A, Abel, A, Zeyfang, A, Petrak, F, Kubiak, T, Hummel, J, Oster, P and Bahrmann, P (2014) Psychological insulin resistance in geriatric patients with diabetes mellitus. Patient Education and Counseling 94, 417–22.CrossRef Google Scholar PubMed

Barden, S, Conley, A and Young, M (2015) Integrating health and wellness in mental health counseling: clinical, educational, and policy implications. Journal of Mental Health 37, 152–63.Google Scholar

Bech, P (2012) Clinical psychometrics, 202. Retrieved from https://onlinelibrary.wiley.com/doi/book/10.1002/9781118511800 Google Scholar

Bech, P, Gudex, C and Johansen, KS (1996) The WHO (Ten) well-being index: validation in diabetes. Psychotherapy and Psychosomatics 65, 183–90.CrossRef Google Scholar PubMed

Bech, P, Lindberg, L and Moeller, SB (2018) The Reliable Change Index (RCI) of the WHO-5 in primary prevention of mental disorders. A measurement-based pilot study in positive psychiatry. Nordic Journal of Psychiatry, 1–5.Google Scholar

Bonnín, C, Yatham, L, Michalak, E, Martínez-Arán, A, Dhanoa, T, Torres, I, Santos-Pascual, C, Valls, E, Carvalho, AF, Sánchez-Moreno, J, Valenti, M, Grande, I, Hidalgo-Mazzei, D, Vieta, E and Reinares, M (2018) Psychometric properties of the well-being index (WHO-5) Spanish version in a sample of euthymic patients with bipolar disorder. Journal of Affective Disorders 228, 153–59.CrossRef Google Scholar

Christensen, KS, Haugen, W, Sirpal, MK and Haavet, OR (2015) Diagnosis of depressed young people – criterion validity of WHO-5 and HSCL-6 in Denmark and Norway. Family Practice 32, 359–63.CrossRef Google Scholar PubMed

Cohen, J (1988a) Statistical power analysis for the behavioral sciences, second edition. Hillsdale: Erlbaum Associates.Google Scholar

Cohen, J (1988b) Statistical power analysis for the behavioural sciences. Hillsdale. NJ: Lawrence Earlbaum Associates, 2.Google Scholar

De Wit, M, Pouwer, F, Gemke, RJ, Delemarre-van De Waal, HA and Snoek, FJ (2007) Validation of the WHO-5 Well-Being Index in adolescents with type 1 diabetes. Diabetes Care 30, 2003–6.CrossRef Google Scholar PubMed

Diener, E, Scollon, CN and Lucas, RE (2009) The evolving concept of subjective 455 well-being: themultifaceted nature of happiness assessing well-being. Illinois, U.S.A.: Springer Science+Business Media, 67–100.Google Scholar

Downs, A, Boucher, LA, Campbell, DG and Polyakov, A (2017) Using the WHO–5 well-being index to identify college students at risk for mental health problems. Journal of College Student Development 58, 113–7.CrossRef Google Scholar

Firdaus, G (2017) Mental well-being of migrants in urban center of India: analyzing the role of social environment. Indian Journal of Psychiatry 59, 164.CrossRef Google Scholar PubMed

Gorter, KJ, Wens, J, Khunti, K, Claramunt, XC, Topsever, P, Drivsholm, T, Jenum, AK, Berkhout, C, Khalangot, M, Goldfracht, M, Rurik, I, Lionis, C and Rutten, GEHM (2010) The European EUCCLID pilot study on care and complications in an unselected sample of people with type 2 diabetes in primary care. Prim Care Diabetes 4, 17–23.CrossRef Google Scholar

Guðmundsdóttir, HB, Ólason, DÞ, Guðmundsdóttir, DG and Sigurðsson, JF (2014) A psychometric evaluation of the Icelandic version of the WHO‐5. Scandinavian Journal of Psychology 55, 567–72.CrossRef Google Scholar PubMed

Hajos, TR, Pouwer, F, Skovlund, S, Den Oudsten, BL, Geelhoed‐Duijvestijn, P, Tack, C and Snoek, FJ (2013) Psychometric and screening properties of the WHO‐5 well‐being index in adult outpatients with Type 1 or Type 2 diabetes mellitus. Diabetic Medicine 30, e63–9.CrossRef Google Scholar PubMed

Hall, T, Krahn, GL, Horner-Johnson, W and Lamb, G (2011) Examining functional content in widely used Health-Related Quality of Life scales. Rehabilitation Psychology 56, 94.CrossRef Google Scholar PubMed

Halliday, JA, Hendrieckx, C, Busija, L, Browne, JL, Nefs, G, Pouwer, F and Speight, J (2017) Validation of the WHO-5 as a first-step screening instrument for depression in adults with diabetes: results from Diabetes MILES–Australia. Diabetes Research and Clinical Practice 132, 27–35.CrossRef Google Scholar PubMed

Henkel, V, Mergl, R, Kohnon, R, Allgaier, A-K, Möller, H-J and Hegerl, U (2004) Use of brief depression screening tools in primary care: consideration of heterogeneity in performance in different patient groups. General Hospital Psychiatry 26, 190–8.CrossRef Google Scholar PubMed

Henkel, V, Mergl, R, Kohnen, R, Maier, W, Möller, H-J and Hegerl, U (2003) Identifying depression in primary care: a comparison of different methods in a prospective cohort study. BMJ 326, 200–1.CrossRef Google Scholar

Heun, R, Bonsignore, M, Barkow, K and Jessen, F (2001) Validity of the five-item WHO Well-Being Index (WHO-5) in an elderly population. European Archives of Psychiatry and Clinical Neuroscience 251, 27–31.CrossRef Google Scholar

Hooper, D, Coughlan, J and Mullen, M (2008) Structural equation modelling: guidelines for determining model fit. Electronic Journal of Business Research Methods 6, 53–60.Google Scholar

Joreskog, KG and Sorbom, D (2003). LISREL 8.5. Lincolwood, IL: Scientific Software International.Google Scholar

Khosravi, A, Mousavi, SA, Chaman, R, Sepidar-Kish, M, Ashrafi, E, Khalili, M, … Holakouie Naieni, K (2015) Reliability and validity of the Persian version of the World Health Organization-five well-being index. International Journal of Health Studies 1, 17–19.Google Scholar

Kline, RB (2016) Principles and practice of structural equation modeling, fourth edition. NewYork: The Guilford Press.Google Scholar

Klis, S, Vingerhoets, AJ, De Wit, M, Zandbelt, N and Snoek, FJ (2008) Pictorial Representation of Illness and Self Measure Revised II (PRISM-RII) – a novel method to assess perceived burden of illness in diabetes patients. Health and Quality of Life Outcomes 6, 104.CrossRef Google Scholar PubMed

Kong, C-L, Lee, C-C, Ip, Y-C, Chow, L-P, Leung, C-H and Lam, Y-C (2016) Validation of the Hong Kong Cantonese version of World Health Organization five well-being index for people with severe mental illness. East Asian Archives of Psychiatry 26, 18.Google Scholar

Krieger, T, Zimmermann, J, Huffziger, S, Ubl, B, Diener, C, Kuehner, C and Holtforth, MG (2014) Measuring depression with a well-being index: further evidence for the validity of the WHO Well-Being Index (WHO-5) as a measure of the severity of depression. Journal of Affective Disorders 156, 240–4.CrossRef Google Scholar PubMed

Lehmann, V, Makine, C, Karşıdağ, Ç, Kadıoğlu, P, Karşıdağ, K and Pouwer, F (2011) Validation of the Turkish version of the Centre for Epidemiologic Studies Depression Scale (CES-D) in patients with type 2 diabetes mellitus. BMC Medical Research Methodology 11, 109.CrossRef Google Scholar PubMed

Löwe, B, Spitzer, RL, Gräfe, K, Kroenke, K, Quenter, A, Zipfel, S, Buchholz, C, Witte, S and Herzog, W (2004) Comparative validity of three screening questionnaires for DSM-IV depressive disorders and physicians’ diagnoses. Journal of Affective Disorders 78, 131–40.CrossRef Google Scholar PubMed

Lucas‐Carrasco, R (2012) Reliability and validity of the Spanish version of the World Health Organization‐Five Well‐Being Index in elderly. Psychiatry and Clinical Neurosciences 66, 508–13.CrossRef Google Scholar PubMed

Makine, C, Nouwen, A, Karşıdağ, Ç, Kadıoğlu, P, Karşıdağ, K and Pouwer, F (2011) Validation of the Turkish version of the problem areas in diabetes scale. Cardiovascular Psychiatry and Neurology 3, 1–6.Google Scholar

Mergl, R, Seidscheck, I, Allgaier, AK, Möller, HJ, Hegerl, U and Henkel, V (2007) Depressive, anxiety, and somatoform disorders in primary care: prevalence and recognition. Depress Anxiety 24, 185–95.CrossRef Google Scholar PubMed

Moon, YS, Kim, HJ and Kim, DH (2014) The relationship of the Korean version of the WHO Five Well-Being Index with depressive symptoms and quality of life in the community-dwelling elderly. Asian Journal of Psychiatry 9, 26–30.CrossRef Google Scholar PubMed

Mortazavi, F, Mousavi, S-A, Chaman, R and Khosravi, A (2015) Dünya Sağlık Örgütü-5 İyilik Hali Endeksi Geçerliği: Annenin İyilik Hali ve Bununla İlişkili Faktörlerin Değerlendirilmesi. Turkish Journal of Psychiatry 26, 48–55.Google Scholar

Newnham, EA, Hooke, GR and Page, AC (2010) Monitoring treatment response and outcomes using the World Health Organization’s Wellbeing Index in psychiatric care. Journal of Affective Disorders 122, 133–38.CrossRef Google Scholar PubMed

Nicolucci, A, Kovacs Burns, K, Holt, RI, Comaschi, M, Hermanns, N, Ishii, H, Kokoszka, A, Pouwer, F, Skovlund, SE, Stuckey, H, Tarkun, I, Vallis, M, Wens, J and Peyrot, M (2013) Diabetes Attitudes, Wishes and Needs second study (DAWN2™): cross‐national benchmarking of diabetes‐related psychosocial outcomes for people with diabetes. Diabetic Medicine 30, 767–77.CrossRef Google Scholar PubMed

Nunnally, JC and Bernstein, I (1994) Psychometric theory. McGraw-Hill Series in Psychology, Volume 3. New York: McGraw-Hill.Google Scholar

Ramona, L-C (2012) Reliability and validity of the Spanish version of the World Health Organization-Five Well-Being Index in elderly. Psychiatry and Clinical Neurosciences 66, 508–13.Google Scholar

Republic of Turkey Ministry of Health General Directorate of Health Information Systems (2017) Health Statistics Yearbook. Retrieved from https://dosyamerkez.saglik.gov.tr/Eklenti/27344,saglik-istatistikleri-yilligi-2017-haber-bultenipdf.pdf?0 Google Scholar

Saipanish, R, Lotrakul, M and Sumrithe, S (2009) Reliability and validity of the Thai version of the WHO‐Five Well‐Being Index in primary care patients. Psychiatry and Clinical Neurosciences 63, 141–6.CrossRef Google Scholar PubMed

Staehr, JK (1998) The use of well-being measures in primary health care – the DepCare project. World Health Organization Regional Office for Europe: Well-Being Measures in Primary Health Care-the DepCare Project. Geneva: World Health Organization.Google Scholar

Topp, CW, Østergaard, SD, Søndergaard, S and Bech, P (2015) The WHO-5 Well-Being Index: a systematic review of the literature. Psychotherapy and Psychosomatics 84, 167–76.CrossRef Google Scholar PubMed

Uludag, A, Sahin, E, Agaoglu, H, Gungor, S, Ertekin, Y and Tekin, M (2016) Are blood pressure values compatible with medication adherence in hypertensive patients? Nigerian Journal of Clinical Practice 19, 460–4.CrossRef Google Scholar PubMed

Ware, JE and Davies, AR (1995) Monitoring health outcomes from the patients’ point of view: a primer. Integrated Therapeutics Group, Incorporated.Google Scholar

Warr, P, Banks, M and Ullah, P (1985) The experience of unemployment among black and white urban teenagers. British Journal of Psychology 76, 75–87.CrossRef Google Scholar PubMed

West, SG, Finch, JF and Curran, PJ (1995) Structural equation models with nonnormal variables: problems and remedies. In Hoyle, R, editor, Structural equation modeling: concepts issues and applications. Newbery Park: Sage Publications, 56–75.Google Scholar

Yusoff, MSB, Yaacob, MJ and Naing, NN (2013) Psychometric properties of the Medical Student Well-Being Index among medical students in a Malaysian medical school. Asian Journal of Psychiatry 6, 60–5.CrossRef Google Scholar

Table 1. The list of data sources used in this study

Table 3. Known groups and discriminant validity results

Table 2. Results of WHO-5 item analyses, internal consistency, exploratory and confirmatory factor analyses in adults and older adults

Article contents

Reliability and validity of the Turkish version of the WHO-5, in adults and older adults for its use in primary care settings

Abstract

Keywords

Introduction

Methods

Subjects and data collection

Socio-demographic and Morbidity variables

WHO-5

Data analysis

Psychometric analysis

Descriptive analysis

Reliability analysis

Validity analysis

Ethical considerations

Results

Discussion

Study sample and distribution properties

Reliability and validity analysis

Discriminant validity

Strengths and limitations

Conclusion

Author ORCIDs

Financial Support

Conflicts of Interest

Authors’ Contribution

Ethical Standards

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests