Hostname: page-component-586b7cd67f-t7fkt Total loading time: 0 Render date: 2024-11-25T10:08:43.705Z Has data issue: false hasContentIssue false

The association between the neighbourhood characteristics and depression: was the regression model satisfactory?

Published online by Cambridge University Press:  30 March 2020

Abhishek Ghosh
Affiliation:
Assistant Professor, Department of Psychiatry, Post Graduate Institute of Medicne Education and Research, India
Natarajan Varadharajan
Affiliation:
Senior Resident, Department of Psychiatry, Post Graduate Institute of Medicine Education and Research, India. Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Type
Correspondence
Copyright
Copyright © The Royal College of Psychiatrists 2020

The study by Generaal and colleagues is noteworthy because of its large sample size and its objective and intelligent measurements of the myriad neighbourhood characteristics.Reference Generaal, Hoogendijk, Stam, Henke, Rutters and Oosterman1 However, we would like to draw the readers’ attention to the regression models. The Supplementary Table 1, with the bivariate correlations among the independent variables, showed modest to strong correlations between large numbers of variables. Therefore, multicollinearity was present. Under such circumstances, it is advisable to do variance inflation factor (VIF) estimation. VIF of more than ten suggests multicollinearity is a significant problem. Independent variables with VIF more than ten should have been removed from the model.Reference Vatcheva, Lee, McCormick and Rahbar2 The other option is to carry out principal component analysis of highly correlated independent variables. As the authors have not undertaken either of these corrections, significant multicollinearity might have affected the magnitude of the standardised regression coefficients, their standard errors and the P-values. These could potentially result in unreliable interpretations.Reference Kim3 The authors could have added the proportion of variance (R Reference Vatcheva, Lee, McCormick and Rahbar2) in the dependent variable (depression prevalence/severity) explained by the independent variables (neighbourhood characteristics) because R Reference Vatcheva, Lee, McCormick and Rahbar2 is not affected by multicollinearity. Additionally, R Reference Vatcheva, Lee, McCormick and Rahbar2 would have given an idea about the goodness of fit of the regression models.

The severity of depression (i.e. the dependent variable) had skewed distributions in five out of the seven cohorts. We agree, with a large sample size linear regression analysis could be done, even with a non-parametric dependent variable. However, the ordinary least square estimations should have been carried out to demonstrate statistical robustness of the regression analysis. In the case of non-normality of the ordinary least square, bootstrapping is an alternative.Reference Hubert, Rousseeuw and Aelst4

Because of these limitations, we would be cautious while interpreting the results of the regression analysis undertaken to examine the association between the neighbourhood characteristics and severity of depression.

References

Generaal, E, Hoogendijk, EO, Stam, M, Henke, CE, Rutters, F, Oosterman, M, et al. Neighbourhood characteristics and prevalence and severity of depression: pooled analysis of eight Dutch cohort studies. Br J Psychiatry 2019; 215: 468–75.10.1192/bjp.2019.100CrossRefGoogle ScholarPubMed
Vatcheva, KP, Lee, M, McCormick, JB, Rahbar, MH. Multicollinearity in regression analyses conducted in epidemiologic studies. Epidemiol 2016; 6: 227–46.CrossRefGoogle ScholarPubMed
Kim, JH. Multicollinearity and misleading statistical results. Korean JAnesthesiol 2019; 72: 558–69.CrossRefGoogle ScholarPubMed
Hubert, M, Rousseeuw, PJ, Aelst, SV. Inconsistency of resampling algorithms for high-breakdown regression estimators and a new algorithm. J Amer Stat Assoc 2002; 97: 151–3.Google Scholar
Submit a response

eLetters

No eLetters have been published for this article.