There is increasing concern that the genetic literature may be distorted by various biases, such as publication bias, which may lead to a misleading impression of the strength of evidence for a putative gene–disease association. Meta-analysis is one means by which a more accurate estimate of the strength of evidence for such association may be obtained, as well as offering a means by which potential biases may be identified (Munafo & Flint, Reference Munafo and Flint2004). Here we present evidence that the location where a study is conducted is associated with the degree to which it represents an over-estimate of the true effect size, as subsequently estimated using meta-analytical techniques.
A number of factors are likely to introduce bias into the literature, and contribute to the risk of false-positive results. These include publication bias (Ioannidis, Reference Ioannidis2006), longer time to publish for results which do not achieve statistical significance (Ioannidis, Reference Ioannidis1998), the trend for effect sizes to decrease with year of publication (Trikalinos et al. Reference Trikalinos, Ntzani, Contopoulos-Ioannidis and Ioannidis2004), the poor predictive value of initial reports of genetic association (Ioannidis et al. Reference Ioannidis, Ntzani, Trikalinos and Contopoulos-Ioannidis2001), the post-hoc study of further sub-groups defined by sex or environmental factors (Patsopoulos et al. Reference Patsopoulos, Tatsioni and Ioannidis2007), the excess of results that fall just below the 0.05 α-level (Ioannidis & Trikalinos, Reference Ioannidis and Trikalinos2007), and so on.
We used data from three meta-analytical reviews, relating to the DRD2 TaqIA polymorphism and alcoholism (Munafo et al. Reference Munafo, Matheson and Flint2007), the DRD2 TaqIA polymorphism and cigarette smoking (Munafo et al. in press), and the COMT Val158/108Met polymorphism and schizophrenia (Munafo et al. Reference Munafo, Bowes, Clark and Flint2005), resulting in a total of k=81 studies. For these, additional data on the geographical location of the research group which conducted each study (defined as the postal address of the corresponding author, grouped as North America, Europe, Other) were then extracted independently by two authors (M.R.M. and A.S.A.). Russia was coded as Europe, and Turkey as Other. We divided the individual study odds ratio (OR) by the pooled OR, to arrive at an estimate of the degree to which each individual study over- or under-estimated the true effect size, as estimated in the corresponding meta-analysis.
Data were combined within a fixed-effects framework using inverse variance methods, as described in detail elsewhere (Munafo & Flint, Reference Munafo and Flint2004). For studies coded as Europe [OR 0.96, 95% confidence interval (CI) 0.88–1.04, p=0.30] and Other (OR 0.95, 95% CI 0.89–1.01, p=0.10) these pooled OR did not differ significantly from zero. In both cases there was modest between-study heterogeneity (I 2=34.66 and 34.60 respectively). For studies coded as North America, however, there was evidence of a significant over-estimation of the true effect size (OR 1.10, 95% CI 1.02–1.17, p=0.009), with evidence of substantial between-study heterogeneity [χ2(33)=134.33, p<0.001, I 2=75.43]. Meta-regression indicated a significant negative correlation between year of publication and ln OR for studies coded as North America (slope −0.04, p<0.001), but not those coded as Europe or Other (slope +0.00 and +0.01, respectively, p values >0.55).
Our results indicate that studies published in North America may represent a relative over-estimate of the true effect size, compared to those published in Europe or elsewhere. Although this conclusion assumes that the pooled effect size arrived at using meta-analytical techniques represents the best-available estimate of the true effect size, this is exactly the rationale behind the use of such techniques (Munafo & Flint, Reference Munafo and Flint2004), which have become increasingly popular in recent years for confirming or refuting the evidence for specific gene–disease associations. It is not possible from these data to infer why this might be the case, although a number of possibilities may be considered. These include, for example, the relatively greater research funding available in North America, and the competition between research groups for these resources, both of which may encourage a focus on the apparently most interesting and timely results available to them (i.e. those which reach nominal statistical significance, at the expense of those which do not), in order to achieve a high rate of publication in high-impact journals.
In order to investigate this possibility, we examined the average bias index for individual countries and the ratio of government research and development funding in science and technology to the gross domestic product (GDP) of these countries, where these data were available. This indicated a strong positive correlation (r s=+0.65, p=0.032). These data are presented graphically in Fig. 1.
Clearly these and other biases will exist elsewhere as well, given that our data reflect a relative bias based on geographical location. One possible interpretation of these correlational data is that anything that distinguishes between North America and Europe will result in a similar correlation. However, by treating each country separately our correlational analysis is unlikely to be contaminated by any systematic bias that differentiates North America from Europe. Consistent with this interpretation, it should be noted that the USA and Canada are not placed at the top of the ratio of funding to GDP ranking of individual countries, although we cannot completely exclude the effect of a systematic bias. In conclusion, these findings should serve as a further caution that even our best available estimates of the strength of a genetic association will be distorted by publication and other biases.
Acknowledgements
Marcus Munafò is supported in part by a NARSAD Young Investigator Award. Jonathan Flint is supported by the Wellcome Trust.
Declaration of Interest
None.