Misunderstanding statistical results in analyses of population structure can jeopardize biodiversity conservation. Consideration of Type I (rejecting a true null hypothesis) and Type II errors (failing to reject a false null hypothesis) in hypothesis testing is crucial. In tests for population structure the null hypothesis is usually that there are no differences between provisional populations, and the threshold for rejection is α = 0.05. Thus, there is a 1/20 chance of a Type I error. The probability of a Type II error depends on both α and sample size; increasing either reduces the Type II error risk and improves the power to detect population differences. With the Baltic harbour porpoise population being very small (Berggren et al., Reference Berggren, Hiby, Lovell and Scheidat2004), the consequence of a Type II error increases the likelihood of extirpation (a non-reversible outcome) and conflicts with the precautionary principle of conservation.
Neither we (Wang & Berggren, Reference Wang and Berggren1997) nor Palmé et al. (Reference Palmé, Laikre, Utter and Ryman2008) presented the two P-values from the Monte Carlo χ2 simulation analysis in REAP v. 4.1 (McElroy et al., Reference McElroy, Moran, Bermingham and Kornfield1992): the probability of exceeding the original χ2 by chance, and the probability that includes ties. We (Wang & Berggren, Reference Wang and Berggren1997) believed the correct P-value to report was the former. In retrospect, both P-values should have been presented and discussed. Regardless, the results and conclusion are unaffected.
Our dataset was reanalysed independently using REAP v. 4.1 (Dr Patricia Rosel, National Marine Fisheries Service, USA, pers. comm.) with the following results: the calculated χ2 value was 8.3 (mean χ2 = 6.1, range 3.9–10.2); P = 0.034 ± SE 0.0018 (341 replicates without ties) and 0.079 ± SE 0.0028 (449 replicates with ties). The slight differences between these values and those in Palmé et al. (Reference Palmé, Laikre, Utter and Ryman2008) and Wang & Berggren (Reference Wang and Berggren1997) are probably due to the algorithm's randomization procedure.
The many tied χ2 values are due to the small sample size relative to the population's genetic diversity, and reflect low analytical power for detecting differences. Both P-values represent potential correct values derived from the analyses, with the actual probability in the range 0.034–0.079, but further resolution requires a larger sample size. Nevertheless, even the higher P-value (0.079) suggests structure. Given low analytical power, this P-value may even be stronger evidence of differentiation than the significant P-value that excludes ties. Furthermore, we argue that any P-value < 0.1 is grounds for prudent conservative management (i.e. in this case recognizing Baltic porpoises as genetically distinct). Finally, the claim by Palme et al. (Reference Palmé, Laikre, Utter and Ryman2008) of no evidence for genetic distinctness ignored differences in other characters (e.g. craniometry; Börjesson & Berggren, Reference Börjesson and Berggren1997); population identification is not limited to DNA data (Wang, Reference Wang, Perrin, Wursig and Thewissen2002).
There are also several factual errors in Palmé et al. (Reference Palmé, Laikre, Utter and Ryman2008). One example with serious implications is their erroneous claim of equal migration rates between the sexes. Several studies have shown that females exhibit stronger philopatry than males (Börjesson & Berggren, Reference Börjesson and Berggren1997; Rosel et al., Reference Rosel, France, Wang and Kocher1999).
In conclusion, Palme et al. (Reference Palmé, Laikre, Utter and Ryman2008) failed to provide convincing and statistically supported arguments against recognizing a distinct Baltic harbour porpoise population as earlier suggested by us (Wang & Berggren, Reference Wang and Berggren1997).
Acknowledgements
We thank Doug McElroy, Patricia Rosel, Karen Martien, Barbara Taylor and Brent Murray for constructive comments.