Minimal clade size and external branch length under the neutral coalescent

Michael G. B. Blum; Olivier François

doi:10.1239/aap/1127483740

Published online by Cambridge University Press: 01 July 2016

Affiliation (both authors): Institut National Polytechnique de Grenoble
Postal address: Laboratoire TIMC-TIMB, Institute for Health and Information Engineering, Faculty of Medicine, F38706 La Tronche cedex, France.

Abstract

Given a sample of genes taken from a large population, we consider the neutral coalescent genealogy and study the theoretical and empirical distributions of the size of the smallest clade containing a fixed gene. We show that the theoretical distribution is strongly related to a Yule distribution of parameter 2, and that the empirical count statistics are asymptotically Gaussian as the number of genes grows to infinity. Then we consider external branches of the coalescent tree, and describe their lengths. Using the infinitely many sites model of mutation, we also describe the conditional distribution of the external branch lengths, given the number of pairwise differences between a reference DNA sequence and the sequence of one closest relative in the sample.
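The link between the minimal clade size and the Yule distribution can be explored numerically. The sketch below is not taken from the paper: it simulates only the topology of the neutral coalescent (pairs of lineages merge uniformly at random) and records the size of the smallest clade, with at least two genes, that contains a fixed gene. The function names, the choice of gene 0 as the reference gene, and the comparison with a Yule distribution of parameter 2 shifted by one, with mass 4/((k-1)k(k+1)) at k >= 2, are assumptions made for illustration.

```python
import random
from collections import Counter


def minimal_clade_size(n, rng):
    """Size of the smallest clade (at least two genes) containing gene 0.

    Simulates the coalescent topology for n >= 2 genes: at each step two
    lineages, chosen uniformly at random, merge. Branch lengths are ignored.
    """
    # sizes[i] is the number of sampled genes below lineage i.
    # Lineage 0 is gene 0 alone until its first coalescence.
    sizes = [1] * n
    while True:
        i, j = rng.sample(range(len(sizes)), 2)
        if i == 0 or j == 0:
            other = j if i == 0 else i
            # First merge involving gene 0 defines its minimal clade.
            return 1 + sizes[other]
        # Merge lineage j into lineage i; index 0 is unaffected since j != 0.
        sizes[i] += sizes[j]
        sizes.pop(j)


def shifted_yule2_pmf(k):
    """Yule distribution of parameter 2, shifted by one (assumed comparison)."""
    return 4.0 / ((k - 1) * k * (k + 1))


if __name__ == "__main__":
    rng = random.Random(2005)
    n, reps = 50, 20000
    counts = Counter(minimal_clade_size(n, rng) for _ in range(reps))
    for k in range(2, 7):
        print(k, counts[k] / reps, round(shifted_yule2_pmf(k), 4))
```

For small clade sizes the empirical frequencies should lie close to the shifted Yule masses; the value k = n behaves differently because it collects the remaining probability.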

Keywords

Coalescent genealogies; tree shape statistics; statistical genetics

MSC classification

Primary: 92D10: Genetics; 92D20: Protein sequences, DNA sequences

Secondary: 60J70: Applications of Brownian motions and diffusion theory (population genetics, absorption problems, etc.)

Type: General Applied Probability
Information: Advances in Applied Probability, Volume 37, Issue 3, September 2005, pp. 647-662

DOI: https://doi.org/10.1239/aap/1127483740
Copyright: © Applied Probability Trust 2005
