Book contents
- Frontmatter
- Contents
- List of contributors
- Foreword
- Preface
- Section I Introduction
- Section II Data preparation
- Section III Phylogenetic inference
- 4 Genetic distances and nucleotide substitution models
- 5 Phylogenetic inference based on distance methods
- 6 Phylogenetic inference using maximum likelihood methods
- 7 Bayesian phylogenetic analysis using MRBAYES
- 8 Phylogeny inference based on parsimony and other methods using PAUP
- 9 Phylogenetic analysis using protein sequences
- Section IV Testing models and trees
- Section V Molecular adaptation
- Section VI Recombination
- Section VII Population genetics
- Section VIII Additional topics
- Glossary
- References
- Index
5 - Phylogenetic inference based on distance methods
from Section III - Phylogenetic inference
Published online by Cambridge University Press: 05 June 2012
- Frontmatter
- Contents
- List of contributors
- Foreword
- Preface
- Section I Introduction
- Section II Data preparation
- Section III Phylogenetic inference
- 4 Genetic distances and nucleotide substitution models
- 5 Phylogenetic inference based on distance methods
- 6 Phylogenetic inference using maximum likelihood methods
- 7 Bayesian phylogenetic analysis using MRBAYES
- 8 Phylogeny inference based on parsimony and other methods using PAUP
- 9 Phylogenetic analysis using protein sequences
- Section IV Testing models and trees
- Section V Molecular adaptation
- Section VI Recombination
- Section VII Population genetics
- Section VIII Additional topics
- Glossary
- References
- Index
Summary
THEORY
Introduction
In addition to maximum parsimony (MP) and likelihood methods (see Chapters 6, and 8), pairwise distance methods form the third large group of methods to infer evolutionary trees from sequence data (Fig. 5.1). In principle, distance methods try to fit a tree to a matrix of pairwise genetic distances (Felsenstein, 1988). For every two sequences, the distance is a single value based on the fraction of positions in which the two sequences differ, defined as p-distance (see Chapter 4). The p-distance is an underestimation of the true genetic distance because some of the nucleotide positions may have experienced multiple substitution events. Indeed, because mutations are continuously fixed in the genes, there has been an increasing chance of multiple substitutions occurring at the same sequence position as evolutionary time elapses. Therefore, in distance-based methods, one tries to estimate the number of substitutions that have actually occurred by applying a specific evolutionary model that makes particular assumptions about the nature of evolutionary changes (see Chapter 4). When all the pairwise distances have been computed for a set of sequences, a tree topology can then be inferred by a variety of methods (Fig. 5.2).
Correct estimation of the genetic distance is crucial and, in most cases, more important than the choice of method to infer the tree topology.
- Type
- Chapter
- Information
- The Phylogenetic HandbookA Practical Approach to Phylogenetic Analysis and Hypothesis Testing, pp. 142 - 180Publisher: Cambridge University PressPrint publication year: 2009
- 13
- Cited by