Multiple sequence alignment

doi:10.1017/CBO9780511819049.005

3 - Multiple sequence alignment

from Section II - Data preparation

Published online by Cambridge University Press: 05 June 2012

Des Higgins and

Edited by

Marco Salemi and

Philippe Lemey: Affiliation:
University of Oxford
Marco Salemi: Affiliation:
University of California, Irvine
Anne-Mieke Vandamme: Affiliation:
Katholieke Universiteit Leuven, Belgium

Book contents

Get access

Summary

THEORY

Introduction

From a biological perspective, a sequence alignment is a hypothesis about homology of multiple residues in protein or nucleotide sequences. Therefore, aligned residues are assumed to have diverged from a common ancestral state. An example of a multiple sequence alignment is shown in Fig. 3.1. This is a set of amino acid sequences of globins that have been aligned so that homologous residues are arranged in columns “as much as possible.” The sequences are of different lengths, implying that gaps (shown as hyphens in the figure) must be used in some positions to achieve the alignment. The gaps represent a deletion, an insertion in the sequences that do not have a gap, or a combination of insertions and deletions. The generation of alignments, either manually or using an automatic computer program, is one of the most common tasks in computational sequence analysis because they are required for many other analyses such as structure prediction or to demonstrate sequence similarity within a family of sequences. Of course, one of the most common reasons for generating alignments is that they are an essential prerequisite for phylogenetic analyses. Rates or patterns of change in sequences cannot be analysed unless the sequences can be aligned.

The problem of repeats

It can be difficult to find the optimal alignment for several reasons. First, there may be repeats in one or all the members of the sequence family; this problem is shown in the simple diagram in Fig. 3.2.

Type: Chapter
Information: The Phylogenetic Handbook
A Practical Approach to Phylogenetic Analysis and Hypothesis Testing
, pp. 68 - 108

DOI: https://doi.org/10.1017/CBO9780511819049.005 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2009

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

3 - Multiple sequence alignment

Summary

Access options

Book purchase

Temporarily unavailable

Book contents

3 - Multiple sequence alignment

Summary

Access options

Book purchase

Temporarily unavailable

Save book to Kindle

Save book to Dropbox

Save book to Google Drive