Skip to main content Accessibility help
×
Hostname: page-component-78c5997874-j824f Total loading time: 0 Render date: 2024-11-06T05:28:29.643Z Has data issue: false hasContentIssue false

Chapter 1 - The first look at a genome

Sequence statistics

Published online by Cambridge University Press:  05 June 2012

Nello Cristianini
Affiliation:
University of Bristol
Matthew W. Hahn
Affiliation:
Indiana University, Bloomington
Get access

Summary

Genomic era, year zero

In 1995 a group of scientists led by Craig Venter, at The Institute for Genomic Research (TIGR) in Maryland, published a landmark paper in the journal Science. This paper reported the complete DNA sequence (the genome) of a free-living organism, the bacterium Haemophilus influenzae (or H. influenzae, for short). Up until that moment, only small viral genomes or small parts of other genomes had been sequenced. The first viral genome sequence (that of phage phiX174) was produced by Fred Sanger's group in 1978, followed a few years later by the sequence of human mitochondrial DNA by the same group. Sanger – working in Cambridge, UK – was awarded two Nobel prizes, the first one in 1958 for developing protein sequencing techniques and the second one in 1980 for developing DNA sequencing techniques. A bacterial sequence, however, is enormously larger than a viral one, making the H. influenzae paper a true milestone. Given the order of magnitude increase in genome size that was sequenced by the group at TIGR, the genomic era can be said to have started in 1995.

  • Genomes and genomic sequences

  • Probabilistic models of sequences

  • Statistical properties of sequences

  • Standard data formats and databases

A few months later the same group at TIGR published an analysis of the full genome of another bacterium, Mycoplasma genitalium – a microbe responsible for urethritis – and shortly thereafter the sequence of the first eukaryote, the fungus, Saccharomyces cerevisiae (or S. cerevisiae, baker's yeast) was published by other groups.

Type
Chapter
Information
Introduction to Computational Genomics
A Case Studies Approach
, pp. 1 - 21
Publisher: Cambridge University Press
Print publication year: 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Save book to Kindle

To save this book to your Kindle, first ensure [email protected] is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

Available formats
×