A simple maximization model inspired by algorithms for the organization of genetic candidates in bacterial DNA

Andrew G. Hart; Servet Martínez; Leonardo Videla

doi:10.1017/S0001867800001452

A simple maximization model inspired by algorithms for the organization of genetic candidates in bacterial DNA

Part of: Markov processes

Published online by Cambridge University Press: 08 September 2016

Andrew G. Hart ,

Servet Martínez and

Leonardo Videla

Show author details

Andrew G. Hart*: Affiliation:
Universidad de Chile
Servet Martínez*: Affiliation:
Universidad de Chile
Leonardo Videla*: Affiliation:
Universidad de Chile
*: ∗ Postal address: Departamento de Ingeniería Matemática and Centro de Modelamiento Matemático, UMR 2071 CNRS-UCHILE, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Casilla 170-3, Correo 3, Santiago, Chile.
∗ Postal address: Departamento de Ingeniería Matemática and Centro de Modelamiento Matemático, UMR 2071 CNRS-UCHILE, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Casilla 170-3, Correo 3, Santiago, Chile.
∗ Postal address: Departamento de Ingeniería Matemática and Centro de Modelamiento Matemático, UMR 2071 CNRS-UCHILE, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Casilla 170-3, Correo 3, Santiago, Chile.

Article contents

Abstract
References

Rights & Permissions

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

We propose a simple model for interaction between gene candidates in the two strands of bacterial DNA (deoxyribonucleic acid). Our model assumes that ‘final’ genes appear in one of the two strands, that they do not overlap (in bacteria there is only a small percentage of overlap), and that the final genes maximize the occupancy rate, which is defined to be the proportion of the genome occupied by coding zones. We are more concerned with describing the organization and distribution of genes in bacterial DNA than with the very hard problem of identifying genes. To this end, an algorithm for selecting the final genes according to the previously outlined maximization criterion is proposed. We study the graphical and probabilistic properties of the model resulting from applying the maximization procedure to a Markovian representation of the genic and intergenic zones within the DNA strands, develop theoretical bounds on the occupancy rate (which, in our view, is a rather intractable quantity), and use the model to compute quantities of relevance to the Escherichia coli genome and compare these to annotation data. Although this work focuses on genomic modelling, we point out that the proposed model is not restricted to applications in this setting. It also serves to model other resource allocation problems.

Keywords

Constrained optimization renewal process Markov process

MSC classification

Primary: 60J27: Continuous-time Markov processes on discrete state spaces 92D20: Protein sequences, DNA sequences

Type: General Applied Probability
Information: Advances in Applied Probability , Volume 38 , Issue 4 , December 2006 , pp. 1071 - 1097

DOI: https://doi.org/10.1017/S0001867800001452 [Opens in a new window]

References

Burge, C. and Karlin, S. (1997). Prediction of complete gene structures in human genomic DNA. J. Molec. Biol. 268, 78–94.Google Scholar

Kelly, F. P. (1991). Loss networks. Ann. Appl. Prob. 1, 319–378.Google Scholar

Krengel, U. (1985). Ergodic Theorems (De Gruyter Stud. Math. 6). Walter De Gruyter, Berlin.CrossRef Google Scholar

Lukashin, A. V. and Borodovsky, M. (1998). GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 26, 1107–1115.CrossRef Google Scholar PubMed

Nicolas, P. (2003). Mise au point et utilisation de modèles de Markov cachées pour l'etude des séquences d'ADN. , Université d'Evry.Google Scholar

Nicolas, P. and Muri-Majoube, F. (2001). R'HOM. Programs to segment DNA sequences into homogeneous regions. Tech. Rep., Université d'Evry. Available at http://genome.jouy.inra.fr/ssb/rhom/rhom_doc/rhom_doc.html.Google Scholar

Salzberg, S. L., Delcher, A. L., Kasif, S. and White, O. (1998). Microbial gene identification using interpolated Markov models. Nucleic Acids Res. 26, 544–548.Google Scholar

Article contents

A simple maximization model inspired by algorithms for the organization of genetic candidates in bacterial DNA

Abstract

Keywords

MSC classification

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests