Article contents
Moderate Deviations for Word Counts in Biological Sequences
Published online by Cambridge University Press: 14 July 2016
Abstract
We derive a moderate deviation principle for word counts (which is extended to counts of multiple patterns) in biological sequences under different models: independent and identically distributed letters, homogeneous Markov chains of order 1 and m, and, in view of the codon structure of DNA sequences, Markov chains with three different transition matrices. This enables us to approximate P-values for the number of word occurrences in DNA and protein sequences in a new manner.
MSC classification
- Type
- Research Article
- Information
- Copyright
- Copyright © Applied Probability Trust 2009
References
- 1
- Cited by