Grammatical analysis of DNA sequences provides a rationale for the regulatory control of an entire chromosome

Susumu Ohno

doi:10.1017/S0016672300035187

Grammatical analysis of DNA sequences provides a rationale for the regulatory control of an entire chromosome

Published online by Cambridge University Press: 14 April 2009

Susumu Ohno

Show author details

Susumu Ohno: Affiliation:
Beckman Research Institute of the City of Hope, 1450 E. Duarte Road, Duarte, California 91010-0269

Article contents

Summary
References

Rights & Permissions

Summary

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

Regardless of their origins, functions, and base compositions, all DNAs are scriptures written following the same grammatical rule. At the level of syllables, two, CG and TA are seldom used, while three, TG, CT and CA are utilized with abundance. Accordingly, at the level of three-letter words, two complementary base trimers, CTG and CAG, invariably enjoy frequent usage. Inasmuch as two of the three frequently used syllables, TG and CA are complementary to each other, while two seldom used syllables, CG and TA, are both palindromes, two complementary strands of DNA are inherently symmetrical with each other. Consequently, palindromic sequences as favourite targets of DNA-binding proteins occur at unsuspectedly high frequencies, if they contain TG and CA or CTG and CAG. Nevertheless, there are grammatical rules operating among these high frequency palindromes as well; e.g. the palindromic tetramer TGCA occurs nearly two times more often than its reciprocal; CATG. Thus, DNA-binding proteins are provided with a wealth of abundant targets whose densities are influenced by a regional difference in GC/AT ratios to variable degrees. One palindromic heptamer CAGNCTG is an ideal target of one DNA-binding protein engaged in chromosome packaging and in generation of banding patterns. This heptamer occurs once every 1000 bases in moderately GC-rich sequences, while its incidence is reduced to once every 3000 bases in extremely AT-rich sequences. The above must be the very reason that a solitary human X-chromosome DNA coated with mouse DNA-binding proteins in mouse-man somatic hybrids still maintains the original banding pattern and that the inactive X remains inactive, while the active X remains active.

Type: Research Article
Information: Genetics Research , Volume 56 , Issue 2-3 , October 1990 , pp. 115 - 120

DOI: https://doi.org/10.1017/S0016672300035187 [Opens in a new window]

References

Early, P., Huang, H., Davis, M., Calame, K. & Hood, L. (1980). An immunoglobulin heavy chain variable region gene is generated from three segments of DNA: V _H, D, and J _H. Cell 19 981–992.CrossRef Google Scholar PubMed

Ishiguro, H., Ichihara, Y., Namikawa, T., Nagatsu, T. & Kurosawa, Y. (1989). Nucleotide sequences of Suncus Murinus immunoglobulin μ gene and comparisons with mouse and human μ genes. FEBS Letters 247, 317–322.CrossRef Google Scholar PubMed

Minghetti, P. P., Ruffner, D. E., Kuang, W.-J., Dennison, O. E., Hawkins, J. W., Beattie, W. G. & Dugaiczyk, A. (1986). Molecular structure of the human albumin gene is revealed by nucleotide sequence within q11–22 of chromosome 4. Journal of Biological Chemistry 261, 6747–6757.CrossRef Google Scholar PubMed

Ohno, S. (1988). Universal rule for coding sequence construction: TA/CG deficiency-TG/CT excess. Proceedings of the National Academy of Sciences USA 85, 9630–9634.CrossRef Google Scholar PubMed

Ohno, S. & Yomo, T. (1990). Various regulatory sequences are deprived of their uniqueness by the universal rule of TA/CG-deficiency and TG/CT excess. Proceedings of the National Academy of Sciences USA 87, 1218–1222.CrossRef Google Scholar PubMed

Sakano, H., Huppi, K., Heinrich, G. & Tonegawa, S. (1979). Sequences at somatic recombination sites of immunoglobulin light-chain genes. Nature 280, 288–294.CrossRef Google Scholar PubMed

Yomo, T. & Ohno, S. (1989). Concordant evolution of coding and noncoding regions of DNA made possible by the universal rule of TA/CG deficiency-TG/CT excess. Proceedings of the National Academy of Sciences USA 86, 8452–8456.CrossRef Google Scholar PubMed

Article contents

Grammatical analysis of DNA sequences provides a rationale for the regulatory control of an entire chromosome

Summary

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests