Hostname: page-component-586b7cd67f-r5fsc Total loading time: 0 Render date: 2024-11-23T00:13:46.993Z Has data issue: false hasContentIssue false

Grammatical analysis of DNA sequences provides a rationale for the regulatory control of an entire chromosome

Published online by Cambridge University Press:  14 April 2009

Susumu Ohno
Affiliation:
Beckman Research Institute of the City of Hope, 1450 E. Duarte Road, Duarte, California 91010-0269
Rights & Permissions [Opens in a new window]

Summary

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

Regardless of their origins, functions, and base compositions, all DNAs are scriptures written following the same grammatical rule. At the level of syllables, two, CG and TA are seldom used, while three, TG, CT and CA are utilized with abundance. Accordingly, at the level of three-letter words, two complementary base trimers, CTG and CAG, invariably enjoy frequent usage. Inasmuch as two of the three frequently used syllables, TG and CA are complementary to each other, while two seldom used syllables, CG and TA, are both palindromes, two complementary strands of DNA are inherently symmetrical with each other. Consequently, palindromic sequences as favourite targets of DNA-binding proteins occur at unsuspectedly high frequencies, if they contain TG and CA or CTG and CAG. Nevertheless, there are grammatical rules operating among these high frequency palindromes as well; e.g. the palindromic tetramer TGCA occurs nearly two times more often than its reciprocal; CATG. Thus, DNA-binding proteins are provided with a wealth of abundant targets whose densities are influenced by a regional difference in GC/AT ratios to variable degrees. One palindromic heptamer CAGNCTG is an ideal target of one DNA-binding protein engaged in chromosome packaging and in generation of banding patterns. This heptamer occurs once every 1000 bases in moderately GC-rich sequences, while its incidence is reduced to once every 3000 bases in extremely AT-rich sequences. The above must be the very reason that a solitary human X-chromosome DNA coated with mouse DNA-binding proteins in mouse-man somatic hybrids still maintains the original banding pattern and that the inactive X remains inactive, while the active X remains active.

Type
Research Article
Copyright
Copyright © Cambridge University Press 1990

References

Early, P., Huang, H., Davis, M., Calame, K. & Hood, L. (1980). An immunoglobulin heavy chain variable region gene is generated from three segments of DNA: V H, D, and J H. Cell 19 981992.CrossRefGoogle ScholarPubMed
Ishiguro, H., Ichihara, Y., Namikawa, T., Nagatsu, T. & Kurosawa, Y. (1989). Nucleotide sequences of Suncus Murinus immunoglobulin μ gene and comparisons with mouse and human μ genes. FEBS Letters 247, 317322.CrossRefGoogle ScholarPubMed
Minghetti, P. P., Ruffner, D. E., Kuang, W.-J., Dennison, O. E., Hawkins, J. W., Beattie, W. G. & Dugaiczyk, A. (1986). Molecular structure of the human albumin gene is revealed by nucleotide sequence within q11–22 of chromosome 4. Journal of Biological Chemistry 261, 67476757.CrossRefGoogle ScholarPubMed
Ohno, S. (1988). Universal rule for coding sequence construction: TA/CG deficiency-TG/CT excess. Proceedings of the National Academy of Sciences USA 85, 96309634.CrossRefGoogle ScholarPubMed
Ohno, S. & Yomo, T. (1990). Various regulatory sequences are deprived of their uniqueness by the universal rule of TA/CG-deficiency and TG/CT excess. Proceedings of the National Academy of Sciences USA 87, 12181222.CrossRefGoogle ScholarPubMed
Sakano, H., Huppi, K., Heinrich, G. & Tonegawa, S. (1979). Sequences at somatic recombination sites of immunoglobulin light-chain genes. Nature 280, 288294.CrossRefGoogle ScholarPubMed
Yomo, T. & Ohno, S. (1989). Concordant evolution of coding and noncoding regions of DNA made possible by the universal rule of TA/CG deficiency-TG/CT excess. Proceedings of the National Academy of Sciences USA 86, 84528456.CrossRefGoogle ScholarPubMed