Hostname: page-component-cd9895bd7-p9bg8 Total loading time: 0 Render date: 2025-01-03T14:26:45.690Z Has data issue: false hasContentIssue false

A Likelihood-Ratio Test of Twin Zygosity Using Molecular Genetic Markers

Published online by Cambridge University Press:  21 February 2012

Stephen Erickson*
Affiliation:
Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, United States of America. [email protected]
*
*Address for correspondence: Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama 35233 USA.

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

The importance of using multiple polymorphic genetic markers to determine unambiguously whether a twin pair is monozygotic (MZ) or dizygotic (DZ) has long been recognized. Concordance among a set of markers is used as evidence of monozygosity, as it would be improbable for DZ twins to be concordant at a large number of polymorphic loci. Several sources give a formula for the probability of two DZ twins sharing the same genotype at a locus, assuming knowledge of allele frequencies but not of either twin's genotype; this probability can be used to determine whether a set of markers will reliably distinguish between MZ and DZ status in a randomly selected twin pair. If the shared genotype is known, however, the likelihood-ratio test (LRT) of the null hypothesis of dizygosity against the alternative hypothesis of monozygosity takes into account the observed genotype and, by the Neyman-Pearson lemma, is the most powerful test of its size. The LRT is equivalent to conditioning on the genotype of one of the twins, and computing the probability, assuming DZ status, of the other twin sharing that genotype. The resulting p values are frequently lower than those produced by the unconditional probability, especially if rare alleles are observed. The unconditional probability can be recapitulated from conditional probabilities by averaging across all of the conditioned sibling's possible genotypes. To illustrate properties of the LRT applied to multiple markers, the probability distribution of the LRT p value is computed from allele frequencies of twelve unlinked markers published in Elbaz et al. (2006) and compared with the p value computed from unconditional probabilities.

Type
Articles
Copyright
Copyright © Cambridge University Press 2008