The recombination signals (RS) that guide V(D)J recombination are phylogenetically conserved but retain a surprising degree of sequence variability, especially in the nonamer and spacer. To characterize RS variability, we computed the position-wise information, a measure correlated with sequence conservation, for each nucleotide position in an RS alignment and demonstrate that most position-wise information is present in the RS heptamers and nonamers. We have previously demonstrated significant correlations between RS positions and here show that statistical models of the correlation structure that underlies RS variability efficiently identify physiologic and cryptic RS and accurately predict the recombination efficiencies of natural and synthetic RS. In scans of mouse and human genomes, these models identify a highly conserved family of repetitive DNA as an unexpected source of frequent, cryptic RS that rearrange both in extrachromosomal substrates and in their genomic context.
Prospective Estimation of Recombination Signal Efficiency and Identification of Functional Cryptic Signals in the Genome by Statistical Modeling
L.G. Cowell and M. Davila contributed equally to this work.
Abbreviations used in this paper: ampr, ampicillin resistant; camr, chloramphenicol resistant; cRS, cryptic RS; dsb, double-strand breaks; I, position-wise information; MI, mutual information; LM-PCR, ligation-mediated PCR; RIC, RS information content; RS, recombination signal; rS, Spearman's rank correlation coefficient; 12-RS, 12-bp spacer RS; 23-RS, 23-bp spacer RS.
Lindsay G. Cowell, Marco Davila, Kaiyong Yang, Thomas B. Kepler, Garnett Kelsoe; Prospective Estimation of Recombination Signal Efficiency and Identification of Functional Cryptic Signals in the Genome by Statistical Modeling . J Exp Med 20 January 2003; 197 (2): 207–220. doi: https://doi.org/10.1084/jem.20020250
Download citation file:
Sign in
Client Account
Sign in via your Institution
Sign in via your InstitutionEmail alerts
Advertisement