Many recent studies have raised interest in the nuclear associations of coregulated genes from different chromosomes, often evoking interpretations of gene–gene interactions, communication, and even “romance.” However, in some cases, the associations may be indirect and infrequent and may reflect the segregation of active and inactive genes into different nuclear compartments. The study by Brown et al. (see p. 1083 of this issue) reports that the apparent association of erythroid genes is not a direct interaction nor colocalization to one tiny transcription factory but arises as a result of the known clustering of many active genes with larger splicing factor–rich speckles (a.k.a., SC35-defined domains). This clustering appears largely stochastic but is impacted by the chromosomal neighborhood of the gene as well as its transcriptional status. The study adds a new twist by examining the same gene in a foreign chromosomal context, providing evidence that this impacts a gene's propensity to form gene–domain (or apparent gene–gene) associations within nuclei.
Interpreting gene associations in nuclei by FISH
Gene pairing in mammalian nuclei has been implicated in processes ranging from transient “kissing” of alleles during X inactivation or imprinted gene regulation to the colocalization of unlinked genes that are tightly coregulated during differentiation (for review see de Laat and Grosveld, 2007; Brown et al., 2008). Evidence for interallelic or interchromosomal gene associations in mammalian nuclei is based on two types of analyses: simultaneous visualization of loci by FISH or the use of 3C/4C technology to identify DNA sequences close enough for paraformaldehyde cross-linking and subsequent ligation (Dekker, 2006; for review see de Laat, 2007). In situ analysis is thought to be important for validation of the more molecular-level associations identified by 3C/4C, and 3C can similarly be used to validate apparent in situ interactions. However, an oft used definition of association is a separation of up to 0.5–1.0 μm between gene signals by FISH, allowing a cytological separation inconsistent with a molecular scale interaction. In some studies, signals that are closely juxtaposed are used to indicate colocalization, but given the limit of resolution of light microscopy (0.1–0.2 μm), any separation (lack of overlap) in two signals suggests a significant molecular-scale disconnect. For example, two 10-kb probes separated by 100–200 kb on a chromosome can only be separately resolved in a fraction of nuclei, even though these sequences may still be distant and noninteracting on a molecular scale (Lawrence et al., 1990). Brown et al. (2008) recognize that a direct physical interaction between two DNA sequences would generate completely overlapping FISH signals, which they essentially do not see (Brown et al., 2006, 2008). Thus, they examine whether the increased proximity of active genes may be accounted for by their association with other nuclear structures.
Globin gene associations may reflect the interaction of active erythroid genes with relatively large SC35-rich speckles
A prior study reported that as globin genes are activated in erythroid cells, they become colocalized with each other within a single transcription factory (Osborne et al., 2004). Transcription factories, initially defined by bromo-UTP (BrUTP) labeling for RNA synthesis, appear as ∼2,000 closely spaced punctuate spots that by electron microscopy are ∼40 nm (Pombo et al., 1999) in most cell types, which is as small or smaller than a single gene FISH signal. Although Brown et al. (2008) confirm that the erythroid genes are more associated by FISH in erythroid cells versus lymphocytes, they argue that the numbers do not add up: signals very rarely colocalize, and even associated signals are on average still a whopping 0.5 μm apart, ∼10 times the size of transcription factories in human erythroid cells. Brown et al. (2008) further suggest that the polymerase II antibody used in this prior analysis does not detect the transcription factories defined by BrUTP incorporation. Rather, they conclude that what brings two active loci in proximity is their mutual association with nuclear speckles, also known as SC35 domains. These large (0.5–2.0 μm) structures enriched in numerous splicing and RNA metabolic factors are often delineated by staining to spliceosome assembly factor SC35, which typically identifies ∼15–40 SC35 domains per nucleus (for review see Lamond and Spector, 2003; Hall et al., 2006). Although the function of these speckles is debated, the Brown et al. (2008) study adds to the evidence that speckles are associated with many highly active genes.
If globin genes interact with a larger SC35 speckle rather than smaller and more numerous transcription factories, as Brown et al. (2008) conclude, this is an important distinction. As shown in Fig. 1 A, two genes that associate with the same SC35 domain can be proximal to each other but still often quite far apart. This does not imply that the genes are interacting with each other or need to be near one another to be coregulated, as is more strongly suggested from colocalization to a transcription factory. Genes typically position at the outer edge of the SC35 domain, whereas the transcripts (from genes associated with that domain) often enter the domain and thus can transiently comingle there (Shopland et al., 2003; Hall et al., 2006). As shown here for COL1 and COL2 (Fig. 1, A and B), the two RNA foci can overlap within the same SC35 domain, whereas the genes are still separable. Thus, experiments that use RNA foci to mark specific genes (Osborne et al., 2007) may overestimate the frequency of truly coincident genes.
Is speckle choice random?
Even by the liberal 1-μm definition of DNA–DNA association, most nuclei still do not show associations between any two given loci. Association rates are typically around 10%, with the highest in Brown et al. (2008) approaching 30%. Importantly, Brown et al. (2008) find that the subset of gene pairs that are in proximity are consistently contacting an SC35-defined speckle, indicating that this is what drives the “pairing.” The clustering of active erythroid genes with SC35 domains fits with several prior studies, which showed that localization at the immediate edge of these domains is a common but locus-specific organization that closely correlates with the activity of some genes (Jolly et al., 1999; Smith et al., 1999; Hall et al., 2006; Brown et al., 2006). Throughout these studies, gene–domain associations have been more tightly defined (no visible separation) by fluorescence microscopy and commonly occur at much higher rates (∼90–100%) than gene–gene associations; in fact, gene–domain association rates of 5–20% were generally considered random.
If two active genes associate with speckles with high frequency, why are they not more frequently associated with each other? Insight into this was provided for the tightly coregulated COL1A1 and COL1A2 genes system (Shopland et al., 2003). Both genes associate with an SC35 domain almost 100% of the time, yet they were at the same SC35 speckles in only 7–13% of nuclei. Thus, rather than conclude that the genes had an affinity for one another, this nonrandom proximity was interpreted to be a biproduct of the specific positioning of both genes at SC35 domains. Because the choice of the domain appeared essentially random (in cells with 15–20 domains), these two tightly coregulated genes did not interact and communicate with each other but were expressed in a common subnuclear compartment. Although not precluding that instances of more specific gene pairing may occur in mammalian cells, this demonstrates that apparent gene–gene associations can arise as a result of broader principles of gene organization in nuclei.
3C analysis of gene associations: are weak interactions mediated by a third party?
Although the study by Brown et al. (2008) does not itself repeat 3C analysis of the erythroid genes, α- and β-globin genes have been perhaps the most studied genes by this method (Dekker, 2006; for review see de Laat, 2007). This approach has provided important evidence of direct interlocus interactions; for example, through chromatin looping of the locus control region and promoter in the mouse β-globin locus (Vakoc et al., 2005) or physical interaction of elements with the Ig heavy chain gene cluster (Ju et al., 2007). Using 3C techniques, interchromosomal DNA associations for the α- and β-globin genes and a number of other loci have been reported (Osborne et al., 2004; for review see de Laat and Grosvfeld, 2007). However, as recently discussed (Dekker, 2006), there may be a host of weaker interactions detected by 3C/4C that arise as a result of very infrequent and possibly quite indirect interactions. The specificity of these interactions may in fact be quite limited: genes that are active may be more likely to become cross-linked with each other because of their interaction with a third party, such as a common nuclear domain (Fig. 2). Similarly, heterochromatic genes would have chance 3C interactions as a result of their segregation in a distinct nuclear compartment, or “neighborhood.” Thus, in the case of weaker 3C interactions, it is all the more necessary to have strong FISH data to corroborate that these associations are meaningful.
In some cases, very low frequency associations could reflect transient one-time regulatory events, such as the initial choice of which X chromosome to inactivate or which imprinted allele will be expressed. It is harder to see how this would relate to the coregulation of two or more genes that are both stably expressed (or repressed) during development because, even using relatively loose criteria by FISH, only a subset of these expressing genes are typically paired with one another.
The influence of the chromosomal context
Brown et al. (2008) report that all five erythroid genes examined showed increased proximity in the active state, but the extent of this varied with the gene and, to some extent, the species. To investigate whether this is influenced by chromosomal context, the authors examine the 2-Mb chromatin regions around each locus for gene density, CpG islands, and the degree of distension via FISH. The two genes that showed the greatest propensity for interchromosomal gene associations were also in chromosomal regions that show the greatest extension within nuclei. Although the sample size is small, their evidence is bolstered by another approach to examine cells in which the human α-globin locus was placed into the mouse α-globin chromosomal site, which shows less frequent interactions than its human counterpart. The human α gene associated with a frequency similar to the mouse locale, providing further evidence that the broader chromosomal context can impact such interactions.
Genomic scale studies examining interchromosomal interactions by 4C have also begun to identify more interacting regions of chromosomes. For example, organizational differences between relatively large chromosomal segments is suggested by 4C studies (Simonis et al., 2006; for review see de Laat, 2007). Some chromosomal regions are more apt to show a 4C signal above background, and this can vary with cell type. More active regions cross-link more with active regions, and silenced regions cross-link more with silent regions. However, a question remains as to whether this reflects sequence-specific interactions or rather the general principles of nuclear organization in which genes segregate into larger euchromatic versus heterochromatic compartments.
Prior studies showed that a consistent association of genes with SC35 domains was not a property of all active genes but was locus specific and may relate to the properties of the gene or its RNA (Smith et al., 1999) as well as chromosomal neighborhood or band (Shopland et al., 2003). In fact, differences in association levels between regions of a chromosome reported by both Brown et al. (2008) and Simonis et al. (2006) may well relate to a long-known feature of chromosome biology: cytogenetic bands. The occurrence of chromosome bands itself demonstrates that there is a higher order organization of DNA in large (∼2–10 Mb) blocks (Fig. 1 E). A prior study found that R bands were more extended and intimately associated with SC35-rich speckles than G bands (Fig. 1, C and D; Shopland et al., 2003) and suggested that the organization manifest in chromosome bands may have evolved in part to accommodate the clustered organization of genes into nuclear neighborhoods. Accordingly, Brown et al. (2008) found that the most interactive gene, SLC4A1, maps to 17q21, an R band that Shopland et al. (2003) showed was especially distended and associated with SC35 speckles.
In fact, constraints imposed by chromosomal context likely impede the inherent propensity of active sequences to cluster, as illustrated by the fact that transfected sequences are not transcribed throughout the nucleus but are found in large nuclear clusters. An interesting recent study by Xu and Cook (2008) indicates that plasmids containing similar promoter types cluster together, forming foci that largely overlap with polymerase II as well as SC35.
Relation of gene organization to expression
A key question in any study of gene organization is how it relates to gene expression and regulation. Although not the main focus of Brown et al. (2008), their findings are consistent with a host of evidence that transcriptional activity correlates with the increased proximity of coregulated genes, gene positioning at the edge of SC35 speckles, or localization with transcription factories. However, this in itself does not demonstrate that the organization is required for activity. The available evidence has long indicated that expression of a gene does not require its association with an SC35-rich speckle, and Brown et al. (2008) make the case that the α- and β-globin genes still produce transcription foci at high frequency, irrespective of their different frequencies of association or their different chromosomal context. In contrast, other studies conclude that the movement into and out of preexisting transcription factories is required for gene expression (Osborne et al., 2004, 2007). Although many factors will impact the level of expression from a gene, it is likely that nuclear organization and chromosomal context both have a significant impact. Although RNA FISH can determine how many loci are associated with visible RNA, simply counting RNA foci associated with genes will not actually determine whether the level of expression (and posttranscriptional processing) changes. Therefore, a more challenging question for future studies is to determine whether the level of expression is indeed influenced by nuclear and chromosomal organization. Basic kinetic principles would suggest that the immediate access to high concentrations of numerous RNA metabolic factors would be expected to increase the efficiency of the enzymatic processes required.
Irrespective of whether genes interact with each other, an SC35-enriched speckle, or a transcription factory, the nonrandom organization of genes in nuclei is interesting and likely important in gene regulation. Although there is still much to learn from the detailed analysis of endogenous genes in their native context, Brown et al. (2008) illustrates an initial foray into what will be an important approach in the future: “locus-switching” experiments to test predictions about the impact on nuclear organization as well as expression of the chromosomal neighborhood.
© 2008 Lawrence and Clemson This article is distributed under the terms of an Attribution–Noncommercial–Share Alike–No Mirror Sites license for the first six months after the publication date (see http://www.jcb.org/misc/terms.shtml). After six months it is available under a Creative Commons License (Attribution–Noncommercial–Share Alike 3.0 Unported license, as described at http://creativecommons.org/licenses/by-nc-sa/3.0/).