We present a simple ratio method to infer protein composition within cellular structures using proximity labeling approaches but compensating for the diffusion of free radicals. We used tyramide signal amplification (TSA) and label-free mass spectrometry (MS) to compare proteins in nuclear speckles versus centromeres. Our “TSA-MS ratio” approach successfully identified known nuclear speckle proteins. For example, 96% and 67% of proteins in the top 30 and 100 sorted proteins, respectively, are known nuclear speckle proteins, including proteins that we validated here as enriched in nuclear speckles. We show that MFAP1, among the top 20 in our list, forms droplets under certain circumstances and that MFAP1 expression levels modulate the size, stability, and dynamics of nuclear speckles. Localization of MFAP1 and its binding partner, PRPF38A, in droplet-like nuclear bodies precedes formation of nuclear speckles during telophase. Our results update older proteomic studies of nuclear speckles and should provide a useful reference dataset to guide future experimental dissection of nuclear speckle structure and function.
Introduction
While different cellular compartments and structures have distinct protein compositions, methods for easily analyzing the protein composition of these different compartments and how they vary with changes in cell differentiation or physiology are limited (Bell et al., 2001; Croze and Morré, 1984; Mootha et al., 2003; Taylor et al., 2003). Biochemical purification is frequently accompanied by loss of specific proteins. Moreover, cellular bodies and structures that are not surrounded by lipid membranes may lose their physical integrity during biochemical isolation. Bodies with liquid-like properties are particularly prone to such disruptions with physical isolation procedures. For example, previous biochemical purification of interchromatin granule clusters (IGCs), also known as nuclear speckles, has enriched for individual granules present within IGCs; the extent to which nongranule components of the IGCs are lost during purification is unknown (Mintz et al., 1999; Saitoh et al., 2004).
Enzyme-based proximity labeling methods for tagging and then purifying the tagged proteins within particular cellular structures have been developed as an alternative to biochemical purification. Proximity labeling methods work to label proteins in situ with the goal of preserving cell structure during labeling. Thus, proximity labeling approaches are particularly well suited for enriching proteins within less stable and hard-to-purify cellular structures. For example, proximity-dependent biotin identification uses in vivo expression of a biotin ligase, targeted to a specific cell structure, to biotinylate proteins in the vicinity of the targeted biotin ligase. The biotin ligase (BirA) is engineered (BirA*) to label proteins “promiscuously,” with current models suggesting the release of a biotinoyl-5′-AMP that diffuses to label nearby proteins (Roux et al., 2012). Engineered ascorbate peroxidase (APEX) also uses in vivo expression of a cytoplasmic peroxidase to generate short-lived biotin-phenoxyl radical, from biotin-phenol, that diffuses to label tyrosines on nearby proteins. Again, targeting the APEX enzyme to specific cell structures enables labeling of nearby proteins (Rhee et al., 2013).
A major unknown for all enzyme-based proximity labeling approaches is how far the labeling can spread from the targeted enzyme generating the label. Current estimates point to a radius of ∼10 nm for biotin identification, which may be too small for complete labeling coverage of cellular organelles and bodies (Kim et al., 2014). The literature has cited an APEX staining radius of ∼20 nm, based on the visualization by electron microscopy of a polymerized DAB product catalyzed by free radicals produced by APEX (Martell et al., 2012). However, this reported staining radius is an underestimation as it does not take into consideration that the DAB product visualized by electron microscopy is a precipitant deposited near the site of the peroxidase.
Tyramide signal amplification (TSA), an older method that preceded APEX, uses exogenous HRP targeted by immunostaining to generate tyramide (phenoxyl) radicals (Bobrow et al., 1989, 1991; van Gijlswijk et al., 1996, 1997). HRP is more active than APEX and has been used to label proteins in extracellular, oxidizing compartments (Loh et al., 2016; Martell et al., 2016) and fixed samples (Bar et al., 2018). In fact, the staining radius of the HRP-generated radical, whose concentration decays exponentially from the HRP source, has been estimated by light microscopy at ∼0.5–1 µm (Carter et al., 2002; Chen et al., 2018). While the staining radius of the phenoxyl radical generated by APEX in the living cell has not been likewise measured, it is likely similar. The estimated staining radius of TSA, and possibly APEX, may exceed the size of some targeted cell structures.
Here, we describe our application of TSA to the proximity labeling and proteomic analysis of IGCs/nuclear speckles. Nuclear speckles are irregularly shaped nuclear bodies that vary in size and number depending on the cell type (Galganski et al., 2017) and are thought to form by protein–protein and protein–RNA interactions (Shevtsov and Dundr, 2011). Previously, nuclear speckle protein composition was characterized biochemically, through isolation of the individual granules contained within IGCs. Over 80% of the proteins identified in this granule fraction were proteins involved in RNA synthesis and processing (Saitoh et al., 2004).
We describe a simple but key approach whereby we compare the relative abundances of proteins identified by TSA labeling of nuclear speckles versus centromeres. The nuclear speckle enrichment of each protein was determined based on its relative abundance in the pull-down fractions from nuclear speckles versus centromeres after TSA labeling. This approach minimized nonspecific background and false-positive hits (Mellacheruvu et al., 2013), producing a list of several hundred proteins as enriched in nuclear speckles. The majority of these proteins are associated with RNA synthesis and/or processing and overlapped (∼60%) with the proteins identified in the previous proteomic analysis of isolated IGC granules. Approximately two thirds of the top 100 sorted proteins have been shown previously by light microscopy to accumulate in nuclear speckles.
In addition, we validated by light microscopy the nuclear speckle localization of several proteins, including the pre-mRNA processing factor microfibrillar-associated protein 1 (MFAP1; Andersen and Tapon, 2008; Bertram et al., 2017). Interestingly, MFAP1 levels modulate the size of nuclear speckles and may contribute to the nucleation of nuclear speckles after mitosis.
Results
Experimental goals and design of the staining approach
We applied TSA (Bobrow et al., 1989; Wang et al., 1999; Fig. 1 A and Fig. S1 A) proximity labeling of proteins followed by comparative MS (mass spectrometry) of multiple cellular compartments, a simple, robust, cost effective, and easily applicable approach that is suitable for some membrane-less nuclear bodies/compartments. Given that the estimated staining radius of the TSA reaction is roughly comparable to the size of some nuclear bodies, we reasoned that proteins identified by TSA labeling would contain a significant fraction of nonspecific proteins. To overcome the possibility of enriching and identifying extraneous proteins outside the structure of interest, we compared the abundance of proteins in a “target” nuclear body/structure to a “reference” body and tested our approach using nuclear speckles and centromeres as target and reference bodies, respectively (Fig. 1, A and B).
Nuclear speckles, ranging from ∼0.4 to 2 µm in diameter (Fakan and Puvion, 1980), are typically depleted from the nuclear periphery, enriched in the nuclear interior (Carter et al., 1991), and surrounded by the A1 Hi-C active chromosomal subcompartment (Chen et al., 2018). Centromeres and their flanking pericentric heterochromatin, on the other hand, are heterochromatic, with a protein composition distinct from the active chromatin regions surrounding nuclear speckles (Fodor et al., 2010; Müller-Ott et al., 2014; Obuse et al., 2004). Centromeres are present throughout the nucleus but nonrandomly associate with the nuclear periphery and nucleolus (Carvalho et al., 2001). Therefore, we reasoned that centromeres would be good reference nuclear body for comparison with nuclear speckles. Moreover, both centromeres (Obuse et al., 2004) and nuclear speckles (Mintz et al., 1999; Saitoh et al., 2004) have been targets of previous proteomic analysis, and many proteins have already been validated by microscopy as enriched in either centromeres or nuclear speckles.
We determined TSA-labeling conditions producing specific staining of nuclear speckles and centromeres in human U2OS cells (Fig. 1 B). For the TSA reaction, we modified conditions developed earlier for TSA sequencing (Chen et al., 2018). Cells were grown to ∼90% confluency, fixed with PFA, and stained first with primary antibodies against either SC35 or CENPA followed by HRP-conjugated secondary antibodies. Next, we incubated cells with TSA reaction buffer containing tyramide and H2O2 in PBS. For one replicate, we added sucrose to the TSA reaction buffer, which slightly reduces the TSA staining radius (Chen et al., 2018). Initially, we used tyramide conjugated to biotin for avidin-biotin pull-down. However, Western blotting showed prominent background bands, likely produced by endogenous biotinylated proteins. Thus, we replaced biotin-tyramide with FITC tyramide (Fig. S1, A and B). To further reduce nonspecific background, we selected antibodies such that the same secondary antibody is used for both the target and reference structures.
Protein pull-down and MS
We suspended TSA-labeled cells in lysis buffer supplemented with protease inhibitors and DNase I. Cell clumps were disrupted by passage through a 25G needle, and the cell suspension was sonicated to obtain a clear lysate. The lysate was centrifuged to remove cellular debris and FITC proteins were pulled down using agarose beads conjugated with anti-FITC antibodies. Proteins were eluted off the beads and resolved on polyacrylamide gel (Fig. 1 C and Fig. S1 C).
To test the TSA labeling and enrichment of labeled proteins, we examined Western blots probed with antibodies against FITC (Fig. S1 D) or SON and heterochromatin protein I α (HP1α). SON and HP1α are highly enriched in nuclear speckles and centromeres, respectively. Indeed, the nuclear speckle fraction was highly enriched with SON and depleted of HP1α, whereas the centromere fraction was enriched with HP1α and depleted of SON (Fig. 1 D).
Next, we repeated the TSA labeling and affinity pull-down but followed this by MS. The relative abundance of each protein in each of the two nuclear bodies was determined using label-free quantitation (LFQ; Cox et al., 2014).
There was an overlap of 1,620–1,870 identified proteins between paired-replicate nuclear speckle (SC35) TSA-MS experiments (Table S1) and 2,340–2,495 identified proteins between paired-replicate centromere (CENPA) TSA-MS experiments, with extensive overlap (60–90%) in the identified proteins from the combined nuclear speckle and centromere paired datasets (Table S1). We expected this extensive overlap between centromere and nuclear speckle identified proteins because of the large TSA-staining radius. Many overlapping proteins would be expected to be present in the nucleoplasm surrounding both nuclear bodies. Moreover, coimmunostaining revealed that centromeres and nuclear speckles, while typically spatially separated, occasionally localized close enough to each other so that their surrounding TSA-stained volumes would overlap. Additionally, we expected some proteins would be labeled due to the low-level nucleoplasmic distribution of CENPA and SC35 proteins outside of centromeres and nuclear speckles, respectively, as well as nonspecific background primary and/or secondary antibody staining.
To identify candidate nuclear speckle proteins, we calculated the nuclear speckle versus centromere LFQ intensity ratio for each protein in each paired dataset (Table S1 and Fig. S2 A). The geometric mean of these ratios from replicate datasets was then taken as the nuclear speckle versus centromere (SPK-CEN) ratio. Only proteins quantified in both the nuclear speckle and paired centromere pull-down fractions were analyzed. We note that unique proteins that were quantified in the nuclear speckle, but not the paired centromere, fractions, including some known nuclear speckle proteins, were excluded (Table S1).
High percentage of known nuclear speckle proteins among the top sorted proteins in TSA-MS ratio lists
Our knowledge of nuclear speckle proteins comes from three main sources. First is the cumulative literature describing microscopy localization of specific proteins to nuclear speckles. Second is the previous MS of biochemically fractionated IGC granules (Mintz et al., 1999; Saitoh et al., 2004). This work identified 146 proteins, including three proteins (SON, BCLAF1, and acinus) whose localization previously had been unknown but that were then validated as enriched in nuclear speckles using light microscopy (Saitoh et al., 2004; Table 1). Third is the Human Protein Atlas (HPA) survey of the staining patterns of thousands of different antibodies (Thul et al., 2017), which included a “localized to the nuclear speckles” annotation. Note that this assignment is based solely on the characteristic staining pattern observed and not on the actual colocalization of this staining pattern with a known nuclear speckle marker.
Our analysis also identified proteins previously shown by light microscopy to be enriched in nuclear speckles. Indeed, the list of proteins with the highest SPK-CEN ratios showed a very high percentage of these “validated” nuclear speckle proteins, with this percentage dropping with decreasing SPK-CEN ratio.
Specifically, 96% of the top 30 sorted proteins are validated as nuclear speckle proteins, including two proteins that we validated as localized to nuclear speckles (Table 1). This percentage, however, drops to 68% for the top 100 sorted proteins and 35% for the top 250 proteins (Fig. 1, E and F; Table 1; and Table S1). Roughly one third (23) of these 68 validated nuclear speckle proteins in the top 100 sorted list, including PRPF4B, LUC7L3, and CDK12, had not been identified previously in the MS analysis of isolated IGC granules (Saitoh et al., 2004; Table 1 and Table S1).
Empirically, the SPK-CEN ratio cutoff corresponding to a nuclear speckle enrichment sufficient for wide-field light microscopy detection was ∼1.65 (ratio for 91th protein), with approximately three fourths of the top-sorted 91 proteins previously identified by light microscopy as nuclear speckle localized. This includes 7 out of the 10 proteins whose SPK-CEN ratio is equal to or just above this 1.65 ratio cutoff, as compared with 2 of the next 10 proteins whose ratio is just below this 1.65 cutoff. The actual level of enrichment in nuclear speckles is likely several-fold higher. Given the known radius of diffusion of the tryamide free radical, the actual volume of the tryamide-stained regions surrounding the centromeres and nuclear speckles should be several-fold larger than the volumes of the speckles or centromere regions themselves. Experimentally, SON, with the highest SPK-CEN ratio of 4.9, appears >10-fold enriched in nuclear speckles by wide-field deconvolution light microscopy and even more highly enriched in speckles as observed by super-resolution light microscopy (Chen et al., 2018; Kim et al., 2019).
Interestingly, the HPA annotates some previously validated nuclear speckle proteins as “mainly localized to the nucleoplasm” (Table 1), including ACIN1 (Saitoh et al., 2004) and SRSF2 (SC35; Fu and Maniatis, 1992), which currently is the most widely used nuclear speckle marker. Using light microscopy, we show in a later Results section (Validating potential nuclear speckle proteins) that three (MFAP1, pre-mRNA processing factor 38A [PRPF38A], and zinc finger 207 [ZNF207]) out of three proteins tested from the top 30 list, whose nuclear speckle localization had been unknown or ambiguous, were indeed nuclear speckle enriched. Thus, the true percentage of speckle-enriched proteins in our top-sorted lists may be significantly higher than the percentage of previously validated nuclear speckle proteins.
Annotated functions of top sorted proteins in nuclear speckle–enriched TSA-MS ratio list
Similar to the findings from the previously published MS study of isolated IGC granules (Saitoh et al., 2004), a large fraction of proteins with high SPK-CEN ratio are also implicated in some aspect of RNA metabolism (Fig. S1, B and C; Table 1; and Table S1). Specifically, all of the top 30 sorted proteins (Table 1) are implicated in some aspect of RNA metabolism, including pre-mRNA splicing (25), poly(A) tail length control and RNA export (3), and transcription (2). Similarly, 90 of the top 100 proteins are functionally annotated as RNA processing or proteins engaged in transcription or transcription regulation (Fig. S2 C and Table S1). Moving further down the list, 186 of the top 250 proteins are involved in RNA metabolism, including pre-mRNA splicing factors (108), transcription (42), cleavage, polyadenylation or mRNA export (17), and other proteins annotated only as RNA-binding (19; Fig. S2 B and Table S1).
Structurally, nuclear speckles behave as liquid-droplet type bodies (Marzahn et al., 2016) and are thought to form through multivalent interactions between low-complexity domains (LCD) on multiple proteins as well as RNA–protein interactions. Most of the proteins with higher SPK-CEN ratio contain RNA-binding domains and/or LCDs (Table 1 and Table S1). One such highly disordered protein, ZNF207/BuGZ, has been previously demonstrated to undergo phase transition to form liquid droplets as part of the mitotic spindle apparatus (Jiang et al., 2015).
Sorting proteins enriched in centromeres
Sorting proteins instead by the magnitude of the centromere versus nuclear speckle (CEN-SPK) ratio creates a list of potential centromere-enriched proteins, analogous to the previously defined nuclear speckle–enriched list (Fig. 1 A). However, in contrast to the significant enrichment of known nuclear speckle proteins among proteins with high SPK-CEN ratio, our list of proteins with high CEN-SPK ratio was dominated by proteins that localized to the nucleolus, nucleoplasm, or cytoplasm (Table S1). Of the top 100 proteins sorted by the CEN-SPK ratio, 50 are nucleolar proteins, while HP1α (Kang et al., 2011) and Aurora B kinase (Yamagishi et al., 2010) were the only well-characterized heterochromatin and/or centromere-associated proteins (Table S1).
This small number of validated centromere or heterochromatin proteins in our CEN-SPK ratio–sorted list does not appear to be a problem with the centromere TSA labeling and/or pull-down. Rather, many known centromere- and heterochromatin-associated proteins were present in the replicate centromere TSA-MS datasets but had no LFQ intensities in the paired nuclear speckle TSA-MS datasets (Table S1), including 10 CENP proteins (CENPC, CENPB, CENPH, CENPK, CENPT, CENPN, CENPI, CENPO, CENPP, and CENPX; Obuse et al., 2004; Verdaasdonk and Bloom, 2011), HMGA2 (Narita et al., 2006), and HDAC1 (Yamada et al., 2005; Table S1).
We chose centromeres as the reference nuclear body for our ratio approach, because we expected centromere labeling would include proteins present throughout the nucleoplasm, but not proteins enriched within or near nuclear speckles. Centromeres nonrandomly localize close to nucleoli and the nuclear periphery, which are regions not typically associated with nuclear speckles. Moreover, centromeres are smaller than the diffusion radius of the tyramide radical, allowing significant labeling of the proteins in the surrounding nucleoplasm. Future application of the TSA-MS ratio to identify proteins specifically enriched in centromeres and other heterochromatin regions might benefit from a different reference target. Here, we focused our attention and further validation work on nuclear speckle–enriched proteins.
Validating potential nuclear speckle proteins
As described, many of the top sorted proteins in our SPK-CEN ratio list are previously validated nuclear speckle proteins (Table 1 and Table S1). However, the top-sorted lists also included proteins for which conflicting data previously existed with regard to their nuclear speckle localization, proteins previously identified as not enriched in speckles, or proteins with no information available regarding their possible nuclear speckle localization. Some proteins with high SPK-CEN ratio could be proteins specifically depleted from centromeres and the surrounding nucleoplasm rather than enriched within nuclear speckles (false positives). Other proteins with a high SPK-CEN ratio could be specifically enriched in nuclear speckles but have not been examined previously by microscopy or have incorrectly appeared not to be enriched in nuclear speckles due to some technical problem (false negatives).
Therefore, we investigated the nuclear speckle localization of 3 proteins among the top 30 sorted protein list: MFAP1, PRPF38A, and ZNF207. Both MFAP1 and PRPF38A are components of the spliceosome (Agafonov et al., 2011; Bertram et al., 2017). They are implicated in pre-mRNA splicing (Andersen and Tapon, 2008) but are annotated as “localized to the nucleoplasm” by the HPA. ZNF207, also known as BUGZ, is a kinetochore and microtubule-binding protein (Jiang et al., 2015) recently shown to accumulate in nuclear speckles of preextracted cells (Rai et al., 2018) but annotated as “localized to the nucleoplasm” by the HPA.
Coimmunofluorescence staining using antibodies against MFAP1 or ZNF207 and antibody for SC35, to visualize nuclear speckles, showed nuclear speckle accumulation for both proteins in cells fixed with methanol (Fig. 2 A). Formaldehyde-fixed cells instead showed diffuse foci throughout the nucleoplasm (Fig. S2 D). However, recent reinvestigation of mitotic bookmarking revealed a large number of transcription factors that appear to lose their DNA binding during mitosis, as visualized after formaldehyde fixation, in fact show strong mitotic chromosome localization in live cells using fluorescently tagged protein expression. Live-cell imaging revealed a loss of transcription factor within seconds of adding formaldehyde (Teves et al., 2016).
To distinguish between these possibilities and to validate nuclear speckle localization of these proteins, we transfected cells to express human MFAP1, PRPF38A, or ZNF207 proteins fused to GFP. GFP-MFAP1– or GFP-PRPF38A–expressing cells showed GFP accumulation in nuclear speckles after formaldehyde fixation (Fig. 2, B–D). Therefore, the apparent loss of nuclear speckle staining for these proteins likely reflects the sensitivity of their antibody epitopes to formaldehyde fixation. However, cells transiently transfected with GFP-ZNF207 and fixed with formaldehyde still showed diffuse nucleoplasm staining, similar to immunostaining of the endogenous ZNF207 protein (Fig. S2 E). Live-cell microscopy, however, showed GFP-ZNF207 colocalized with SON in nuclear speckles (Fig. 2 E).
This sensitivity of either antibody epitope preservation or protein nuclear speckle localization to formaldehyde fixation conditions (with or without preextraction) may account for previous negative or conflicting results for nuclear speckle localization for these three proteins. We note that while this article was in preparation, MFAP1 was localized to nuclear speckles using the sequential combination of preextraction, formaldehyde fixation, and then ethanol fixation (Salas-Armenteros et al., 2019).
In addition to localizing within nuclear speckles, at high expression levels, both GFP-MFAP1 and GFP-PRPF38A also formed bright and distinct, intranuclear, round foci, which scored negative for both SC35 and SON nuclear speckle markers after immunostaining (Fig. 2, B–D). Some of these foci were adjacent to but distinct from nuclear speckles. Foci were absent in cells expressing lower levels of GFP-MFAP1 or GFP-PRPF38A (Fig. 2 C, arrowhead) but increased in number with increasing GFP-MFAP1 or GFP-PRPF38A expression (Fig. 2 C, arrow; Fig. 2 D), suggesting foci form as a result of overexpression.
Coexpression of MFAP1 and PRPF38A at higher expression levels revealed both proteins colocalized in nuclear speckles and the foci outside of and adjacent to nuclear speckles (Fig. 2 D).
MFAP1 depletion increases nuclear speckle size
Proteins containing arginine/serine (RS) dinucleotide repeats, including members of the serine/arginine (SR) superfamily of pre-mRNA splicing factors, are typically enriched in nuclear speckles (Galganski et al., 2017) and comprise a large fraction of proteins with high SPK-CEN ratio (Table 1). SON, the protein with the highest SPK-CEN ratio (Table 1), is a very large, multidomain protein with a double-stranded RNA-binding domain, a RS repeat domain, and multiple tandem repeats; all of these domains could influence nuclear speckle composition and physical properties. Depleting SON changed nuclear speckle shape from a solid to doughnut shape and reduced the concentration of U1-70K snRNP and SF2/ASF proteins within nuclear speckles (Sharma et al., 2010).
We asked whether other proteins, especially non-SR proteins in the top 30 SPK-CEN ratio–sorted list, might also regulate nuclear speckle structure (Table 1). SRRM2, number 2 on this list, is also a large, SR-related protein that has many RS repeats (Wu and Maniatis, 1993) but lacks an RNA recognition motif (RRM) that is found in SR proteins (Long and Caceres, 2009), suggesting SRRM2 may interact with RNA in the presence of other SR proteins (Blencowe et al., 2000). SRRM2 knockdown (KD) by siRNA produced no observable change in nuclear speckle shape or size (Fig. S3, A and D). However, SRRM2 KD reduced the nuclear speckle association of SC35, but not SON, as visualized by immunostaining (Fig. S3 E), suggesting that SC35 also is not required for maintaining normal nuclear speckle size and shape.
ZNF207 undergoes phase transition to form liquid droplets that promote microtubule polymerization (Jiang et al., 2015). While ZNF207 has a high nuclear speckle enrichment (Rai et al., 2018; Fig. 2, A and E), it lacks both an RS domain and an RRM. Nuclear speckle morphology, and the nuclear speckle localization of SC35 and SON, were unchanged in ZNF207-depleted cells (Fig. S3, A and D).
In contrast, nuclear speckles became noticeably enlarged after MFAP1 KD (Fig. 3 B and Fig. S3 F). The average nuclear speckle in a typical MFAP1-depleted cell was ∼3 µm along its widest length (maximum length, ∼5 µm), as compared with typical, ∼0.5–2 µm lengths in control cells (Fig. S3 D; and Fig. 3, B and C). MFAP1 depletion also increased nuclear speckle size in nontransformed human hTERT-immortalized Tig3 fibroblasts (Fig. S3, B and F) and CHO cells (Fig. S3, C and G). In CHO cells, MFAP1 depletion also increased SON nucleoplasmic levels (Fig. S3 G). We did not observe increased nuclear speckle roundness after MFAP1 KD (Fig. S3 D and Fig. 3 B).
Some nuclear speckles in MFAP1 KD cells migrated and encircled the periphery of nucleoli after RNA polymerase II (RNA pol II) transcription inhibition (Fig. 3, D–F; and Video 1). Longer exposure of MFAP1-depleted cells to α-amanitin resulted in unusually large and fewer nuclear speckles per cell (five or fewer nuclear speckles in some cells), which sometimes surrounded remnants of nucleoli (Fig. 3 E). In contrast, KD of PRPF38A, which directly binds MFAP1 (Bertram et al., 2017; Ulrich et al., 2016), had no effect on nuclear speckle morphology (Fig. S3, A and D). These observations suggest that normal levels of MFAP1 are required to both maintain smaller nuclear speckle size and stabilize nuclear speckle interactions with other nuclear structures such as active chromatin.
MFAP1 and PRPF38A partially separate from nuclear speckles into adjacent bodies after transcriptional inhibition
Overexpression of GFP-MFAP1 results in nuclear foci separate from and adjacent to nuclear speckles. In some GFP-MFAP1–overexpressing cells, nearly all GFP-MFAP1 is sequestered into the adjacent foci, with undetectable GFP-MFAP1 within the neighboring nuclear speckle (Fig. 2 B). RNA pol II transcription inhibition increases nuclear speckle size and roundness (Galganski et al., 2017). Furthermore, previous FRAP experiments showed that ASF/SF2 association with nuclear speckles depended on RNA pol II transcription (Kruhlak et al., 2000; Phair and Misteli, 2000). We therefore examined the association of GFP-MFAP1 to nuclear speckles after RNA pol II transcription inhibition.
As expected, inhibiting RNA pol II transcription caused nuclear speckles to become rounder but also caused fluorescently tagged MFAP1 and/or PRPF38A to partially segregate out of nuclear speckles into foci next to nuclear speckles (Fig. 4, A–D; Fig. S4 A; and Video 2). SON was undetectable in the adjacent MFAP1/PRPF38A foci, as visualized by anti-SON immunostaining (Fig. 4, B and C). The MFAP1/PRPF38A foci that form following transcriptional inhibition are similar in shape and size to the foci observed in GFP-MFAP1– or GFP-PRPF38A–overexpressing cells (Fig. 2, B–D).
Live-cell imaging revealed that these GFP-MFAP1 and GFP-PRPF38A foci that form next to nuclear speckles after RNA pol II transcriptional inhibition are dynamic, disappearing within 1 h following release of the transcription block (Fig. 4, E and F). Their disappearance coincides with the reaccumulation of GFP-MFAP1 or GFP-PRPF38A inside nuclear speckles and restoration of the normal nuclear speckle shape (Fig. 4, E and F; Video 3; and Video 4).
Nuclear entry of MFAP1 and formation of nuclear foci precedes complete nuclear speckle reformation in early G1
Given MFAP1’s possible role in maintaining normal nuclear speckle size, we next investigated the temporal localization of MFAP1 during mitosis when nuclear speckles first disassemble and then reform.
Nuclear speckles disassemble after initiation of mitosis and their components diffuse throughout the dividing cell (Ferreira et al., 1994). In late metaphase, some nuclear speckle components begin to aggregate into cytoplasmic “mitotic interchromatin granules” (MIGs). MIGs increase in size and number as cells progress into telophase (Ferreira et al., 1994; Prasanth et al., 2003). During late telophase, MIGs then disassemble, and their components are sequentially imported into the nucleus to form nuclear speckles (Prasanth et al., 2003).
We compared GFP-MFAP1 localization versus SON or SC35 immunostaining during these mitotic events. GFP-MFAP1 diffusely localized throughout the cell following initiation of mitosis. In telophase cells, GFP-MFAP1 was mainly concentrated within the nuclei, while SON and SC35 remained within cytoplasmic MIGs (Fig. 5, A and B). Before nuclear speckles reformed, as assayed by SON or SC35 localization, GFP-MFAP1 was present both diffusely throughout the nucleus and also concentrated in many small, distinct, round foci that did not stain for either SON or SC35 (Fig. 5, A and B; and Fig. S4 B). Similar nuclear foci were observed in telophase cells expressing GFP-PRPF38A (Fig. 5 C). These GFP-MFAP1 foci showed no coilin staining (Fig. 5 D); therefore, they do not correspond to mitotic coiled bodies (Ferreira et al., 1994). However, their appearance and distribution resemble previous reports of the redistribution of U-snRNPs during mitosis in PtK2 rat kangaroo cells (Spector and Smith, 1986).
We next compared MFAP1 and SON dynamics in live cells during progression through mitosis into early G1. Both GFP-MFAP1 and mCherry-SON became diffusely localized throughout the cytoplasm upon the start of mitosis (Fig. 5 E, Video 5, and Video 6). In late anaphase and early telophase, when mCherry-SON was mainly present in cytoplasmic MIGs, GFP-MFAP1 already was rapidly accumulating in the reforming nucleus (Fig. 5 E and Video 5). This nuclear accumulation of GFP-MFAP1, lasting ∼5 min, was then followed by the formation of distinct nuclear foci (Video 5 and Video 6). Formation of GFP-MFAP1 nuclear foci was followed by the gradual reduction in the size and number of cytoplasmic MIGs and the increasing colocalization of GFP-MFAP1 and mCherry-SON (Fig. 5 E and Video 5). These results are consistent with previous reports of the sequential entry of nuclear speckle components into the nucleus after mitosis (Prasanth et al., 2003) but show that formation of MFAP1 foci precedes the start of nuclear speckle assembly (Fig. 5). However, our current temporal and spatial resolution does not allow us to determine whether MFAP1 foci might directly nucleate the reformation of nuclear speckles.
Discussion
To identify proteins within a specific cellular structure, we combined the TSA proximity-labeling method with a simple ratio approach comparing target and reference cell structures. TSA proximity labeling was followed by affinity pull-down and quantitative MS. This TSA-MS ratio approach compensates for the large diffusion radius of the label-generating free radical, resulting in labeling of many proteins located outside the target cell structure, and allows for the sorting of candidate proteins with a high likelihood of localizing within the target cell structure. We applied TSA-MS ratio to identify proteins enriched in nuclear speckles, using nuclear speckles as the target and centromeres as the reference structures.
Peroxidase-based proximity-labeling methods rely on the diffusion of enzyme-generated free radicals to label proteins. Thus, extraneous proteins may represent a significant fraction of labeled proteins due to the staining radius of the free radical relative to the size of the target structures. For example, the ∼1 µm (Chen et al., 2018) estimated staining radius of HRP-generated biotin-tyramide is relatively large and, in the case of centromeres, could overlap with some nuclear speckles. To minimize false positives resulting from this extended TSA labeling radius, we compared the relative abundance of proteins in the nuclear speckles versus centromere pull-down fractions.
This comparison resulted in a significant enrichment of validated nuclear speckle proteins among the top proteins sorted by their SPK-CEN ratio. Specifically, 29 out of the top 30 and 67 out of the top 100 sorted proteins localized to nuclear speckles based on light microscopy. This includes three proteins (MFAP1, PRPF38A, and ZNF207), for which conflicting data existed previously, that we validated here as localized to nuclear speckles using a combination of different fixation conditions and live-cell imaging. The true percentage of nuclear speckle proteins may be even higher, as we demonstrated that formaldehyde fixation perturbs apparent nuclear speckle localization. The percentage of validated nuclear speckle proteins, however, dropped rapidly as the SPK-CEN ratio further decreased.
In contrast, the list of proteins with high centromere ratio (CEN-SPK ratio) was dominated by annotated nucleolar and cytoplasmic proteins. However, this is consistent with the estimated TSA staining radius (Chen et al., 2018) and the observed localization of centromeres adjacent to the nucleolar and nuclear periphery (Carvalho et al., 2001). Centromere and pericentric heterochromatin proteins were present in this sorted list, but they showed lower CEN-SPK ratios than the nucleolar and cytoplasmic contaminants; other centromeric and pericentric heterochromatin proteins were only present in the centromere pull-down fractions.
Thus, the different performance of the ratio sorting approach for identifying the relative enrichment of proteins in centromere versus nuclear speckle pull-down fractions can be explained by the TSA staining radius and the size and location of the centromere and nuclear speckle target bodies. While centromeres were a good reference structure to eliminate false positives corresponding to nucleoplasmic proteins surrounding both centromeres and nuclear speckles, nuclear speckles were not a suitable reference structure to identify proteins enriched in centromeres. Future applications of TSA-MS ratio will need to use these concepts to select suitably matched target and reference compartments. Previously, we showed that the tyramide free radical–staining radius can be reduced by varying the TSA-staining conditions (Chen et al., 2018). We anticipate that future use of conditions resulting in reduced free radical spreading should further improve TSA-MS ratio results.
Our analysis identified many known nuclear speckle proteins that were absent in the isolated IGC granule proteome (Saitoh et al., 2004), including 20 among the 67 validated nuclear speckle proteins in the top 100 sorted list. Perhaps many of these proteins detected by our approach that were not in the IGC granule proteome correspond to nuclear speckle proteins that are not IGC granule components or granule components that are lost during the IGC granule fractionation procedure.
Consistent with previous work (Saitoh et al., 2004), the majority of the top SPK-CEN ratio–sorted proteins are involved in RNA synthesis or processing (Saitoh et al., 2004). The fraction of proteins involved in RNA metabolism remains high with decreasing SPK-CEN ratio, even while the percentage of validated nuclear speckle proteins falls off sharply after the top 100 sorted proteins. Proteins with low SPK-CEN ratio may have fast on/off rates of nuclear speckle association, reducing their speckle concentration during the slow formaldehyde fixation. Thus, the true nuclear speckle concentration of such proteins may be higher if visualized using other fixation methods or live-cell imaging (Teves et al., 2016), as we demonstrated for ZNF207. Proteins with lower SPK-CEN ratio might also correspond to proteins enriched in regions surrounding nuclear speckles rather than inside the speckles themselves. A last possibility is that these proteins might associate with tyramide-labeled proteins after cell lysis and during pull-down.
Nuclear speckles are now thought to form through liquid–liquid phase separation mediated by protein–protein and protein–RNA interactions (Galganski et al., 2017). Proteins containing low-complexity disordered domains readily form liquid-like bodies both in vivo and in vitro (Altmeyer et al., 2015; Jiang et al., 2015; Patel et al., 2015). However, pure liquid-phase separated bodies should be perfectly round, unlike the shape of nuclear speckles in nonperturbed cells. What nuclear speckle components might contribute to these deviations from a pure liquid-like behavior, conferring nonround shape and nonliquid-like behavior to nuclear speckles, are currently unknown.
Here, we identified MFAP1 as a possible regulator of nuclear speckle size and anchoring of nuclear speckles to chromatin. Specifically, we demonstrated enlargement of nuclear speckles after MFAP1 KD. Conversely, after transcription inhibition both MFAP1 and PRPF38A partially segregate into dynamic foci next to nuclear speckles, with this segregation correlating with the rounding of nuclear speckles generally seen after transcription inhibition (Galganski et al., 2017). Combining RNA pol II transcription inhibition with MFAP1-KD cells caused fusion of some nuclear speckles into fewer and larger nuclear speckles, as well as nuclear speckle fusion with the nucleolar periphery.
Finally, MFAP1, possibly with PRPF38A, forms nuclear bodies distinct from nuclear speckles. When overexpressed, both MFAP1 and PRPF38A segregate out of nuclear speckles, forming round nuclear bodies in the nucleoplasm distinct from nuclear speckles, defined by either SC35 or SON staining. As described in the previous paragraph, transcriptional inhibition also causes MFAP1 and PRPF38A to segregate out of but adjacent to nuclear speckles. Interestingly, both MFAP1 and PRPF38A reenter telophase nuclei before import of other nuclear speckle markers such as SC35 and SON. In these early telophase nuclei, MFAP1 and PRPF38A then condense to form round nuclear bodies similar in size and shape to those observed after either overexpression or transcriptional inhibition. Future experiments will be needed to test what other nuclear speckle proteins cosegregate within these MFAP1/PRPF38A bodies and whether they may actually nucleate the reformation of nuclear speckles after mitosis.
In summary, comparing the proteomes of multiple subcellular structures within the crowded nucleoplasm allows improved identification of proteins in different membrane-less cellular compartments. Furthermore, combining proximity-labeling methods with different effective staining radii should allow the probing of different size volumes surrounding the staining target. This should help improve the true positive hit rate while also helping to identify proteins in the same cell structure that may not be in actual molecular contact. Meanwhile, identification of new nuclear speckle proteins should lead to improved structural and functional dissection of nuclear speckles, as illustrated with our identification of the changes in nuclear speckle size, shape, and nuclear distribution produced after manipulating MFAP1 levels.
Materials and methods
Cell culture, plasmid constructs, and transfections
U2OS and Tig3 cells were cultured in DMEM (Thermo Fisher Scientific) supplemented with 10% bovine growth serum (Hyclone). CHO cells were cultured in Ham’s F12 media (Thermo Fisher Scientific) supplemented with FBS (MilliporeSigma). Penicillin-streptomycin (Thermo Fisher Scientific) was added to all growth media, and cells were incubated in a 37°C humidified incubator with 5% CO2. For DNA cloning, total RNA was extracted from U2OS cells using RNeasy kit (Qiagen). Approximately 1 µg total RNA was reverse transcribed into cDNA using the Protoscript First Strand cDNA Synthesis Kit (New England Biolabs). Unless otherwise stated, all GFP constructs were cloned, using a PCR-based strategy, into the Clonetech pGFP-C1 backbone (Clonetech). All mCherry constructs were based on the pmCherry-C1 backbone (Clonetech). pCIBN-hTRF2-tagRFP-T was a gift from Karsten Rippe (University of Heidelberg, Heidelberg, Germany; plasmid 103812; Addgene). TagRFP-MFAP1 was modified from GFP-MFAP1 construct by replacing the GFP open reading frame with TagRFP open reading frame using AgeI and XhoI restriction sites. All constructs were sequenced and verified before use. DNA constructs were transfected using the Lipofectamine 2000 Transfection Reagent (Thermo Fisher Scientific) and following the manufacturer’s instructions with minor modifications. Briefly, cells in 6-well plates were transfected with 0.5 µg plasmid DNA, and 24-well plates were transfected with 0.1 µg plasmid DNA. Transfection complexes were prepared by separately diluting plasmid DNA and Lipofectamine 2000 reagent in appropriate volumes of OptiMEM (Thermo Fisher Scientific). Diluted DNA and Lipofectamine 2000 transfection reagent were mixed and incubated for 5 min at room temperature to form transfection mix. Transfection mix was added onto cells in complete growth media, and cells were incubated at 37°C and either analyzed after 24 h or, for long-term culturing, drugs for selection were added.
Antibodies
The following primary antibodies were used in this study: anti-CENPA (AB13939; Abcam), anti-SC35 (S4045; MilliporeSigma), anti-MFAP1 (HPA042370; MilliporeSigma), anti-SRRM2 (PA5-68009; Thermo Fisher Scientific), anti-ZNF207 (HPA017013; MilliporeSigma), anti-HP1α (H2164; MilliporeSigma), anti-β Tubulin (AB18207; Abcam), anti-FITC (200-002-037; Jackson ImmunoResearch), custom-made anti-SON (Chen et al., 2018; PACIFIC10700; Pacific Immunology), anti-Nucleophosmin (AB86712; Abcam), anti-nucleolin (AB70493; Abcam), anti-Coilin (Ab87913; Abcam), and anti-PRPF38A (PA5-62730; Thermo Fisher Scientific). Secondary antibodies used were goat anti-mouse HRP (115-035-062; Jackson ImmunoResearch), goat anti-rabbit HRP (111-035-144; Jackson ImmunoResearch), goat anti-mouse FITC (111-095-144; Jackson ImmunoResearch), and goat anti-rabbit Texas red (115–075-146; Jackson ImmunoResearch).
RNA interference
The following RNA duplexes were used for RNAi: ON-TARGETplus nontargeting control siRNA #1 (D-001810-01-05; Dharmacon), human MFAP1 (D-020071-03-0002: 5′-AAGUGAAGGUAAAGCGUUA-3′; Dharmacon), human MFAP1 3′ UTR (D-020071-17-0002: 5′-GGAGUAAUAUACCCGGAAA-3′; Dharmacon), mouse Mfap1a (D-063511-01-0002: 5′-UAAGAAAGCUAAGGAACAA-3′; Dharmacon), human SRRM2 (D-015368-01: 5′-GAGCAAAUCUCAAACAUCA-3′; D-015368-03: 5′-AAAGAAUGUCCACAAGUA-3′; D-015368-05: 5′-GGAAUGAAAGAUAUACCUA-3′; D-015368-006: 5′-GACAGCAAAUCUCGACUAU-3′; Dharmacon), and human PRPF38A (D-014833-02-0002: 5′-CGAUAAAGCCAUGGAGUUA-3′; Dharmacon). ZNF207 siRNA oligo was used in a previous study (Jiang et al., 2015; 5′-GCCUGCUACACUUACAACAACUAU-3′; Invitrogen). The sequence targeting the mouse Mfap1a (D-063511-01-0002) exactly matches the region in CHO cells and was thus used to knock down Mfap1 in CHO cells. All siRNAs were transfected at a concentration of 10 nM using the Lipofectamine RNAiMax transfection reagent (Thermo Fisher Scientific) and following the manufacturer’s directions. Briefly, cells were seeded overnight, and transfection complexes were prepared by separately diluting 10 nM siRNA and appropriate volume of Lipofectamine RNAiMAX transfection reagent in OptiMEM. Diluted siRNA and RNAiMAX transfection reagent were mixed and incubated for 5 min to form transfection complexes. Transfection complexes were added onto cells in complete growth media and cells were incubated for 72–96 h before cells were analyzed. Western blotting was used to determine KD of proteins.
Chemical treatments
The following chemical inhibitors were used in this study to inhibit RNA pol II transcription: 5,6-dichloro-1-β-D-ribofuranosylbenzimidazole (DRB; D1916) was purchased from MilliporeSigma and α-amanitin (CAS 23109–05-9) was purchased from Santa Cruz Biotechnology. All inhibitors were dissolved in DMSO and used at the following final concentrations: DRB, 50 µg/ml; α-amanitin, 10 µg/ml. For washout experiments with DRB, cells were incubated with DRB for 2 h before cells were rinsed at least three times with prewarmed complete growth media.
Immunostaining, microscopy, and image processing
Cells were seeded on coverslips in appropriate wells, and where indicated, cells were treated before fixing with either 100% ice-cold methanol at −20°C or 4% PFA (MilliporeSigma) in PBS at room temperature. All fixations were for 20 min, and cells were rinsed at least three times with PBS after fixation. PFA-fixed cells were permeabilized with 0.1% Triton X-100 (MilliporeSigma) for 30 min at room temperature. Cells were blocked for 1 h at room temperature in blocking buffer consisting of 5% goat serum (MilliporeSigma) and 0.1% Triton X-100 in PBS before incubation with appropriate dilution of primary antibody, in blocking buffer, for 1 h at room temperature or overnight at +4°C. Cells were rinsed three times with wash buffer (0.1% Trition X-100 in PBS) and incubated at room temperature with fluorescent-conjugated secondary antibody diluted in blocking buffer. After 1 h, cells were rinsed three times with wash buffer and mounted using Mowiol (Calbiochem) containing 1,4-diazabicyclo [2.2.2] octane (MilliporeSigma). DAPI (Millipore-Sigma) was added to stain DNA in the nucleus. All images of fixed cells were acquired using a DeltaVision microscope equipped with a Xenon lamp, 60× 1.4 NA oil-immersion objective lens (Olympus) and CoolSNAP HQ charge-coupled device camera (Roper Scientific) and the SoftWorx imaging software (GE Healthcare). Where necessary, images were deconvolved and projected into 2D using the maximum intensity algorithm in SoftWorx imaging software. Where applicable, images were exported into TIFF and processed using FIJI (ImageJ). Nuclear speckle areas were also measured using FIJI, and time-lapse videos were converted into MP4 with VLC media player.
Time-lapse microscopy
For live imaging, cells were seeded in 35-cm glass-bottom dishes (MatTek Corporation) and treated as required. Imaging was performed at 37°C in 5% CO2 chamber attached to a V3 OMX microscope (GE Healthcare) equipped with a 100× 1.4 NA oil-immersion objective (Olympus) and an electron multiplying charge-coupled device camera (Photometrics). We collected image stacks, deconvolved and projected into 2D using the maximum intensity projection algorithm in the SoftWorx software (GE Healthcare). Where necessary, DNA was stained using the SiR-DNA kit (Cytoskeleton).
Western blotting
Proteins were resolved in 8% or 10% SDS-PAGE gels and transferred into nitrocellulose membranes (Bio Rad laboratories). Membranes were blocked for 1 h at room temperature with blocking buffer (5% nonfat milk, 0.1% Tween 20 in PBS) before membranes were incubated, overnight at 4°C, with gentle rocking, in primary antibodies diluted in blocking buffer. Membranes were washed three times 10 min in wash buffer (0.1% Tween 20 in PBS) at room temperature before membranes were incubated with HRP-conjugated secondary antibodies at room temperature for 1 h in blocking buffer. Membranes were washed 3 × 10 min in wash buffer, incubated in Immobilon Western HRP Substrate (Millipore) for 5 min before protein bands were detected using iBright Imaging System (Thermo Fisher Scientific). Tubulin was detected as the loading control.
Tyramide-FITC conjugation and TSA reaction
While tyramide-fluorescein (FITC) is commercially available (for example, PerkinElmer, # NEL741001KT), we instead performed tyramide-fluorescein conjugation as previously described (Hopman et al., 1998). Fluorescein-NHS ester was purchased from Thermo Fisher Scientific; and tyramine hydrochloride (tyramide), dimethyl formamide (DMF), and triethylamine were purchased from MilliporeSigma. Fluorescein-NHS stock solution (21 mM) was prepared in DMF. Tyramide-hydrochloride (HCL) stock solution (58 mM) was prepared in DMF containing 71 mM triethylamine. Equal amounts of fluorescein-NHS in the fluorescein-NHS stock solution and tyramide-HCL in the tyramide-HCL stock solution were combined and the mixture was incubated in the dark for 2 h at room temperature. The conjugated tyramide-FITC was diluted to 1 mg/ml with ethanol and stored in dark at −20°C. The TSA protocol was modified from Chen et al. (2018). U2OS cells were grown to 90% confluency in 15-cm cell culture dishes. Cells were fixed with 1% PFA in PBS at room temperature for 20 min. Cells were rinsed for 10 min with 0.25 M glycine to quench free aldehydes and further rinsed with PBS, scrapped using cell lifter (Thermo Fisher Scientific), and transferred into a 1.5-ml tube for subsequent processing. From this point on, all washes were done by suspending cells in 1 ml PBST (0.1% Triton X-100 in PBS) followed by 3-min centrifugation at 3,000 rpm in room temperature. Cells were permeabilized with 0.5% Triton X-100 for 30 min at room temperature under gentle rotation. Cells were washed three times and suspended in 1.5% H2O2 (MilliporeSigma) and incubated at room temperature, with gentle rotation, to quench the endogenous peroxidase. After 1 h, cells were rinsed three times with PBST and blocked for 1 h at room temperature in blocking buffer (5% goat serum in PBST). Typically, cells were suspended in primary antibodies diluted in 1 ml blocking buffer at 2 µg per 20 million cells and incubated overnight at 4°C with gentle rotation. Cells were rinsed three times with PBST and suspended in HRP-conjugated secondary antibody at 2 µg per 20 million cells in blocking buffer and incubated overnight at 4°C. Cells were rinsed three times with PBST and suspended in TSA reaction buffer (0.1 µg/ml tyramide-fluorescein in PBS and 0.015% H2O2) for 10 min before cells were rinsed three times with PBST. To verify the TSA labeling reaction, ∼5 µl cell pellet was mounted on cover glasses, using Mowiol 1,4-diazabicyclo [2.2.2] octane, for light microscopy. Approximately 150 million cells were processed per antibody for each MS run.
Pull-down of tyramide-FITC–labeled proteins
To affinity purify and enrich fluorescein (FITC)–labeled proteins, we used the Pierce Direct IP kit (#26148; Thermo Fisher Scientific) to couple anti-FITC antibody to agarose beads following the manufacturer’s instructions with some modifications. We used 10 µg anti-FITC antibody and 25 µl Pierce AminoLink Plus Coupling Resin slurry in the coupling reaction. After the coupling reaction, columns containing anti-FITC coupled beads were either directly used for the pull-down of FITC-labeled proteins or stored at +4°C in Pierce coupling buffer with 0.02% sodium azide until needed. For the pull-down of FITC-labeled proteins, after TSA reaction, cells were rinsed once with coupling buffer and suspended in 500 µl ice-cold Pierce lysis buffer supplemented with protease inhibitor cocktail (MilliporeSigma) and DNase I (New England Biolabs). Cell suspension was incubated on ice for 30 min and vortexed every 5–10 min. Cell suspension was passed through a 25G needle (BD Biosciences) attached to a 1-ml syringe (BD Biosciences) to break larger cell clumps. Cells were sonicated at 4°C using the Bioruptor Pico sonicator (Diagenode) to obtain clear or evenly cloudy lysates. Lysates were centrifuged at 16,000 rpm for 10 min at 4°C to pellet cell debris. Equal amounts of the supernatants, as determined using the BCA Assay Kit (Thermo Fisher Scientific) from either anti-SC35 TSA or anti-CENPA TSA reaction, were cleared for 1 h at 4°C using Pierce Control Agarose Resin. Cleared lysates were loaded onto Pierce Spin Columns containing anti-FITC–coupled Pierce AminoLink plus coupling resin and incubated overnight at +4°C with gentle end-over-end rotation. Beads were washed three times with ice-cold wash buffer supplemented with protease inhibitor and once with 100 µl Pierce conditioning buffer. Beads were eluted in a total volume of 50 µl of Pierce elution buffer. Approximately 20 µl eluate was boiled in SDS-PAGE sample loading buffer for 10 min at 100°C and saved for Western blotting; the other 30 µl was saved for in-solution trypsin digestion and MS.
MS
Eluted proteins were reduced with Tris (2-carboxyethyl) phosphine (MilliporeSigma), alkylated with Iodoacetamide (MilliporeSigma), and digested with Lys C/Trypsin (Thermo Fisher Scientific) overnight. The digested samples were desalted by HPLC using an Optimize Technologies C8 microtrap. Samples were resuspended in 0.2% formic acid and directly loaded onto a PicoFrit column (New Objective) packed in house with ReproSil-Pur C18AQ 1.9 µm resin (120 Å pore size; Dr. Maisch GmBH). The 25 cm × 50 µm ID column was heated to 60°C. The peptides were separated with a 120-min gradient at a flow rate of 220 nl/min using a nanoflow LC system, EASY-nLC 1200, (Thermo Fisher Scientific). The gradient was as follows: 2–6% solvent B (7.5 min), 6–25% B (82.5 min), and 25–40% B (30 min), and 40–100% B (9 min). Solvent A consisted of 97.8% H2O, 2% acetonitrile, and 0.2% formic acid and solvent B consisted of 19.8% H2O, 80% acetonitrile, and 0.2% formic acid. The separated peptides were ionized by a Nanospray Flex ion source and subjected to MS/MS analysis using a QExactive HF Orbitrap mass spectrometer (Thermo Fisher Scientific). The QExactive HF Orbitrap was operated in data-dependent mode with the Tune (version 2.7 SP1build 2659) instrument control software. Spray voltage was set to 2.5 kV, S-lens RF level at 50, and heated capillary at 275°C. Full scan resolution was set to 60,000 at m/z 200. Full scan target was 3 × 106 with a maximum injection time of 15 ms. Mass range was set to 300−1,650 m/z. For data-dependent MS2 scans, the loop count was 12, target value was set at 105, and intensity threshold was kept at 105. Isolation width was set at 1.2 m/z, and a fixed first mass of 100 was used. Normalized collision energy was set at 28. Peptide match was set to off, and isotope exclusion was on. Data acquisition was controlled by Xcalibur (4.0.27.13).
LFQ of MS data
Raw data were analyzed using MaxQuant (version 1.6.5.0; Cox and Mann, 2008). Spectra were searched against UniProt human entries (74,487 sequences) and a contaminant protein database (247 sequences). A decoy database was used to establish score thresholds to achieve a 1% protein and peptide false discovery rate. Variable modifications searched were protein N-terminal acetylation and methionine oxidation. Carbamidomethylation was specified as a fixed modification of cysteine. Trypsin was the specified digestion enzyme with up to two missed cleavages allowed. LFQ and match-between-runs were enabled (Cox et al., 2014).
Online supplemental material
Fig. S1 shows TSA labeling of proteins, including chemical structure of tyramide-FITC, Western blots, and protein gels after TSA labeling. Fig. S2 shows data analysis, functional annotation, and validation of potential nuclear speckle proteins, including analysis of MS data and demonstration of loss of ZNF207 and MFAP1 from speckles after PFA fixation. Fig. S3 shows that MFAP1 KD specifically increases nuclear speckle size. Fig. S4 shows nuclear distribution of MFAP1 relative to nuclear speckles as function of DRB treatment and also in mitosis. Table S1 shows MS data. Video 1 shows that nuclear speckles in MFAP1-depleted cells relocate to nucleolar periphery after DRB treatment (related to Fig. 3 F). Video 2 shows that GFP-MFAP1 segregates into foci adjacent to nuclear speckles after DRB treatment. Video 3 shows that reversal of RNA pol II transcription inhibition results in disappearance of MFAP1-positive droplet-like foci adjacent to nuclear speckles (related to Fig. 4 E). Video 4 shows that reversal of RNA pol II transcription inhibition results in disappearance of GFP-PRPF38A droplet-like foci adjacent to nuclear speckles (related to Fig. 4 F). Video 5 shows that GFP-MFAP1 reenters nucleus and forms droplet-like foci before reentry of SON protein and nuclear speckle reassembly after mitosis (related to Fig. 5 E). Video 6 shows that GFP-MFAP1 localizes and accumulates into foci in the nucleus before complete chromatin decondensation (related to Fig. 5 E).
Acknowledgments
The authors thank all members of the Belmont laboratory for suggestions and advice.
A.S. Belmont acknowledges support from National Institutes of Health R01 grant GM58460. The California Institute of Technology Proteome Exploration Laboratory is supported by the Beckman Institute and National Institutes of Health grant 1S10OD02001301. Light microscopy was performed in the Department of Molecular and Cellular Biology Light Microscopy Facility, funded with support from the Carver Foundation.
The authors declare no competing financial interests.
Author contributions: J. Dopie and A.S. Belmont conceived of and designed the research plan. J. Dopie designed the experimental plan and performed all experiments. A. Moradian and M.J. Sweredoski provided advice on preparation of samples for MS, performed MS, provided analysis of MS data, and advised on the ratio analysis approach. The manuscript was written by J. Dopie and A.S. Belmont with input from A. Moradian and M.J. Sweredoski regarding the MS.