Translation control is critical to regulate protein expression. By directly adjusting protein levels, cells can quickly respond to dynamic transitions during stem cell differentiation and embryonic development. Ribosomes are multisubunit cellular assemblies that mediate translation. Previously seen as invariant machines with the same composition of components in all conditions, recent studies indicate that ribosomes are heterogeneous and that different ribosome types can preferentially translate specific subsets of mRNAs. Such heterogeneity and specialized translation functions are very important in stem cells and development, as they allow cells to quickly respond to stimuli through direct changes of protein abundance. In this review, we discuss ribosome heterogeneity that arises from multiple features of rRNAs, including rRNA variants and rRNA modifications, and ribosomal proteins, including their stoichiometry, compositions, paralogues, and posttranslational modifications. We also discuss alterations of ribosome-associated proteins (RAPs), with a particular focus on their consequent specialized translational control in stem cells and development.
Introduction
Protein and mRNA amounts are poorly correlated, indicating the importance of posttranscriptional regulation (Khan et al., 2013). Recently, translational control has been found to play a dominant role in the regulation of gene expression. Particularly, during dynamic cell fate transitions, such as embryonic development or responses to internal and external stimuli, translation control enables cells to quickly respond by directly adjusting protein abundance before a new chromatin state and transcriptional network form. The chromatin state and many critical mRNAs of embryonic stem cells (ESCs) are stringently subjected to translational control in maintaining self-renewal and pluripotency as well as controlling developmental signaling pathways, cellular differentiation, and embryonic development across many species (Bulut-Karslioglu et al., 2018; Fujii et al., 2017; Kondrashov et al., 2011; Lee et al., 2013b; Slaidina and Lehmann, 2014).
Ribosomes are complex macromolecular assemblies that are the central sites for translation in cells. Using mRNA as a template, one ribosome reads each codon of the mRNA and pairs the codon with the appropriate amino acid provided by an aminoacyl-tRNA to form a polypeptide chain. The ribosome was viewed as a ubiquitous, invariant entity serving for mRNA translation without alterations or specific regulatory functions. However, recent studies have shown ribosome heterogeneity and specialized functions of ribosomes in controlling selective subsets of mRNAs (Xue et al., 2015; Ferretti et al., 2017; Locati et al., 2017a; Shi et al., 2017; Simsek et al., 2017). The ribosome consists of two major components, the small ribosomal subunit (SSU), containing the decoding center to read mRNA and monitor the complementarity between tRNA and mRNA, and the large subunit (LSU), responsible for peptide bond formation and containing three binding sites for tRNA and polypeptide exit tunnel. Each subunit consists of specific ribosomal RNA (rRNA; SSU: 18S; LSU: 5.8S, 28S, and 5S) and ribosomal proteins (RPs; SSU: ∼34 RPs, known as RPS proteins; LSU: ∼51 RPs, known as RPL proteins; Braschi et al., 2019; Baßler and E. Hurt, 2019; Klinge and Woolford, 2019; Fig. 1). Variations in these components and their assemblies can give rise to heterogeneity in ribosomes, and heterogeneous ribosomes could function as key regulatory players in the translational control of stem cells and development.
Historically, the idea of heterogeneous ribosomes with specific functions originated from the earlier “ribosome filter hypothesis” (Mauro and Edelman, 2002). This hypothesis states that ribosome heterogeneity enables specific mRNA–rRNA and mRNA–RP interactions, resulting in differential rates of mRNA translation. Subsequently, a “ribosome code,” analogous to the “histone code” that regulates transcription, was proposed based on the discovery that different RP paralogs have distinct roles in specific translation regulation (Komili et al., 2007). The tenet of both theories is that heterogeneous ribosome composition confers specific translation regulation, providing another level of complexity in the regulation of gene expression. Based on this tenet, we discuss what constitutes the ribosomal heterogeneity (Fig. 2) and how this heterogeneity regulates mRNA translation in stem cells and development.
In this review, we start with the introduction of global translation control in stem cell differentiation, followed by a discussion on our current knowledge on heterogeneous ribosomes and their specialized translation control in stem cells and development, with a particular focus on vertebrates.
Global translation control in stem cell differentiation
Stem cells require a low global protein synthesis rate to maintain their overall homeostasis. During ESC differentiation into embryoid bodies (EBs), polysome content and translation efficiency are elevated on a genome-wide scale (Sampath et al., 2008; Ingolia et al., 2011). A similar increase upon differentiation is reported in other stem cell systems. Hematopoietic stem cells (HSCs) have a significantly lower protein synthesis rate than most other hematopoietic cells. This low rate of protein synthesis is tightly controlled and is essential for the maintenance of HSCs, as a modest increase or decrease in protein synthesis is sufficient to impair HSC function (Signer et al., 2014). Similar observations have been reported in various tissue-specific stem cell maintenance and differentiation systems, such as hair follicle stem cells (Blanco et al., 2016), muscle stem cells (satellite cells; Zismanov et al., 2016), neural stem cells (Bonaguidi et al., 2011), germline stem cells (GSCs; Sanchez et al., 2016), hepatocytic differentiation (Parent and Beretta, 2008), and spermiogenesis (Yanagiya et al., 2010).
Interestingly, while global translation increases in early differentiation, it decreases in later differentiation. For instance, during GSC differentiation in Drosophila, differentiating germ cells consistently present increased protein synthesis rates from GSCs to 2-cell cysts, reaching the maximum at 4-cell cysts. However, after 4-cell cysts, protein synthesis rates decrease, and 16-cell cysts have an even lower rate than GSCs (Sanchez et al., 2016). Similar observations have been made in human ESC (hESC)-to-cardiomyocyte differentiation (Pereira et al., 2019), mouse neural stem cell differentiation (Baser et al., 2019), the neural progenitor differentiation in mouse forebrain (Chau et al., 2018), and mouse ESC (mESC)-EB differentiation (You et al., 2015). Consistent with these observations, despite globally increased protein synthesis during early differentiation, both RPs (Ingolia et al., 2011) and rRNA synthesis (Savić et al., 2014; Woolnough et al., 2016) are repressed, foreshadowing the reduced protein synthesis in later differentiation. Altogether, these studies show that protein synthesis is dynamically controlled during development and demonstrate that translational control is a key modulator of stem cell maintenance and differentiation.
rRNA heterogeneity and specialized functions
rRNA accounts for >55% of the mass of ribosomes and composes around 80% of total RNA in most cells (Blanco and Blanco, 2017). RNA Polymerase (Pol) I, in the nucleolus, mediates transcription of a single rRNA precursor (prerRNA), which is subsequently cleaved into 18S, 5.8S, and 28S rRNA. On the other hand, 5S rRNA is transcribed by RNA Pol III in the nucleoplasm (Lafontaine, 2015; Fig. 1).
Ribosomal DNA (rDNA) genes encode rRNAs and comprise hundreds of transcription units consisting of tandem repeats. Due to their large copy numbers and the highly repetitive nature of rDNA genes, reliable sequencing and a comparison among different species and tissues are challenging. Recent advances in sequencing technologies and bioinformatics analyses have improved the accuracy and consistency of the sequencing mapping to rDNA loci, revealing the extensive variation of rDNA copy numbers across individuals in many species, including human and mouse (Parks et al., 2018). In the human genome, 45S rDNAs are clustered in nucleolus organizer regions, which are located in five chromosomes (chromosomes 13, 14, 15, 21, and 22). The diploid copy number of the 45S rDNA varies from ∼60 to >800 units. The 5S rDNA array resides on chromosome 1 and the diploid copy number ranges from ∼10 to >400 units (Yu and Lemos, 2016). Analysis of copy-number variation in normal tissues and tumors of human suggests that most cancers are subjected to coupled 5S rDNA amplification and 45S rDNA loss, which are associated with increased cell proliferation rates and nucleolar activity (Wang and Lemos, 2017). Analysis of rRNA, the product of rDNA transcription, shows a reduced transcription in the early ACTIVIN A–induced hESC differentiation (48 h; Woolnough et al., 2016) and in mouse NPC differentiation (Savić et al., 2014), whereas another study shows an approximate 20% increase in 5 day–differentiated EBs compared with mESCs (Sampath et al., 2008). Additionally, HTATSF1 (HIV-1 Tat-specific factor 1), a nucleic acid–binding protein that functions as a cofactor for the stimulation of transcriptional elongation, affects rRNA transcription and processing to modulate overall protein synthesis in controlling ESC pluripotency and differentiation (Corsini et al., 2018). Furthermore, beyond its simple expression level change during differentiation, rRNA modulation can also change the cell fate, growth, and proliferation, which is shown in the study of female Drosophila GSCs and their daughters (Zhang et al., 2014) and of mESCs (Watanabe-Susaki et al., 2014). These results indicate that the copy-number variation of rDNA, together with the expression level difference of its product, rRNA, exerts regulatory effects in different cell types.
After transcription, rRNAs are modified and assembled with RPs to form preribosomal subunits, which are then exported from the nucleus to the cytoplasm (Fig. 1). The steps involved in these processes can specify alternative rRNA molecules into ribosomes, changing the functional specificity of the translation machinery and providing opportunities that ribosome heterogeneity can arise.
Heterogeneity from rRNA variants
Alternative rRNA molecules have been widely reported in archaea, bacteria, and simple eukaryotes (Kim et al., 2008; Yano et al., 2013; López-López et al., 2007; Dimarco et al., 2012; Gunderson et al., 1987). Recent studies have also discovered rRNA heterogeneity in organisms with higher complexity. Humans and mice exhibit vast rDNA variations both across and within individuals (Parks et al., 2018). Specifically in mice, rRNA variants are differentially expressed between tissues, and an analysis indicates that their tissue-specific expression is conserved in humans (Parks et al., 2018; Tseng et al., 2008). These results suggest that rRNA variants can be developmentally regulated. Indeed, in zebrafish, there are at least two rRNA systems comprising distinct subtypes of all four rRNA species. There are maternal-type and somatic-type 5S and 45S rDNA transcription units at different chromosomal locations. Between two developmental subtypes, all four mature rRNA species have sequence differences. During oogenesis, maternal-type rRNAs are expressed and accumulated, and these rRNAs are replaced by the somatic-type rRNAs during embryogenesis (Locati et al., 2017a, 2017b). In silico analysis indicates that 5′ UTRs of maternally expressed mRNAs preferentially bind to certain regions present in the maternal-type 18S rRNA, whereas somatic-type 18S rRNA has a better affinity for maternally nonexpressed mRNAs (Locati et al., 2017b; Fig. 3 A).
Heterogeneity from rRNA modifications
rRNA is extensively modified posttranscriptionally, adding another regulatory layer to its heterogeneity. Considering the extensive interactions between rRNA and various proteins, rRNA modifications are potentially able to modulate the ribosome assembly and fine-tune its specificity in translation (Sharma and Lafontaine, 2015; Sloan et al., 2017). The most common rRNA modifications are ribose 2′-O methylation (2′-O-Me) and pseudouridylation (isomerization of uridine to 5-ribosyl isomer, pseudouridine, ψ). Other identified rRNA modifications include acetylation and methylation of the nucleotide base. Most of these modifications are constitutively present in very conserved regions, important for the maintenance of ribosome structure and function (Decatur and Fournier, 2002; Natchiar et al., 2017). The identification of sites amenable to partial (or substoichiometric) rRNA modification highlights rRNA modifications as a major source of ribosome heterogeneity (Birkedal et al., 2015; Taoka et al., 2016). Furthermore, rRNA modifications are mostly guided by small nucleolar RNAs (snoRNAs), which show tissue-specific expression, providing another source for ribosome heterogeneity (Castle et al., 2010).
Mounting evidence has linked changes in rRNA modification, including defects in the regulators of rRNA modification, to developmental regulation in multiple species. For example, loss of function of three snoRNAs in zebrafish reduces the snoRNA-guided methylation of the target nucleotides in rRNA, and the compromised rRNA modification, even at a single site, causes profound morphological defects and embryonic lethality (Higa-Nakamine et al., 2012). The snoRNA-guided methylation on rRNA is regulated by NPM1, which directly binds snoRNAs and the rRNA 2′-O-methyltransferase fibrillarin (FBL). With unchanged snoRNA abundance and FBL levels, the depletion of NPM1 in human erythroleukemia cells decreases 2′-O-Me levels and some internal ribosome entry site (IRES)–translated proteins. Its depletion also reduces colony-formation potential and increases erythroid differentiation. Moreover, Npm1 inactivation in mouse adult HSCs leads to bone marrow failure. These results suggest the important functions of NPM1-mediated 2′-O-Me in HSC maintenance and erythroid development (Nachmani et al., 2019). Pseudouridylation of rRNA is also reported to have functional importance. Impaired rRNA pseudouridylation by mutations in the Dkc1 gene that encodes dyskerin, a pseudouridine synthase, perturbs the translation of mRNAs containing IRES elements in mice and human cells (Yoon et al., 2006). The catalytically impaired DKC1 leads to defects in ribosome activity with decreased translational fidelity and IRES-dependent translational initiation (Jack et al., 2011). Likewise, the catalytic activity of dyskerin is also required for HSC differentiation (Bellodi et al., 2013). In addition, the alterations in these rRNA modifications have been widely characterized in cancers and other conditions (Dimitrova et al., 2019; Kadumuri and Janga, 2018; Huang et al., 2020; Tusup et al., 2018). A systematic mapping of rRNA 2′-O-Me in paired cancer cell lines (HCT116 p53 +/+ and −/−) identified 106 modified sites on mature rRNAs and revealed only some of these 2′-O-Me sites are vulnerable to variation. Interestingly, these vulnerable sites lie peripherally on ribosomal subunits while other sites are present at the core, which are made more robustly. These results indicate that those vulnerable sites of rRNA 2′-O-Me are subject to specific regulation in cancers (Sharma et al., 2017; Natchiar et al., 2018).
Of note, studies using either qRT-PCR (Sampath et al., 2008; Corsini et al., 2018) or northern blots (Zhang et al., 2014) to measure rRNA levels with certain simple rRNA probes only show the change of rRNA abundance without focusing on its heterogeneity (i.e., rRNA variants or modifications), which overlooked the heterogeneous nature of the changed rRNA population. Thus, rRNA expression level changes, such as during stem cell differentiation and development, may potentially reflect the change of rRNA heterogeneity (i.e., rRNA variants or modifications) in these processes. Additionally, in most cases, it remains unclear whether these rRNA variants and rRNAs with changed modifications are incorporated into translationally active ribosomes or not. All of these warrant the additional scrutiny on functional significance of such heterogeneity in these processes.
RP heterogeneity and specialized functions
RPs extensively interact with rRNAs, making up the ribosomal subunits involved in the translation process. Compared with rRNA, the variance in RPs is more complex. The eukaryotic ribosome has 70–80 RPs with various RP composition and stoichiometry of specific RPs in individual ribosomes, and some RPs have paralogous genes. RPs also have distinct posttranslational modifications (PTMs). Variance from all these elements and the small sizes of many RPs (median of ∼150 amino acids) challenge accurate quantification of candidate RPs with mass spectrometry–based proteomics methods. Also, ribosomal subunit fractions contain immature ribosome biogenesis particles, and there are free RPs not assembled into ribosomes, which complicate the RP-related analysis. The development of technologies has enabled more advanced quantitative strategies, and many of these strategies have been employed to study RPs, extending our knowledge of translational control exerted by heterogeneous RPs in stem cells and development (Peterson et al., 2012; Slavov et al., 2015; Shi et al., 2017; Simsek et al., 2017).
Heterogeneity from stoichiometry of RPs
Various expression levels of different RPs are reported in multiple human tissues and some regions of the mouse embryo (Bortoluzzi et al., 2002; Kondrashov et al., 2011). For example, some ribosomal subunits were identified as stem cell markers due to their highly enriched expression levels in mESCs (e.g., Rps2/uS5 and Rpl13/eL13; Sharov et al., 2003), hESCs (e.g., RPL13A/uL13; Richards et al., 2004), and inner cell mass of human blastocysts (e.g., RPL14/eL14, RPL7A/eL8, RPL19/eL19, and RPL32/eL32; Adjaye et al., 2005). In different tissues and developmental conditions, cells can alter the stoichiometry of RPs assembled into the ribosome to modify its translation specificity. RP stoichiometry in ribosomes was identified in bacteria and yeast, which shows that one or more copies of certain RPs are present in the large ribosomal subunit and that substoichiometry of these RPs sensitizes yeast to cold temperature and affects the translation of certain mRNAs (Ballesta and Remacha, 1996; Remacha et al., 1995). In mESCs, quantification of RPs between monosome and polysome fractions using tandem mass tags mass spectrometry revealed a low polysome-to-monosome ratio, the characteristic feature of ESCs (Slavov et al., 2015). The examination of each RP revealed differential stoichiometry among mouse RPs in monosomes and polysomes. The stoichiometry among RPs also depends on the number of ribosomes bound per mRNA and the growth conditions. These results also indicated that the differential RP stoichiometry and related biological functions are conserved between mice and humans. Subsequently, an absolute protein quantification using selected reaction monitoring–based proteomics strategy was employed to determine the exact RP stoichiometry in mESCs (Shi et al., 2017). This quantification included 15 RPs, among which 4 RPs (RPS7/eS7, RPS25/eS25, RPL10A/uL1, and RPL38/eL38) were identified as significantly substoichiometric in polysomes. A comparison of the RP stoichiometry in polysomes to free ribosomal subunits identified some RPs with a higher (RPS26/eS26, RPL10/uL16, RPL38/eL38, and RPL40/eL40) or lower (RPS7/eS7, RPS25/eS25, and RPL10A/uL1) abundance in polysomes compared with free subunits (Shi et al., 2017). During EB differentiation, RPS18/uS13, RPL7A/eL8, and RPL36A/eL42 protein levels are originally high in mESCs, transiently decrease upon differentiation, and increase again later (Corsini et al., 2018). The differential RP expression levels may also be important for extraribosomal functions, as will be discussed later.
Heterogeneity from compositions of RPs
Various compositions of RPs are one important source of ribosomal heterogeneity. Dysregulation of RP composition affects translation and leads to developmental defects.
For instance, in zebrafish, an individual knockdown of 21 RP genes revealed that depletion of 19 RPs caused obvious developmental deformities, with varying degrees of abnormality in the eyes, ears, brain, and body trunk (Uechi et al., 2006). In mice, deficiency in RPL24/eL24 leads to down-regulation of protein synthesis as well as cellular proliferation and causes physical anomalies, including the kinked tail and preaxial polydactyly (extra digits; Oliver et al., 2004). Mechanistically, variations in RP composition affect the translation specificity and consequently elicit relevant cellular phenotypes. Translational preference of ribosomes with different RP compositions remains an area of active research, and some recent studies have addressed RP-specific translational regulation in stem cells and development.
One example is RPL38/eL38. Deletion in the Rpl38/eL38 gene is identified as the causative mutation for Tail short (Ts) hemizygous mice. The Rpl38/eL38 mutant mice exhibit skeletal patterning defects with the presence of 14 instead of 13 ribs, suggesting that the first lumbar vertebra is transformed anteriorly into a thoracic vertebra. In Rpl38/eL38 mutant embryos, global translation is unchanged, but the translation of a select subset of Homeobox (Hox) mRNAs, the critical regulators controlling axial skeletal morphology, is impaired (Kondrashov et al., 2011). In studying the RNA sequences within the 5′ UTRs of these Hox mRNAs, two RNA regulons were uncovered: a translation inhibitory element (near the 5′ cap) that blocks the canonical cap-dependent translation and a structured IRES element (near the main start codon) that facilitates the recruitment of ribosomes containing RPL38/eL38 to initiate translation. Such two-pronged translational regulation confers specific controls in embryonic development by using specialized ribosomes (Xue et al., 2015; Fig. 3 B).
Similar to the preference of RPL38/eL38-containing ribosomes to certain IRES RNA elements, the ribosomes carrying RPS25/eS25 are also required for translation initiation of multiple classes of viral IRESs, such as the hepatitis C virus IRES and the cricket paralysis virus intergenic region IRES (Landry et al., 2009; Hertz et al., 2013). A chemical cross-linking study revealed the strong interaction between RPS25/eS25 and cricket paralysis virus intergenic region IRES (Nishiyama et al., 2007), and a cryoelectron microscopy study showed the direct contact of RPS25/eS25 to hepatitis C virus IRES (Quade et al., 2015), suggesting that RPS25/eS25 can recognize certain IRES elements and control the relevant translation initiation. To identify mRNAs bound by RPS25/eS25-containing ribosomes in mESCs, “RPS25/eS25-Ribo-seq” was performed by a ribosome profiling protocol using FLAG-immunoprecipitation of cellular extracts from a knock-in mESC line harboring an endogenously tagged Rps25/eS25-3xFLAG allele (Shi et al., 2017). The results showed that the bound transcripts are enriched in specific cellular processes, such as the cell cycle, cell morphogenesis in differentiation, and vitamin B12 pathway. Shi et al. also performed “RPL10A/uL1-Ribo-seq” and found that in mESCs, RPL10A/uL1-containing ribosomes preferentially translate distinct subpools of mRNAs, including those important for system development and extracellular matrix organization. They further showed that RPL10A/uL1-containing ribosomes can regulate specific translation through IRES elements (Shi et al., 2017; Fig. 3 C).
Besides the RPs discussed, there are other RPs that are involved in IRES-mediated translation. In mouse erythroblasts, depletion of Rps19/eS19 or Rpl11/uL5 affects the IRES-dependent translation of Csde1 and Bag1. The lacking of Bag1 is embryonic lethal, and low Csde1 expression inhibits erythroid proliferation and differentiation (Horos et al., 2012). Another example is RACK1, an RP of the 40S ribosomal subunit. In Drosophila, RACK1 is ubiquitously expressed in developing embryos, and its loss is embryonic lethal (Kadrmas et al., 2007). In mice, homozygous deletion of Rack1 leads to embryonic lethality at gastrulation, while heterozygous mice are viable, but with skin pigmentation defects (Volta et al., 2013). RACK1 is essential for IRES-dependent translation in Drosophila and in human hepatoma cells (Majzoub et al., 2014). Future studies should look into the mechanisms underlying RACK1-mediated IRES-dependent translation in embryonic development. Apart from IRES-mediated cap-independent translation, viral transcripts can also be translated in a cap-dependent manner by specialized ribosomes through recognition of other cis-regulatory RNA motifs. For example, RPL40/eL40 recognizes specific cis-regulatory determinant in vesicular stomatitis virus mRNAs, whose translation is activated in a cap-dependent mode (Lee et al., 2013a). The Kozak sequence is another cis-regulatory RNA motif that can be preferentially recognized by specialized ribosomes. This sequence is well known to be important in the translation initiation (Kozak, 1986). The mRNAs with full Kozak consensus are recognized by RPS26/eS26-containing ribosomes for translation initiation, while the mRNAs with less conserved Kozak sequence are translated by RPS26/eS26-depleted ribosomes (Ferretti et al., 2017). Such specialized translation regulation is consistent with the finding that Rps26/eS26 resides upstream of the translation initiation site and directly interacts with the residues before the start codon (Pisarev et al., 2008). In humans, RPS26/eS26 is frequently mutated in Diamond–Blackfan anemia (Boria et al., 2010) and the study in yeast shows that the accumulation of Rps26/eS26-deficient ribosomes is part of the response to multiple stresses (Ferretti et al., 2017). The relevant functions of RPS26/eS26 in distinct tissues and developmental stages need further investigation. Moreover, because some RPs have been shown to have some extra-ribosomal functions (Warner and McIntosh, 2009; Zhou et al., 2015), further work such as the identification of specific RP on the ribosome is necessary to properly ascribe the phenotypes caused by RP dysregulation to specialized ribosome-mediated translational control. Otherwise, extraribosomal roles of RPs cannot be ruled out.
Heterogeneity from paralogs of RPs
The presence of paralogous genes of RPs produces more complexity to RP heterogeneity. Paralogues of RP coding genes widely exist in bacteria, yeast, and plants. In contrast, most mammalian genes coding for RPs are represented in a single copy, with only a few exceptions. One exception is RPL22/eL22. In mice, RPL22/eL22 can repress the expression of its paralog, Rpl22/eL22-like 1 (Rpl22l1/eL22l1), by binding to an internal hairpin structure in Rpl22l1/eL22l1 mRNA and destabilizing it. In the Rpl22/eL22−/− mice, Rpl22l1/eL22l1 has a significantly compensatory increase in its expression and RPL22L1/eL22L1 can be incorporated into ribosomes (O’Leary et al., 2013). Although RPL22/eL22 and RPL22L1/eL22L1 share >70% identical amino acid sequence, there are differences in their developmental functions and organismal phenotypes. For example, Rpl22/eL22−/− mice are phenotypically normal except for specific defects in T and B cell development (Anderson et al., 2007), whereas Rpl22l1/eL22l1 deficiency is embryonic lethal (Zhang et al., 2017), indicating that they cannot fully compensate for each other’s roles. Moreover, RPL22/eL22 and RPL22L1/eL22L1 can perform extraribosomal functions during gastrulation, through antagonistically controlling the splicing of Smad2 pre-mRNA (Zhang et al., 2017). They also play distinct and antagonistic roles in hematopoiesis by binding to Smad1 mRNA with opposing effects on Smad1 expression (Zhang et al., 2013). Further studies are needed to find out whether these posttranscriptional regulations are through ribosomal or extraribosomal effects.
Another example is RPS4/eS4. In humans, there are three genes encoding RPS4/eS4, one (RPS4X/eS4X) on the X chromosome and the other two (RPS4Y1/eS4Y1 and RPS4Y2/eS4Y2) on the Y chromosome (Fisher et al., 1990; Skaletsky et al., 2003; Lopes et al., 2010). RPS4X/eS4X and RPS4Y1/eS4Y1 are ubiquitously expressed, while RPS4Y2/eS4Y2 is specifically expressed in testis and prostate (Lopes et al., 2010). The polypyrimidine tract characterizes the promoters of the constitutively expressed RP genes, including the promoter of RPS4Y1/eS4Y1. This tract is disrupted in RPS4Y2/eS4Y2, which may explain the loss of its constitutive expression. Comparative structure modeling showed very similar structures of the three proteins with a unique hydrogen bond in RPS4Y2/eS4Y2 between its central and C-terminal domains. The C-terminal domain is more exposed at the ribosome surface, and this difference may enable the interaction of RPS4Y2/eS4Y2 with different RPs or extraribosomal factors, exerting testis-specific functions (Lopes et al., 2010). A paralog of RPS4/eS4 exhibiting testis-prominent expression is also identified in mice, known as Rps4/eS4-like, but it is an autosomal gene while the mouse Rps4/eS4 is located on X chromosome as human RPS4X/eS4X (Sugihara et al., 2013). The expression pattern of all RPs and their paralogs across 22 different mouse tissues indicates that expression levels of RP paralogs have a much higher variability across tissues than other RPs (Wong et al., 2014). The results in both mouse and human cell lines also indicate that RP paralogs evolved more recently to additionally provide a greater level of gene expression control (Guimaraes and Zavolan, 2016; Wong et al., 2014).
Heterogeneity from modifications of RPs
PTMs of RPs introduce another layer of heterogeneity to ribosomes. It is a relatively simple way to modify ribosomes compared with energy-consuming de novo protein synthesis and assembly of ribosomes with new RPs. PTMs are covalent modifications that can change the structure, stability, subcellular localization, interacting partners, and functions of modified RPs and consequently contribute to translational control in stem cells and development. Proteomics studies have identified numerous PTMs on RPs, including phosphorylation, ubiquitination, methylation, acetylation, and hydroxylation (Simsek and Barna, 2017).
The phosphorylation of mammalian RPS6/eS6 is the first inducible PTM identified on ribosomes, which was discovered in 1974 from rat liver samples (Gressner and Wool, 1974). RPS6/eS6 phosphorylation is downstream of cellular signal pathways that respond to multiple external stimuli, including the PI3K and mTOR pathways (Meyuhas, 2015). Physiological or pharmacological stimuli in mice that activate neurons result in an increase of RPS6/eS6 phosphorylation, which led to the use of phosphorylated RPS6/eS6 as a marker of activated neurons (Kelleher et al., 2004; Knight et al., 2012). Although RPS6/eS6 phosphorylation is involved in multiple growth signals and stimuli, Rps6/eS6 phosphorylation-deficient mice are viable with only subtle, tissue-specific phenotypes (Ruvinsky et al., 2005, 2009). The second example of phosphorylated RP is RPL13A/uL13, identified from human monocytic cells. RPL13A/uL13 is phosphorylated upon the innate immune response, and this PTM leads to the release of RPL13A/uL13 from the ribosome (Mukhopadhyay et al., 2008). Released phosphorylated RPL13A/uL13 has an extraribosomal function as part of the GAIT complex to bind and inhibit the translation of specific inflammation-related mRNAs (Mazumder et al., 2003; Kapasi et al., 2007; Fig. 3 D). Other examples of RP phosphorylation include RPL12/uL11 phosphorylation regulating translation of mitosis-related transcripts in human proliferating cells (Imami et al., 2018), RPS15/uS19 phosphorylation important for the neurotoxicity phenotype in Parkinson’s disease (Martin et al., 2014), and RACK1 phosphorylation induced upon poxvirus infection (Jha et al., 2017).
Another extensively studied PTM on ribosomes is ubiquitylation, whereby a ubiquitin is covalently attached to a substrate protein singularly (monoubiquitylation) or in long chains (polyubiquitylation). Ubiquitin is a highly conserved small peptide and is highly enriched in cells, representing up to 5% of the total cellular protein content (Park and Ryu, 2014). Interestingly, RPs are important sources for such a big ubiquitin pool. In both humans and mice, there are four genes encoding ubiquitin: UBB and UBC encoding polyubiquitin precursor with ubiquitin repeats and RPS27A/eS31 and UBA52 encoding ubiquitin as in-frame fusions to RPS27A/eS31 and RPL40/eL40, respectively (Park and Ryu, 2014). These RP-ubiquitin fusions (ubiquitin-S27A/L40) are identified across the full breadth of eukaryotic diversity (Sibbald et al., 2019). Ubiquitin can be posttranslationally cleaved from RP–ubiquitin fusions. An early study shows that the transient association between these ubiquitin molecules and their originally fused proteins can promote their incorporation into the nascent ribosome to adjust the ribosome structure and is required for efficient ribosome biogenesis, suggesting specific roles of the ubiquitin generated from RP-ubiquitin fusions (Finley et al., 1989). Consistent with this indication, a recent study shows that the deficiency of the RPL40/eL40–ubiquitin fusion gene UBA52 does not change the global ubiquitin level but decreases the ubiquitylation of ribosomes, which is required for efficient translation. Consequently, the deficiency of Uba52 in mice is embryonic lethal (Kobayashi et al., 2016).
Besides being sources to ubiquitin, RPs can also be ubiquitylated as substrates. Ubiquitylation of RPs can mark them for degradation by the proteasome (degradative ubiquitylation) or alter their functions (regulatory ubiquitylation). Excess RPs that are not assembled into ribosomes can be polyubiquitylated and degraded. This regulation is very important for proper ribosome functions, as RPs are usually expressed at high levels beyond what is actually needed (Sung et al., 2016; Lam et al., 2007). During erythroid differentiation, the expression of UBE2O, a hybrid E2 (ubiquitin-conjugating)–E3 (ubiquitin ligase) enzyme, is induced in reticulocytes. UBE2O ubiquitinates some RPs for degradation, driving the transition from a complex to a simple proteome, which is needed for the transformation of reticulocytes into erythrocytes (Nguyen et al., 2017; Yanagitani et al., 2017). Thus, UBE2O is very important in the differentiation of red blood cells, and its deficiency in mice causes anemia (Nguyen et al., 2017). Moreover, several RPs have been shown to undergo regulatory ubiquitylation in response to cellular stresses. For instance, the site-specific regulatory ubiquitylations of RPS2/uS5, RPS3/uS3, and RPS20/uS10, which occur on assembled cytoplasmic ribosomes, are stimulated by activation of the unfolded protein response (UPR), a cellular response to proteotoxic stress in the ER. These regulatory ribosomal ubiquitylations occur on a timescale coincident with eIF2α phosphorylation, which reduces protein synthesis. Defective ubiquitylation of these RPs sensitizes cells to UPR-induced cell death (Higgins et al., 2015). Interestingly, two of these ubiquitylated RPs, RPS3/uS3 and RPS20/uS10, together with an LSU, RPL10/uL16, are found to be ufmylated by UFM1 (ubiquitin-fold modifier 1) in mESCs (Simsek et al., 2017). UFM1 is a recently identified ubiquitin-like modifier. Mice lacking UFL1, a UFM1 E3 ligase, exhibit severe anemia, cytopenia, and death (Zhang et al., 2015), and similarly, mice without UBA5, a UFM1 E1 activating enzyme, die in utero with severe fetal anemia (Tatsumi et al., 2011). The molecular function of this modification remains to be defined, but the overlap of these two modifications, ufmylation and ubiquitylation, on RPS3/uS3 and RPS20/uS10 are very intriguing, indicating their combinatorial presence and potential cooperative function on the same RP.
The transfer of ubiquitin to protein substrates is directly catalyzed by a ubiquitin ligase and regulated by other factors. ZNF598, a ubiquitin ligase, is required for the regulatory ubiquitylation of RPS10/eS10 and RPS20/uS10. RACK1, a 40S RP, regulates the ubiquitylation of RPS2/uS5, RPS3/uS3, and RPS20/uS10 (Sundaramoorthy et al., 2017). The ubiquitylation of RPs regulated by ZNF598 and RACK1 happens on the 40S–40S interface of the collided diribosomes (Juszkiewicz et al., 2018). Diribosomes are usually formed in stalled ribosomes by which a trailing ribosome encounters a slower leading ribosome. A long stretch of adenosine nucleotides, like the poly(A) tail, stall ribosomes and inhibit downstream translation, which requires the regulatory ubiquitylation of certain RPs. These events activate the ribosome-associated quality control (RQC) pathway, which can resolve the stalled ribosomes and recycle ribosomal subunits. This is an evolved failsafe mechanism to protect ribosomes from accidentally translating into the poly(A) tail (Juszkiewicz and Hegde, 2017; Sundaramoorthy et al., 2017; Garzia et al., 2017; Brandman and Hegde, 2016). Hel2, the mammalian homologue of ZNF598 in yeast, can recognize the stalled ribosomes at poly(A) tail as well as the dicodon CGA-CCG, which can efficiently cause ribosomal stalling. It then ubiquitylates RPS20/uS10, directing the ribosome to RQC pathway (Matsuo et al., 2017; Ikeuchi et al., 2019). The failure of this process results in protein aggregation and proteotoxic stress in yeast and neurodegeneration in mice (Shen et al., 2015; Chu et al., 2009; Choe et al., 2016; Yonashiro et al., 2016; Defenouillère et al., 2016; Fig. 3 E). These findings strongly suggest that the structural unit for RQC is a diribosome and indicate that mRNA translation can be regulated, particularly monitored, by a higher-order ribosome architecture, which may serve as another layer of ribosome heterogeneity.
RAP heterogeneity and specialized functions
In addition to rRNAs and canonical core RPs, ribosomes have hundreds of ribosome-associated proteins (RAPs), which tremendously increase the size and complexity of ribosomes and further expand ribosomal heterogeneity and functional diversity on mRNA translation. The structure on ribosomal surface is important for such RAP assembly. The eukaryotic ribosomes have variable expansion segments (ESs) in their rRNAs, some of which resemble tentacle-like extensions exposed on the ribosome surface. ES27L, one of the largest ESs, can recruit nascent peptide processing enzymes that are required for translation fidelity (Fujii et al., 2018). The termini of human ESs 7 and 27 are exposed on the ribosomal surface and contain tandem G-tracts that can form highly stable G-quadruplexes. These G-quadruplexes can bind to RAPs, including helicases (such as DDX3, CNBP, DDX21, and DDX17, all of which are reported to be able to unfold G-quadruplexes) and some heterogeneous nuclear ribonucleoproteins (Mestre-Fos et al., 2019).
Several identified RAPs have been shown to specifically regulate translation of specific subsets of mRNAs. A well-known example is FMRP, the loss of which causes fragile X syndrome. FMRP directly binds to the coding regions of transcripts associated with synaptic signaling and autism spectrum disorders (Darnell et al., 2011), possibly through the specific binding of its RGG motif (arginine-glycine-rich box) to guanine-quadruplex–containing RNAs (Vasilyev et al., 2015). FMRP can also directly interact with RPL5 protein, and this interaction happens near the binding sites for tRNA and translation elongation factors, precluding their incorporation to the ribosome and inhibiting the translation of FMRP targets (Chen et al., 2014). To determine the magnitude and the components of the “ribo-interactome,” an affinity enrichment for endogenously tagged ribosome subunits in mESCs was performed (Simsek et al., 2017). This interactome is composed of >400 proteins, including RPs and translation initiation and elongation factors, which have been expected, as well as many other proteins with known functions, such as RNA helicases, RNA modifiers, cis-regulatory element readers, protein modifiers, and proteins involved in cell cycle, cellular metabolism, and mRNA stability. These proteins may specialize the structure and function of their associated ribosomes and change their translation efficiency on certain subset of mRNAs. For example, the ribo-interactome includes well-known readers (YTHDF1, YTHDF3, and eIF3) of N6-methyladenosine modifications on mRNAs. 5′ UTR N6-methyladenosine is up-regulated upon diverse cellular stresses, and it can promote cap-independent translation in response to cellular stresses (Meyer et al., 2015). Future work is required to provide the direct validation of such RAP-induced translation specialization. PKM2 is another interesting partner identified and further studied in this ribo-interactome (Simsek et al., 2017; Fig. 3 F). In mESCs, PKM2 is enriched on ER-associated ribosomes and directly binds ER-associated mRNAs to facilitate their translation. A primary role for the ER is in the biogenesis of secretory and membrane proteins, which is well established by many studies showing the subcellular distributions of transcripts between cytosol and ER compartments (Reid and Nicchitta, 2011, 2015; Voigt et al., 2017). A very recent study shows that several ER membrane proteins, like SEC61β (a subunit of the SEC61 translocon) and LRRC59, are associated with ER-bound ribosomes. The mRNAs translated by SEC61β- or LRRC59-labeled ribosomes reveal commonly shared and some specifically enriched transcripts, indicating the contribution of these RAPs to translational specialization (Hoffman et al., 2019). These intriguing findings suggest RAPs can associate with subpools of ribosomes in different cellular compartments to display preferential translation, indicating another layer of ribosomal heterogeneity arising from spatial regulation of ribosomes.
Ribosome concentration model
Even though extensive studies have shown the significance of ribosomal heterogeneity to gene-specific regulations of translation, how ribosomes contribute to specific mRNA translation remains a matter of debate. Apart from ribosomal heterogeneity, many reports show that much of the specificity observed in mRNA translation resulting from ribosomes could be explained by a ribosome concentration model (Mills and Green, 2017). In this model, ribosome concentration can affect global and mRNA-specific translational control. Variable dependence of mRNAs on cellular ribosome concentration is mostly due to their difference in the translation initiation rates that results from mRNA-specific features (such as IRES elements, 5′ UTR structures; Lodish, 1974; Noderer et al., 2014). mRNAs with poor translation initiation rates are typically more sensitive to ribosome concentration fluctuations, and certain cell types and tissues that reply on the protein levels of such mRNAs are consequently more vulnerable to perturbations in ribosome concentration (Lodish, 1974; Mills and Green, 2017), such as the important erythroid transcription factor GATA1 in the context of Diamond–Blackfan anemia (Khajuria et al., 2018; Ludwig et al., 2014). The ribosome concentration model was further supported by a recent study demonstrating a preferential translation of synaptic mRNAs by monosomes, but not polysomes, in neuronal processes (Biever et al., 2020). For more information on the ribosome concentration model, readers are referred to the review by Mills and Green (2017). The models of ribosome heterogeneity and ribosome concentration are not mutually exclusive. While our current review highlights impacts of ribosomes on mRNA-specific translational control in stem cells and development, we should not ignore the possibility of other reasonable models, which may complement each other and/or be tightly coordinated to control mRNA translation.
Concluding remarks and future perspectives
Ribosomes are multisubunit molecular machines and directly responsible for the translation of the genetic information into the ultimate functional product proteins. Ribosomes were initially considered as indiscriminate machines that comprise the same components in any cells or conditions. However, studies over the past decades have proven that ribosomes are heterogeneous macromolecular complexes with both general translation functions and also specialized translation roles of specific transcripts. Several related hypothetical theories have been proposed, including the ribosome filter hypothesis (Mauro and Edelman, 2002), ribosome code (Komili et al., 2007), and ribosome concentration model (Mills and Green, 2017), to introduce another regulatory level in the control of gene expression. Ribosomes comprise rRNAs and RPs. Any differences in these components can lead to ribosome heterogeneity; rRNA variants and modifications can give rise to the formation of heterogeneous ribosomes, and ribosomal heterogeneity can also be introduced by RPs through their differential stoichiometry, varying components, existence of paralogs, and numerous PTMs. Apart from these core components, RAPs, which bind to the ribosome surface, can also contribute to ribosome heterogeneity and broaden its regulatory capacity. In addition to these factors, cellular localization of ribosomes and higher-order ribosome architecture may contribute to ribosome heterogeneity as well. On the other hand, different mRNAs have variable sensitivities to subtle alterations in ribosome levels. All of these regulatory controls from ribosomes prove to be an integral regulatory mechanism to modulate and fine tune protein abundance in cells. This mechanism is particularly important in response to internal and external stimuli, like during embryogenesis, as ribosomes can directly change the protein levels in a speedy way, preceding the formation of a new chromatin landscape and transcription program that constitute a secondary response important for long-term adjustment.
For heterogeneous ribosomes, there are many gaps and questions left for future investigations (see text box) to gain comprehensive understanding of this translational machinery in stem cells and development. We should also bear in mind that variable expression levels of certain ribosome-related components (such as rRNA variants, RPs, and RAPs) do not necessarily mean that these components are actually incorporated into ribosomes and result in various functionally distinct populations of ribosomes in cells. Most cases that we discuss here lack such direct verifications. To decipher the heterogeneity in ribosomes and characterize their functions in specialized translation, researchers need techniques to identify heterogeneous ribosomes and specific translated transcripts, as well as the related comprehensive molecular analyses from transcriptome to translatome and proteome. And as this mechanism is extremely critical during cell fate transition, the studies need to be conducted under various cellular conditions to determine the roles of heterogeneous ribosomes during the process. Recently, many advanced techniques have been developed to suit such needs. These techniques are very useful to advance our understanding of heterogeneous ribosomes and their related translation functions, but we still need more advanced technologies. Most mentioned technologies are performed in cultured cells, and related technologies that can be done at the single-cell level and in vivo (such as embryos and tissues) are needed to enable us to study heterogeneity of ribosomes within single cells and during different developmental stages. For example, it would be very interesting to know the changes of ribosome components or other modulations during embryogenesis that may indicate the existence of totipotency-, pluripotency-, multipotency-, unipotency-, and certain differentiation-associated ribosomes, which in turn can be used to achieve “cellular alchemy” for regenerative medicine and disease therapeutics.
1. For the genetic information flow based on “the central dogma,” is there a specialized class of rDNAs/rRNAs/RPs that is specifically active (or silent) during stem cell maintenance or differentiation? If so, what are the chromatin, sequence, and epigenetic features of this class of rDNAs/rRNAs/RPs?
2. How is ribosome heterogeneity selectively determined, including alterations from rRNAs and RPs? Multiple regulatory mechanisms should be involved, like transcription and translation. Signals from both outside and inside of cells are also supposed to confer and control the heterogeneity, as ribosome heterogeneity exists not only intercellularly but also intracellularly.
3. Following the previous question, how are heterogeneous ribosomes formed? For example, are ribosomes without certain RP (e.g., RPL38/eL38 and RPS26/eS26) formed from mature ribosomes containing the RP, or are the newly produced ribosomal subunits formed without the RP? Ribosome assembly is an efficient but complicated process (Warner, 1999; Davis and Williamson, 2017). During this process, transient rRNA–RP interactions chaperone rRNA folding, which is essential for pre-rRNA processing and the following rRNA–RP assembly, and many other assembly factors have been identified (Bohnsack and Bohnsack, 2019; Duss et al., 2019; Rodgers et al., 2019; Prattes et al., 2019), indicating that assembly of ribosomes, especially heterogeneous ribosomes, is significantly more complex than previously thought.
4. How does the specific translation happen? Is it through the interaction between ribosomal compositions and cis-regulatory elements in mRNAs, including a specific sequence, modification, or structure (like IRES)? A major knowledge gap exists in our current understanding of these cis-regulatory elements in mRNAs, especially with regard to their interface in trans with different factors and how this interface results in the recruitment of or resistance to certain mRNA subset. These in trans factors include different RNA-binding proteins, such as RPs, RAPs, and translation initiation factors, as well as rRNAs, tRNAs, and noncoding RNAs.
5. How heterogeneous are ribosomes in a certain cell type? To answer this question, we may need to know how diverse the ribosome population needs to be in any given cell. Is it possible that in a certain cell type, one type of the ribosome is enough to maintain cell viability and basic functions? In responses to stress and stimuli, does the extent of ribosomal heterogeneity increase or decrease?
6. rRNAs are transcribed and modified by multiple proteins, and RPs and RAPs are also translation products. It is also interesting to know whether there is a positive/negative feedback loop to enhance/inhibit certain ribosome heterogeneity in stem cells, certain lineage differentiation, or tissue development.
7. Considering the ribosome concentration model, which may complement ribosome heterogeneity, how do these alterations in cellular ribosome concentrations and ribosome compositions affect stem cell fates and development? To explore this question, we need to know how to determine protein synthesis rates of one certain mRNA (1) under gradient concentrations of ribosomes with the same composition or (2) under the same concentration of ribosomes with different compositions. These measurements are very technically challenging, but they are critical to the understanding of cohesive molecular mechanisms connecting quantitative/qualitative regulation of ribosomes to phenotypic consequences of specific mRNA translations in stem cells and development. Additionally, we should not exclude the possibility that translation of one certain mRNA may be robust to all ribosomes with mentioned heterogeneity if ribosome concentrations are not changed, in which the mRNA feature and ribosome concentration have a larger impact on translational control.
Last, but not least, the model of large multisubunit complexes with both general and specific functions is not unique to ribosomes. Another similar well-studied example is the Mediator, a critical regulator that has both general and specialized functions in transcription with altering compositions and activities (Poss et al., 2013; Cramer, 2019). More similar examples are histone octamers in DNA packaging (Soshnev et al., 2016; Talbert and Henikoff, 2017) and proteasomes in protein degradation (Morozov and Karpov, 2018; Bousquet-Dubouch et al., 2011). These examples indicate that heterogeneity may widely exist in such complicated multisubunit complexes, especially the ones with critical cellular functions. Such heterogeneity may enable their specialized functions in orchestrating gene expression at different regulatory levels in stem cells and development, as well as in human diseases. Studies of such heterogeneity and related specialized functions not only reveal novel gene regulatory mechanisms but also bring insights into diagnostics and therapeutics of human diseases.
Acknowledgments
We are grateful to Jill Gregory for the illustrations (Figs. 1, 2, and 3) and members of Wang laboratory for helpful discussions.
This study was supported by the National Institutes of Health (grants GM129157, HD095938, HD097268, and HL146664) and New York State Stem Cell Science (NYSTEM grants C32583GG and C32569GG).
The authors declare no competing financial interests.
Author contributions: D. Li and J. Wang both conceived and wrote the review.