Dynamic regulation of gene expression in response to changing local conditions is critical for the survival of all organisms. In metazoans, coherent regulation of gene expression programs underlies the development of functionally distinct cell lineages. The cooperation between transcription factors and the chromatin landscape enables precise control of gene expression in response to cell-intrinsic and cell-extrinsic signals. Many of the chemical modifications that decorate DNA and histones are adducts derived from intermediates of cellular metabolic pathways. In addition, several of the enzymes that can remove these marks use metabolites as part of their enzymatic reaction. These observations have led to the hypothesis that fluctuations in metabolite levels influence the deposition and removal of chromatin modifications. In this review, we consider the emerging evidence that cellular metabolic activity contributes to gene expression and cell fate decisions through metabolite-dependent effects on chromatin organization.
All organisms must adapt to changing environmental conditions to survive and thrive. Therefore, scientists have long studied how changes in nutrient availability influence cellular behaviors. Seminal work by Jacob and Monod (1961) investigating how single-cell organisms adapt to alterations in nutrient supply led to the discovery of the lac operon and laid the groundwork for the modern understanding of gene regulation. After the observation that bacteria could, after a small lag in growth, switch to lactose as a fuel source once glucose was exhausted, Jacob and Monod systematically dissected how bacteria adapt to this metabolic challenge by inducing the expression of genes involved in lactose uptake and catabolism. They proposed a model wherein a metabolite acting as an “inducer” blocks the action of a “repressor” molecule that inhibits expression of a suite of related genes (Fig. 1 A). Subsequent work showed that two metabolic pathways converge to regulate the activity of the lac operon. Allolactose, a product of lactose metabolism, serves as the inducer by binding the repressor, thereby reducing the fraction of repressor that can bind and repress the operon. Cyclic AMP (cAMP), which increases dramatically in the absence of glucose, positively increases transcription of the operon by promoting the binding of a coactivator that recruits RNA polymerase (Fig. 1 A; Lewis, 2005). Thus, the lac operon serves as an AND gate that integrates multiple metabolic inputs to coordinate appropriate gene expression in response to environmental fluctuations. This model, whereby sequence-specific DNA binding proteins regulate the transcription of genes that contain their cognate sequence (Ptashne, 1988) in direct proportion to the ability to bind and recruit RNA polymerase, serves as a basis for how specific gene regulation is thought to be effected.
Nutrient signaling in metazoan organisms is more complex than in prokaryotes. Multicellular organisms have evolved signaling pathways that respond to specific nutrients as well as hormones that reflect organismal metabolic status (Chantranupong et al., 2015). The response of an individual cell (e.g., whether to rewire metabolic pathways to favor an anabolic vs. catabolic state) to such extracellular signals depends in turn on a variety of intracellular nutrient and bioenergetic sensors including AMP-activated protein kinase (AMPK), mammalian target of rapamycin (mTOR), and GCN2. These enzymes sense changes in intracellular metabolites and convert these variations into an output, substrate phosphorylation, which is able to be effected at all ratios of ATP/ADP that exist in viable cells. Collectively, these signaling pathways enable cells to coordinate organismal metabolic status (through extracellular signaling pathways) with intracellular metabolic status. Furthermore, these kinases allow metazoan organisms to enact changes in gene expression over a wide range of variation in the substrates used to maintain bioenergetics.
However, metazoan cells also retain features of direct nutrient sensing within their nuclear organization. All organisms harbor variable levels of chemical modification on their DNA and DNA-associated proteins (Yung and Elsässer, 2017). The deposition and removal of these marks require metabolites that are intermediates of distinct metabolic pathways. This has led to the hypothesis that these chromatin modifications respond to fluctuations in nutrient availability to modulate gene expression. In contrast to the basic model of transcription proposed by Jacob and Monod (1961) in Escherichia coli, it appears gene regulation in more complex organisms has evolved to also be regulated by chemical modifications of DNA and its attendant proteins (Fig. 1 B). In this review, we discuss examples of how metabolite availability shapes gene expression in mammalian cells through effects on chromatin modifications. In particular, we focus on the hypothesis that levels of specific metabolites provide signals that can be integrated in chromatin modifications to influence gene expression.
Theories and limitations
Regulation of gene expression
Viewed from an enzymatic perspective, gene expression involves the recruitment of RNA polymerase to a specific DNA template and the activation of its transcribing ability (Sainsbury et al., 2015). However, in more complex organisms, it appears a prerequisite to this reaction is that the chromatin regions to which RNA polymerase is recruited have previously been rendered accessible (Levine et al., 2014). Accessibility is largely a result of the combined activity of transcription factors, nucleosome remodeling complexes, and chromatin-modifying enzymes that add or remove marks that favor recruitment of specific coactivators. Key to the links between metabolism and gene expression is the fact that these marks are derived from intermediary metabolites.
This hypothesis does not challenge the importance of transcription factors in directing gene expression. However, transcription factors act in concert with a number of chromatin-modifying activities that may sense the metabolic state of a cell. Although controversy still exists as to the causative or correlative nature of chromatin modifications with respect to gene expression, it is clear that metazoan transcription factors do not survey the entire genome landscape (Kolomeisky, 2011) and that chromatin accessibility can be increased or decreased by various chromatin modifications. As demonstrated by the inefficiency of the inducible pluripotent stem cell reprogramming process (Takahashi and Yamanaka, 2006), transcription factors are not free to bind and/or activate any genomic region. In contrast to the deterministic effect of metabolites on gene expression in E. coli as demonstrated by the lac operon model (Fig. 1 A), metazoan cells engage a model in which transcription factors, chromatin remodelers, and metabolic state cofactors act in concert to influence whether specific gene loci are activated or repressed (Fig. 1 B).
Links between metabolites and chromatin modifications
The repertoire of annotated chromatin modifications is large and continually expanding. To date, the most common modifications include methylation of nucleic acids and acetylation, methylation, phosphorylation, and ubiquitination of histones. Additional histone acylations, such as crotonylation, succinylation, and propionylation, arise in the presence of high levels of the specific acyl-donor (Sabari et al., 2017). Here, we focus on the common histone and DNA modifications that are most affected by the metabolic state of the cell.
All methylation reactions require the metabolite S-adenosylmethionine (SAM) as a methyl donor. This metabolite is directly derived from the combined activities of serine-glycine-one carbon metabolism and the methionine cycle (Fig. 2 A). Cytosine DNA methylation is catalyzed by the three DNA methyltransferases, DNMT1, DNMT3a, and DNMT3b (Lyko, 2018). DNMT1, the maintenance methyltransferase, recognizes hemimethylated DNA to produce a symmetrically modified DNA duplex (Gruenbaum et al., 1982). DNMT3a and 3b, in contrast, are de novo methyltransferases that deposit methyl marks on previously unmodified cytosines (Okano et al., 1999). A wide variety of histone methyltransferases, including lysine methyltransferases and arginine methyltransferases, similarly use SAM as a methyl donor. This can result in mono- or dimethylation of arginine residues or mono-, di-, or trimethylation of the amide nitrogen of lysine (Fig. 2 B). In all cases, the reaction results in the donation of a methyl group from SAM to the substrate and the release of S-adenosylhomocysteine, which can serve as a competitive inhibitor of these enzymes or be restored to SAM via intracellular metabolic pathways (Fig. 2 A).
Although DNA and histone methylation can be lost passively through replication or nucleosome eviction, the active removal of methylated moieties can proceed through one of two major chemical mechanisms. The LSD1 family of histone demethylases uses oxygen and a flavin-dependent reaction mechanism to remove methyl moieties from mono- or dimethylated histone residues (Shi et al., 2004). A second, more general mechanism of methyl removal involves the family of α-ketoglutarate (αKG)–dependent dioxygenases (Gao et al., 2017), which encompasses a diverse array of enzymes responsible for histone demethylation (JmjC family of histone lysine demethylases), DNA 5-methyl-cytosine hydroxylation (Tet family of DNA modifying enzymes), RNA N6-methyladenosine (m6A) demethylation (FTO, fat mass and obesity-associated protein, and ALKBH5), and EglN prolyl-4-hydroxylation, among others. All αKG-dependent dioxygenases use oxygen and αKG as substrates and release CO2 and succinate as products. This reaction requires ferrous iron (Fe(II)) as a cofactor and, like LSD1-dependent demethylation, results in hydroxylation of the enzymatic substrate (Fig. 2 B). In the case of lysine or arginine demethylation, this hydroxyl-methyl intermediate is released as formaldehyde, which can ultimately be recycled into formate to fuel nucleotide biosynthesis (Burgos-Barragan et al., 2017).
DNA methylation and several histone lysine methylation states are traditionally considered to exert repressive effects on gene expression (Mozzetta et al., 2015; Rao et al., 2017; Schuettengruber et al., 2017). In contrast, histone acetylation is primarily associated with actively transcribed loci (Xu et al., 2005; Tessarz and Kouzarides, 2014; Fig. 2 B). The acetylation of histone lysines is performed by acetyl-transferases that use acetyl-CoA to modify target proteins, which may not always be histones. Acetyl-CoA can be generated either through the mitochondrial oxidation of carbon sources including glucose, amino acids, and fatty acids or the recycling of acetate groups (Fig. 2 A). Three classes of histone deacetylases (HDAC classes I, II, and IV) remove acyl moieties from histone lysine residues though simple amide hydrolysis (Seto and Yoshida, 2014). Class III HDACs, also known as sirtuins, use a more complex method of deacylation in which the cosubstrate NAD+ is cleaved to form acyl-ADP-ribose and nicotinamide.
Enzymatic considerations for the metabolic regulation of gene expression
Thus, metabolites are integral components of the reactions that regulate chromatin modifications. A multitude of factors can influence the efficiency of these enzymatic reactions. As has been reviewed previously (Fan et al., 2015; Su et al., 2016; Reid et al., 2017), the KM of the enzyme must be equal to, or higher than, the range of substrate concentrations for physiological fluctuations in substrate concentration to affect the rate of the reaction. Therefore, if substrate concentrations are well above enzymatic KM, small variations in substrate concentration will not influence enzymatic rate. For example, cellular ATP concentrations (∼10 mM) are far higher than the levels needed to saturate kinase activity. Furthermore, in viable cells, the ADP and Pi levels are much lower than the ATP levels and therefore play essentially no role in product inhibition. Thus, kinases are unlikely to be influenced by fluctuations in substrate or product concentrations. In the case of enzymes that regulate methylation or acetylation, however, the measured KM is frequently in the range of observed substrate concentrations in cells (Su et al., 2016; Reid et al., 2017).
Therefore, a key question in the field is whether or not substrate concentrations are limiting or product levels are too high to allow constitutive activity of the suite of enzymes that regulate chromatin modifications. This is not a trivial question. First, several additional factors influence enzymatic rate, including cofactor availability, pH, and allosteric regulation by metabolites or enzymatic partners. As an example, high acetyl-CoA levels in the mitochondria (0.1–1.5 mM) coupled with high mitochondrial pH (7.9–8.0) enable nonenzymatic acetylation of mitochondrial proteins to proceed as an energetically favorable event (Wagner and Payne, 2013). At the lower levels of cytoplasmic acetyl-CoA (2–13 µM; Lee et al., 2014) coupled with a lower pH, acetylation of proteins occurs only if catalyzed by cytoplasmic and nuclear acetyltransferases (Wagner and Payne, 2013).
Second, the determination of enzyme KM in vitro does not necessarily provide a reliable measure of enzyme function in vivo. It is impossible to recapitulate in a test tube the complex microenvironment an enzyme would experience in cells, including the presence of multiple metabolites that could compete for the substrate binding site and thus raise the effective KM of the enzyme for its normal substrate. Similarly, measures of in vitro KM do not account for the inclusion of the enzyme in a multiprotein complex as found for the majority of chromatin-modifying enzymes (e.g., histone H3K4 methyltransferases as part of Set1/COMPASS [complex of proteins associated with Set1; Miller et al., 2001] or polycomb repressive complex 2 [PRC2] histone H3K27 methyltransferases as part of PRC2 [Klymenko et al., 2006] methyltransferase complexes). Moreover, local concentrations of enzyme cofactors, allosteric regulators, cosubstrates, end products, and any existing posttranslational modifications of the chromatin-modifying enzymes may influence in vivo KM.
Third, it is almost impossible with current technology to estimate the local concentration of individual metabolites. To date, most estimates of metabolite availability are based on whole-cell analyses and thus scramble different pools with vastly different substrate concentrations (Chen et al., 2016). However, localization of enzymes that produce key metabolites can provide a clue as to the mechanisms cells use to generate local pools of substrates for chromatin modifications (Fig. 3). For example, the pyruvate dehydrogenase complex, traditionally found in the mitochondrial matrix adjacent to the inner mitochondrial membrane, can transit to the nucleus where it generates locally high concentrations of acetyl-CoA to fuel histone acetylation required for transcription of genes involved in S phase (Sutendra et al., 2014) or lipid biosynthesis (Chen et al., 2018). Similarly, nuclear ATP-citrate lyase generates acetyl-CoA pools upon DNA damage, facilitating the histone acetylation required for efficient double-strand break repair by homologous recombination (Sivanand et al., 2017). Another example that involves local nuclear production of acetyl-CoA depends on the function of acetyl-CoA synthetase 2 (ACSS2). Under normoxic conditions, cytoplasmically localized ACSS2 drives the use of acetate for lipogenesis (Kamphorst et al., 2014; Schug et al., 2015). Under hypoxia, however, ACSS2 translocates to the nucleus where it allows the recycling of acetate produced by histone deacetylation reactions for histone reacetylation (Bulusu et al., 2017).
SAM-generating enzymes can also localize to the nucleus. Methionine-adenosyltransferase complex II associates with the large protein complex associated with transcriptional repression of the COX2 locus mediated by methylation of H3K9 by the methyltransferase SETDB1 (Katoh et al., 2011; Kera et al., 2013). Interestingly, this mechanism is not restricted to targeting SAM for repressive functions. The serine-responsive SAM-containing metabolic enzyme (SESAME) complex interacts with the Set1 methyltransferase complex to coordinate H3K4 methylation and H3T11 phosphorylation at the pyruvate kinase M2 locus (Li et al., 2015), leading to its transcriptional activation. These examples suggest that despite the free passage of metabolites from cytoplasm to nucleus through the large nuclear pore complex, local generation of metabolite pools can have biologically relevant functions. Nonetheless, a great deal of work remains to explore how regional pools of metabolites might affect chromatin modifications. A combination of in nucleo chemical assays to probe enzyme responsiveness to variations in metabolites (Shrimp et al., 2015) and chemoproteomic approaches to quantitate metabolite/enzyme interactions in native contexts (Montgomery and Meier, 2016) may begin to answer this question.
Regulation of gene expression by metabolites
Metabolites as substrates
Despite the challenge of estimating the biochemical milieu of chromatin-modifying enzymes in living cells, increasing evidence suggests that changes in metabolite availability do influence chromatin modifications and ultimately gene expression in mammalian cells. In cultured cells, nucleocytosolic pools of acetyl-CoA for histone acetylation are derived primarily through the action of ATP-citrate lyase, which converts mitochondrial-derived citrate to oxaloacetate and acetyl-CoA. Signaling-induced changes in glucose oxidation and ATP-citrate lyase activity increase acetyl-CoA availability and total histone acetylation (Wellen et al., 2009; Lee et al., 2014). Outside idealized cell culture conditions, acetate may provide a major source of carbon for histone acetylation: in hypoxia and in tumors, conversion of acetate to acetyl-CoA by ACSS2 provides an additional source of nucleocytosolic acetyl-CoA and could represent an important recycling mechanism for recapture of acetate after histone deacetylation (Comerford et al., 2014; Schug et al., 2015; Bulusu et al., 2017).
Perturbations in SAM availability likewise influence total levels of histone methylation. In contrast to human cells or differentiated mouse cells, some strains of mouse embryonic stem cells (ESCs) can generate SAM from threonine catabolism; consequently, threonine depletion reduces H3K4me3 levels in these ESCs (Shyh-Chang et al., 2013). In both stem cells and differentiated cells, methionine restriction similarly reduces SAM pools and compromises histone methylation (Shiraki et al., 2014). Human pluripotent cells regulate SAM pools through the action of nicotinamide N-methyltransferase, an enzyme that consumes SAM and buffers low levels of H3K27me3 at key developmental genes (Sperber et al., 2015). In all cases, a major experimental hurdle is to uncouple direct effects on chromatin from secondary changes driven by reduced viability or proliferation that may result from depletion of critical metabolites. This is especially true for studies attempting to manipulate αKG levels, as cultured cells largely derive αKG from glutamine, a metabolite that is critical for proliferation and survival of cultured mammalian cells (Wise et al., 2008). However, mouse ESCs in the ground state of naive pluripotency exhibit the unusual ability to proliferate in the absence of exogenous glutamine. Restricting glutamine in these cells triggered accumulation of several trimethylated histone lysine residues, consistent with impaired αKG-dependent demethylation (Carey et al., 2015). Together, these cell-based studies have begun to lay the groundwork for our understanding of how metabolic networks provide the substrates that regulate chromatin modifications.
Metabolites as inhibitors
An additional determinant of enzymatic rate is the inhibitory effect of enzymatic products and other closely related metabolites. The biological importance of enzyme inhibition by small molecules is underscored by the observation that mutations in metabolic enzymes occur in a variety of human malignancies (Kaelin and McKnight, 2013). For example, loss-of-function mutations in the metabolic enzymes succinate dehydrogenase (SDH; Baysal et al., 2000) and fumarate hydratase (FH; Tomlinson et al., 2002) were originally associated with cancer syndromes including hereditary paraganglioma and pheochromocytomas and hereditary leiomyomatosis and renal cell carcinoma, respectively. Subsequent large scale sequencing studies found FH and SDH mutations in a number of other malignancies (Carvajal-Carmona et al., 2006; Bardella et al., 2011). Mechanistically, high levels of succinate and fumarate driven by loss of SDH or FH function, respectively, can act as competitive inhibitors of αKG-dependent enzymes. In particular, SDH and FH mutations are associated with impaired DNA and histone demethylation and the accumulation of repressive chromatin modifications that contribute to malignant progression (Laukka et al., 2016). In addition, high levels of succinate can inhibit the activity of EglN prolyl-4 hydroxylases, leading to stabilization of the heterodimeric transcription factors HIF1α and HIF2α, which may represent possible mediators of the oncogenic effects of succinate and fumarate accumulation seen in SDH and FH mutant tumors, respectively (Koivunen et al., 2012; Laukka et al., 2016).
The discovery of neomorphic point mutations in the genes coding for isocitrate dehydrogenase 1 and 2 in a number of different cancers was a key finding in expanding the understanding of the connections between metabolism and gene expression. Point mutations in IDH1 (R132) and IDH2 (R172K or R140Q) are seen in tumors including acute myelogenous leukemia (AML), grade II–III glioblastoma, chondrosarcoma, cholangiocarcinoma, and many other tumors at low frequencies. Wild-type IDH1 and IDH2 catalyze the interconversion of isocitrate and αKG, with concurrent reduction of NAD(P)+ to NAD(P)H and release of CO2 in the forward reaction. The neomorphic mutant enzymes catalyze the irreversible partial reduction of αKG to d-2-hydroxyglutarate (D-2HG), a metabolite not normally present in physiological conditions (Dang et al., 2009; Ward et al., 2010). D-2HG is in turn a competitive inhibitor of αKG-dependent enzymes (Xu et al., 2011).
As a result of D-2HG–mediated inhibition, tumors harboring IDH1 or IDH2 mutations show various degrees of hypermethylation of DNA and histone marks, which are thought to mediate the transcriptional changes that block terminal differentiation (Figueroa et al., 2010; Chowdhury et al., 2011; Lu et al., 2012). Interestingly, not all D-2HG effects are pro-oncogenic. In non-IDH mutant leukemias, D-2HG leads to inhibition of the m6A demethylase FTO and consequent hypermethylation and destabilization of Myc transcripts (Su et al., 2018).
Metabolites that accumulate in physiological conditions can also inhibit chromatin-modifying enzymes. For example, fasting-induced increases in circulating levels of the ketone body β-hydroxybutyrate inhibit class I HDACs in multiple tissues and induce expression of genes associated with resistance to oxidative stress (Shimazu et al., 2013). Similarly, short-chain fatty acids such as propionate and butyrate derived from commensal bacterial populations increase histone acetylation and promote extrathymic maturation of regulatory T cells (Arpaia et al., 2013). Low oxygen or acidic pH induce chromatin hypermethylation in part by stimulating the production of L-2HG (the enantiomer of D-2HG) by wild-type forms of malate dehydrogenase or lactate dehydrogenase (Intlekofer et al., 2015, 2017; Oldham et al., 2015). These selected examples illustrate how cell-autonomous and circulating metabolites serve as signals that transmit information about environmental changes to individual cells to induce appropriate adjustments in gene expression.
Metabolic modulation of enzyme activity
To maintain their activity, αKG-dependent dioxygenases use ascorbate (vitamin C) to facilitate regeneration of the required Fe(II) cofactor (Myllylä et al., 1978). Providing exogenous ascorbic acid potently enhances αKG-dependent dioxygenase activity (Knowles et al., 2003). This effect can have biological relevance. Defective collagen production—a process mediated by the αKG-dependent prolyl hydroxylases—is a major driver of the pathogenesis of scurvy. The therapeutic effect of vitamin C–rich fruits in patients with scurvy provides a dramatic example of the relevance of this cofactor for organismal homeostasis (Kuiper and Vissers, 2014).
Tet enzymes are known to be particularly sensitive to ascorbate levels, and multiple studies have shown that variations in ascorbate levels can affect stem cell biology via Tet enzymatic activity (Blaschke et al., 2013; Chen et al., 2013b; Minor et al., 2013; Yin et al., 2013; Cimmino et al., 2017). Metabolite profiling of isolated hematopoietic progenitors revealed 2–20-fold higher levels of ascorbate in hematopoietic stem cells (HSCs) and multipotent progenitors compared with other hematopoietic precursors (Agathocleous et al., 2017). Reducing ascorbate availability in vivo increased HSC frequency and function as a result of impaired Tet2 function. Consequently, ascorbate depletion results in increased leukemogenesis in the setting of a cooperating oncogene (Agathocleous et al., 2017). These collective findings suggest that HSC expansion is favored by the inhibition of Tet2 that results from decreased ascorbate. As such, conditions of ascorbate depletion would mimic loss of Tet2 and favor leukemogenesis in the setting of additional oncogenic events.
Integrating metabolism and gene expression
Transcription factors are required but may not be sufficient for cell fate decisions
Although metabolites may contribute to gene regulation, the ability of a cell to make state or fate changes relies on the expression of specific transcriptional programs. One explanation for how specific biological events lead to activation of appropriate transcriptional programs is through the DNA binding specificity of transcription factors. Transcription factors are hence an essential force determining cell identity, as demonstrated by the pivotal role of MyoD in directing muscle differentiation and factors such as Oct4, Sox2, and Klf4 in directing the acquisition of the pluripotent state (Davis et al., 1987; Takahashi and Yamanaka, 2006). At a first approximation, it is difficult to see how the metabolic changes discussed above can lead to specific transcriptional programs that effect cell specification. The modifications affected by intermediary metabolite levels are often present genome-wide, and the metabolites themselves are not localized to specific genomic loci.
However, gene expression is not simply a binary output driven by the presence or absence of transcription factor binding. Rather, gene expression is the result of multiple inputs including the recruitment of chromatin-modifying activities that deposit activating marks (e.g., acetylation at H3K9 and H3K27 and methylation at H3K4) and remove inhibitory marks (e.g., methylation of H3K9 and H3K27). These marks can in turn enhance gene expression by altering charge-driven nucleosomal contacts or promoting recruitment of additional coactivators. Gene expression therefore represents the integrated output of multiple disparate inputs, including metabolite availability. Taken together, metabolic alterations may influence the probability of expressing a given gene, and, therefore, in aggregate may contribute to the induction or repression of specific gene expression programs.
A major challenge to the hypothesis that metabolites provide important information that links gene expression programs to environmental conditions is the problem of specificity. First, the same chemical modification on different residues can exert vastly different biological effects. For example, H3K9me3 and H3K27me3 are associated with gene repression (Mozzetta et al., 2015), H3K4me1 with gene enhancers, and H3K4me3 with gene expression (Shilatifard, 2012; Rao et al., 2017; Schuettengruber et al., 2017), and yet all of these modifications are derived from SAM. Likewise, the enzymatic removal of the trimethyllysine modification always requires αKG, regardless of the residue being modified. Second, to exert coherent effects on gene expression, marks must be deposited on specific regulatory regions and not distributed randomly genome-wide. How, then, do cells translate global shifts in metabolite availability into specific gene expression programs?
One answer to the first problem may be hardwired in the biochemistry of the enzymes themselves. The KM of each enzyme for its substrate can vary even between closely related members of a protein family, thus providing a hierarchy of enzymes that will be more or less sensitive to fluctuations in the availability of a specific substrate. For example, PCAF and GCN5, which acetylate H3K9 and H3K14 residues, exhibit a KM of ∼0.6 µM for acetyl-CoA, whereas p300, which acetylates H3K27 residues, has a 10-fold higher KM for acetyl-CoA (Jin et al., 2011; Fan et al., 2015). Acetyltransferases also exhibit diversity in their susceptibility to inhibition by free CoA (Montgomery et al., 2015).
In yeast, the H3K4 methyltransferase, Set1, has a comparatively high KM for SAM (Sadhu et al., 2013). Consequently, H3K4me3 is most sensitive to conditions that restrict SAM availability (Shyh-Chang et al., 2013; Shiraki et al., 2014; Mentch et al., 2015). Conversely, whereas most histone trimethyllysine modifications were increased by restricting intracellular αKG pools, H3K4me3 was comparatively unaffected (Carey et al., 2015). These findings lead to the intriguing hypothesis that activating histone modifications (acetylation, H3K4me3) may be sensitive to an ON rate driven by acetyl-CoA and SAM availability, whereas repressive modifications may be sensitive to an OFF rate mediated by αKG pools. In this manner, gene activation may require disparate metabolic inputs that signal nutrient sufficiency. Alternatively, metabolic regulation of individual marks might have context specificity: in addition to H3K4 methylation, H3K9 and H3K27 methylation were responsive to methionine restriction in vitro, although in vivo this could only be seen for H3K4 (Dann et al., 2015; Mentch et al., 2015).
Even more challenging is the question of how metabolites could affect a particular chromatin modification in a site-specific manner. One possibility is that DNA sequence-specific transcription factors enable recruitment of chromatin-modifying complexes that respond to fluctuations in metabolite levels. Similarly, enzymes that produce or consume metabolites may also be directly recruited to specific chromatin loci to influence local substrate concentrations. Alternatively, the local landscape of transcription factors, chromatin-modifying enzymes, and chromatin readers may collectively determine the likelihood of specific marks being deposited or removed.
Although most examples of differential expression programs are a consequence of local changes in chromatin, some biological processes are driven by global alterations in the chromatin landscape. Global erasure of parental methylation patterns occurs early in vertebrate embryonal development and is followed by a transient pluripotent period marked by globally accessible chromatin. This has been observed in a number of species by both electron microscopy and the pattern of chromatin modifications (Ahmed et al., 2010; Borsos and Torres-Padilla, 2016).
Metabolites may play particularly important roles regulating the global chromatin landscape in stem cells, which are characterized by dynamic chromatin regulation. Mouse ESCs exhibit global chromatin hypomethylation (Ficz et al., 2013; Habibi et al., 2013; Hackett et al., 2013; Leitch et al., 2013; Pedersen et al., 2016; von Meyenn et al., 2016), and interventions that elevate αKG levels enhance demethylation and promote self-renewal of mouse ESCs (Carey et al., 2015; Hwang et al., 2016). In contrast, mouse epiblast stem cells and human primed pluripotent cells must undergo demethylation to differentiate. As a result, αKG accelerates differentiation of human ESCs (TeSlaa et al., 2016). This is also the metabolic node that has been coopted by tumor cells harboring IDH, SDH, and FH mutations. By preventing activation of genes associated with differentiation, these mutations lock malignant cells into a dedifferentiated state (Figueroa et al., 2010; Lu et al., 2012, 2013). AML stem cells can promote a similar hypermethylated state in the absence of IDH mutations by elevating activity of branched-chain amino acid transaminase 1, thus restricting intracellular levels of αKG (Raffel et al., 2017). Together, these findings suggest that αKG levels are limiting for cells that must demethylate regions of repressed chromatin to differentiate. Thus, pathological restriction of αKG or inhibition of αKG-dependent enzymes locks cells in a malignant state.
Changes in availability of acetyl-CoA donors for histone acetylation can likewise induce global chromatin alterations defined by alterations in bulk histone acetylation. Both mouse and human pluripotent cells consume high levels of glucose, which can be used to generate nucleocytosolic acetyl-CoA by the combined activities of the tricarboxylic acid (TCA) cycle and ATP-citrate lyase (Fig. 2 B; Zhou et al., 2012; Gu et al., 2016; Flores et al., 2017). Declines in glucose metabolism that accompany differentiation are associated with decreased acetyl-CoA availability and reduced histone acetylation (Moussaieff et al., 2015). Similarly, restricting acetyl-CoA availability via loss of ATP-citrate lyase has dramatic effects on broad gene expression programs, including the ability to respond to growth factors or differentiate (Wellen et al., 2009). Nevertheless, locus-specific regulation still occurs, as not all transcription is shut off when acetyl-CoA is limited. Bromodomain-containing proteins such as Brd4 that recognize acetylated lysines provide one major mechanism through which changes in histone acetylation can influence gene expression (Dey et al., 2003; Jang et al., 2005). The identification of recurrent translocations involving Brd4 in human cancer led to the development of JQ1, a small molecule that impairs the ability of Brd4 to bind acetylated lysine (Filippakopoulos et al., 2010). Although JQ1 has potent inhibitory effects on highly expressed genes such as Myc, many chromatin loci remain active despite the impaired ability of Brd4 to read acetylated histones in the presence of JQ1. Although the mechanisms responsible for this locus-specific JQ1 resistance are unclear, certain genetic loci may have an advantage in recruiting transcription factors and coactivators that mediate specific transcriptional programs (Finley et al., 2018). In support of this hypothesis, tumors driven by mutant IDH1 exhibit globally repressed chromatin and enhanced sensitivity to JQ1 (Chen et al., 2013a), suggesting that metabolic alterations that shape the chromatin landscape may determine sensitivity to perturbations in gene expression networks. This exemplifies the combinatorial effect of global metabolically mediated expression changes and locus-specific recruitment of transcriptional coactivators that collectively define gene expression programs.
Dysregulation in cancer
As previously discussed, mutations in metabolic enzymes result in accumulation of pathological levels of metabolites that can interfere with normal regulation of histone and DNA methylation (Figueroa et al., 2010; Chowdhury et al., 2011; Lu et al., 2012). Even in the absence of such mutations, cancer-associated changes in metabolic networks can induce alterations in the chromatin landscape (Sciacovelli et al., 2016). A recent study of pancreatic adenocarcinoma (PDAC) highlights the global effects of metabolic changes on gene expression in the context of metastatic progression (McDonald et al., 2017). Comparison of primary PDAC samples with liver and lung metastatic samples as well as primary cell lines derived from these tumors revealed widespread reprogramming of chromatin modifications at the level of regions rich in H3K9me3 and DNA methylation (large heterochromatin domains [LOCKs]). These changes occurred in the absence of metastasis-specific driver mutations, arguing for clonal chromatin drivers of metastatic progression. The decrease in repressive marks in LOCKs correlated with increased expression of genes contained in these regions in metastatic lines. Moreover, metastatic lines showed increased activity in the oxidative pentose phosphate pathway at the level of 6-phosphogluconate dehydrogenase (PGD) in glucose tracing experiments. Genetic or chemical inhibition of PGD, the enzymatic step that showed the most increased flux in metastatic lines compared with primary lines, was sufficient to revert the chromatin changes of metastatic lines to those seen in primary tumors. PGD inhibition also led to impaired metastatic ability in metastatic lines but not growth inhibition of the primary lines. These results argue that a clonal metabolic adaptation at the level of PGD leads to chromatin alterations that drive prometastatic behaviors conducive to dissemination and/or growth in highly vascularized organs such as the lung or liver where glucose concentrations are high. Although the direct link between higher PGD activity and loss of heterochromatic marks has not been explored, it is tempting to speculate that metabolic effects like the ones outlined in this review are directly responsible for the observed phenotype.
Another notable example of how master regulators of cellular metabolism can affect gene expression globally was recently outlined in the context of LKB1 mutant pancreatic adenocarcinoma (Kottakis et al., 2016). LKB1, a kinase that activates AMPK, is a tumor suppressor inactivated in a variety of malignancies, including pancreatic adenocarcinoma (Hawley et al., 2003; Woods et al., 2003; Shaw et al., 2004). Using genetically engineered mouse models harboring a KrasG12D and/or LKB1 homozygous loss, Kottakis et al. (2016) noted that a combination of KrasG12D mutation and LKB1 loss led to an mTOR-dependent rewiring of metabolism that resulted in increased activity through the serine-glycine-one carbon network. This in turn resulted in higher levels of SAM, increased global DNA methylation, and transcriptional silencing of retrotransposons. The effect was specific to KrasG12D;LKB1−/− cells and was validated in human pancreatic adenocarcinoma cell lines with this genetic makeup. Moreover, these cells were uniquely sensitive to DNA methyltransferase inhibitors, outlining a potential therapeutic vulnerability. This study exemplifies how changes in the metabolic state of a cell can have global effects on gene expression that are mediated through a change in availability of substrates of chromatin-modifying enzymes.
The interplay between metabolites and gene expression programs that sustain malignancy may provide opportunities for therapeutic intervention. For example, glutamine, which provides many critical metabolic functions that fuel tumor growth (DeBerardinis et al., 2007), also generates the αKG used by mutant IDH to produce 2HG. Consequently, IDH mutant cells are more sensitive than wild-type counterparts to glutaminase inhibitors (Elhammali et al., 2014; Emadi et al., 2014). Glutamine metabolism may affect chromatin even in tumors lacking IDH mutations. A number of studies have reported histone methylation changes after treatment with glutaminase inhibitors, although whether these effects are direct or a downstream consequence of cellular adaptations to the drug remains unknown (Simpson et al., 2012a,b). Reductive carboxylation of glutamine-derived αKG has been shown to contribute to acetyl-CoA pools (Metallo et al., 2011; Mullen et al., 2011; Wise et al., 2011). Therefore, glutaminase inhibitors may also affect gene expression by decreasing citrate pools that maintain acetyl-CoA levels. Hence, the glutamine dependence of cancer cells may reflect its contributions to carbon pools used for both demethylation and acetylation reactions.
Alterations in glutamine levels, a common feature of human tumors (Roberts and Frankel, 1949; Roberts and Tanaka, 1956; Kamphorst et al., 2015), may allow tumor cells to change their chromatin environment in ways that allow escape from oncogene addiction. Studies in a panel of xenograft tumors revealed that glutamine and αKG levels were decreased in the ischemic central areas of tumors compared with the tumor periphery, and these metabolic alterations contributed to increased histone methylation in tumor cores (Pan et al., 2016). Decreased αKG impaired activity of the histone demethylase KDM6B and resulted in H3K27 hypermethylation. In BRAFV600E mutant melanoma cells, low glutamine promoted features of dedifferentiation and resistance to BRAFV600E inhibitors driven in part by H3K27 methylation. Consequently, inhibiting the H3K27 methyltransferase EZH2 restored sensitivity to BRAFV600E inhibition (Pan et al., 2016). Thus, extracellular nutrient changes, in this case as a result of ischemia in poorly vascularized tumors, may have effects on tumor biology that are a result of changes to the chromatin landscape.
Across evolution, fluctuations in nutrient availability influence gene expression programs. Increasing evidence supports the hypothesis that the chromatin landscape of metazoan cells integrates nutrient sensing with transcriptional control to effectively regulate transcriptional programs. These effects of metabolites on gene expression provide a supplemental layer of regulation on top of the more canonical regulators of gene expression, including sequence-specific binding proteins.
Fundamental questions regarding how metabolites affect gene expression programs remain. What enzymes preferentially respond to fluctuations in metabolite availability? Can physiological changes in extracellular nutrient availability alter levels of intracellular metabolites to a degree sufficient to affect chromatin modifications? What are the factors that control local generation of metabolites that serve as substrates or inhibitors of chromatin modifying enzymes? How do specific chromatin regions become sensitive (or resistant) to effects of metabolism when similar chromatin marks are present genome wide? Can the chromatin-modifying enzymes that link intermediary metabolites with the recruitment of RNA polymerase be pharmacologically targeted for therapeutic effect in cancer or other disease states? Continued technological developments will aid the pursuit of these questions. Measures of in vivo metabolic activity of specific cell populations, determination of metabolite concentrations in subcellular compartments, acute genetic manipulation of developing tissues, and sequence-specific targeting of chromatin-modifying enzymes will collectively enable direct assessment of the extent to which changes in metabolite availability influence chromatin modifications, gene expression, and cell fate decisions in mammalian cells.
We thank members of the Thompson and Finley laboratories for discussion.
J.M. Schvartzman is supported by a Hope Funds for Cancer Research Postdoctoral Fellowship. L.W.S. Finley is a Dale F. Frey-William Raveis Charitable Fund Scientist supported by the Damon Runyon Cancer Research Foundation (DFS-23-17). This work was additionally supported by the Memorial Sloan Kettering Cancer Center Support Grant P30 CA008748.
C.B. Thompson is a founder of Agios Pharmaceuticals and a member of its scientific advisory board. He also serves on the board of directors of Merck and Charles River Laboratories.
The authors declare no further conflict of interest.
J.M. Schvartzman, C.B. Thompson, and L.W.S. Finley wrote the manuscript.