Tremendous variety in form and function is displayed among the intracellular membrane systems of different eukaryotes. Until recently, few clues existed as to how these internal membrane systems had originated and diversified. However, proteomic, structural, and comparative genomics studies together have revealed extensive similarities among many of the protein complexes used in controlling the morphology and trafficking of intracellular membranes. These new insights have had a profound impact on our understanding of the evolutionary origins of the internal architecture of the eukaryotic cell.
Introduction
The elaborate internal membrane compartments of modern eukaryotic cells are testament to both the functional flexibility and the selective advantage that these structures provide, despite the significant energetic burdens that building such structures places on an organism. Although the presence of internal membrane compartments is not strictly restricted to eukaryotes (Stolz, 1998; Seufferheld et al., 2004; Fuerst, 2005), the sheer number and complexity of these compartments is unique to the Eukaryota (Fig. 1). Yet, a prokaryotic progenitor lacking elaborate internal membranes clearly evolved into modern eukaryotes. How did this transition occur?
Intracellular organelles could originate via two mechanisms: (i) exogenously, by acquiring a preformed biological structure, i.e., endosymbiont; or (ii) autogenously, from preexisting intrinsic structures, requiring duplications of existing genes and subsequent acquisition of new functions in the duplicates (Margulis, 1993; Martin, 1999). Both mechanisms undoubtedly contributed to eukaryotic origins, and it is accepted that the mitochondrion has an exogenous origin, probably through a single endosymbiosis event (Fig. 1; Tovar et al., 1999, 2003; Dolezal et al., 2005). Similarly, the chloroplast arose by endosymbiosis, initially from a cyanobacteria and then with multiple subsequent endosymbiotic events in the different lineages (Gould et al., 2008). Both chloroplasts and mitochondria have experienced significant secondary reduction (Wilson et al., 1996; Funes et al., 2002; McFadden, 2010). However, all remaining internal organelles, i.e the endoplasmic reticulum, Golgi complex, vacuoles/lysosomes, and the nucleus most likely have autogenous origins (Martin, 1999).
Among the huge diversity in endomembrane compartments, some of the most striking and well-characterized examples are found in protists. This diversity has arisen by several distinct routes. First is innovation, whereby duplication of a preexisting organelle allows a new function; for example, the malaria parasite possesses three specialized regulated exocytic organelles that appear to have arisen through such innovation, and discharge their contents in a strict order to facilitate host cell invasion (Kats et al., 2008). Second is sculpting, where membrane compartments are simplified by secondary loss; in Toxoplasma gondii ER exit sites are fused to the nuclear envelope (Hager et al., 1999). Third is modification, where compartment function is modified, but retains original functions; thus trypanosome peroxisomes have become glycosomes, and acquired the glycolytic apparatus (Hart et al., 1984; Michels and Opperdoes, 1991), whereas the exocytic pathway of Emiliana huxleyi can export colossal cargoes such as calcium carbonate coccoliths that are of similar diameter to the cell itself (Marsh, 1999). More complex patterns, likely from a combination of processes, occur in Giardia lamblia, where biosynthetic and degradative processes occupy the same compartment (Abodeely et al., 2009). A remarkable feature of these pathways is that many of the large number of proteins involved in them, representing some 5–10% of the total cellular proteome (Koumandou et al., 2008), can be organized into a comparatively small number of families. Taking a mainly molecular view, we consider several questions concerning origins and subsequent evolution of membrane trafficking systems at the level of the protein player, specifically: What are the protein cohorts that form the machineries of membrane compartments? What are the molecular details of the membrane-manipulating machines? Are these machineries related to each other and if so, how? Why, and from where, did the membrane compartment–forming machines evolve? Have any of these systems come to be dominant, and if so, why?
What are the protein cohorts that form the machineries of membrane compartments?
Based on extensive and growing comparative genomics, the basic configuration of the endomembrane system was established early in eukaryotic evolution. Indeed, when the major eukaryotic lineages of plants, protists, amoeba, and animals/fungi appeared about one and one half billion years ago the eukaryotic cellular bauplän was (remarkably) already established. By several measures the last eukaryotic common ancestor (LECA) possessed a degree of complexity well beyond basic exocytic, endocytic, and phagocytic routes essential for delivery of material to the cell surface or for uptake of external material (Dacks and Field, 2007; Field et al., 2007).
The endomembrane compartments and the vesicles and tubules carrying cargo proteins, lipids, and other molecules between them are generated by membrane-deforming proteins and complexes. These complexes also specify membrane and organellar identity, and together with proteins facilitating interactions between the organelles and transport intermediates with the cytoskeleton, facilitate positioning and movement within the cell. The major players coordinating vesicle budding, docking, and fusion include the coiled-coil SNARE proteins, Ras-GTPase domain containing Rabs, ARFs, and ARLs, and the CATCHR family of tethering complexes, as well as the proteins that coat and deform the membranes. Remarkably, these proteins comprise several paralogous families, underscoring mechanistic commonalities sitting at the heart of many transport steps (Jékely and Arendt, 2006; Dacks and Field, 2007; Koumandou et al., 2007; Satir et al., 2008; Vedovato et al., 2009; Brighouse et al., 2010; Elias, 2010).
The mechanistic commonalities between the machineries acting on membrane compartments reflect both basic thermodynamic requirements and deep evolutionary relationships. All eukaryotic membrane manipulation ultimately involves a coating event followed, in most cases, by fission to create a transport intermediate. Frequently this process is controlled by Rab or ARF proteins (Stenmark, 2009; Brighouse et al., 2010; East and Kahn, 2011). Recruitment of coat proteins serves to provide energy and a scaffold for membrane deformation. Most coat proteins assemble into a curved lattice; oligomerization and tight interaction with membrane lipid provides part of the thermodynamic force required to achieve fission (Manneville et al., 2008; Pinot et al., 2010).
Remarkably, the known protein-based membrane-deforming systems can be grouped into just three major structural classes (Fig. 1, and see Fig. 3): (i) the ESCRT complex, including Snf7 domain subunits, which operates in cytokinesis and late endosomal sorting (Hurley and Emr, 2006; Carlton and Martin-Serrano, 2007; Morita et al., 2007; Raiborg and Stenmark, 2009; Hurley and Hanson, 2010); (ii) BAR domain proteins functioning mainly at the Golgi–endosomal interface but also in phagocytosis and cytokinesis (Gallop and McMahon, 2005; Ren et al., 2006; Zhao et al., 2011); and (iii) the coat complexes, which mostly act in exocytosis and endocytosis. These, which include the COPI, COPII, HOPS/CORVET, SEA (Seh1-associated), and clathrin complexes (see below), are important for the production of transport vesicles and are recruited at the early stages of vesicle formation. Additionally, membership of this coat protein complex family extends to proteins that do not form transport vesicles but coat or associate with membranes, such as the nuclear pore complex (NPC) core scaffold proteins and intraflagellar transport (IFT) complexes (Devos et al., 2004, 2006; Jékely and Arendt, 2006). All of these systems were present in LECA, with molecular configurations predicted to be essentially indistinguishable from modern cells. Interestingly, many adaptor proteins that bring coat complexes to specific membrane microdomains, for example Epsin/AP180, were also present in LECA, albeit in a simpler configuration (Holstein and Oliviusson, 2005; Gabernet-Castello et al., 2009). There are also several lineage-specific instances of later evolution in specific taxa, including the RON complex of T. gondii, which is involved in membrane deformation and appears structurally unrelated to other coats (Alexander et al., 2005), and caveolin, acting in clathrin-independent endocytic mechanisms, and specific to metazoan organisms (Field et al., 2007). Importantly, although there may exist unidentified coat/membrane-deforming systems, most of the major families of players have, in all likelihood, been identified.
A structural relationship between multiple components of membrane coats, specifically COPI and clathrin/adaptin complexes, has been recognized for some time (Boehm and Bonifacino, 2001). Both complexes contain proteins consisting of one or two iterations of a β-propeller fold, an α-solenoid–like fold, or both in the order β-α (Devos et al., 2004). More recently, this architecture was recognized in the NPC, COPII, and likely other coating complexes, even extending to Sec13, being a bona fide subunit shared by both COPII and the NPC. Significantly, in the heart of the NPC is a “core scaffold” assembly entirely composed of β-propeller and α-solenoid–like proteins, and comprising ∼50% of the NPC mass. Based on these similarities the protocoatomer hypothesis proposed that NPCs and clathrin, COPI, and COPII vesicle coats share a common evolutionary origin in an early membrane-curving module, the “protocoatomer” (Fig. 2; Devos et al., 2004, 2006; Dokudovskaya et al., 2006; Field and Dacks, 2009). This hypothesis has been extended to IFT, SEA, and HOPS/CORVET complexes (Jékely and Arendt, 2006; Nickerson et al., 2009; Dokudovskaya et al., 2011). Protein structures determined by x-ray crystallography and electron microscopy have strongly supported the hypothesis, and greatly reduce the likelihood that these complexes arose via convergent evolution (Fig. 3; Fotin et al., 2004; Wilbur et al., 2005; Hsia et al., 2007; Debler et al., 2008; Stagg et al., 2008; Brohawn and Schwartz, 2009; Brohawn et al., 2009; Kampmann and Blobel, 2009; Leksa et al., 2009; Nagy et al., 2009; Seo et al., 2009; Whittle and Schwartz, 2009; Lee and Goldberg, 2010; Sampathkumar et al., 2011). Furthermore, similar architectures are shared with additional complexes associated with coating systems, though their evolutionary origins are currently unclear; for example, the α-solenoid is present in NPC-interacting karyopherins (Devos et al., 2006) and even subunits of the retromer complex, involved in Golgi/late endosomal transport (Cook et al., 2007; Collins, 2008).
The presence of multiple paralogous families in membrane-targeting systems and their importance in defining organelle identity provides a potential modular route for evolution of new compartments, as described by the organellar paralogy model (Dacks and Field, 2007). The model proposes that individual components arise by paralogous expansion and may function within preexisting complexes, but diverge into new complexes by step-wise subunit replacement/sequence divergence, creating diversity and new compartments. Organellar paralogy suggests this ratchet-like mechanism as far more flexible for the diversification of new organelles than defining organelles with disparate groups of unrelated proteins or requiring coevolution of a large cohort of proteins to produce new specificity (Dacks and Field, 2007).
What are the molecular details of membrane-manipulating machines?
Atomic structures for many components of the protocoatomer complexes have been determined experimentally or predicted by comparative structure modeling (Fig. 3), and encompass whole proteins, domains, or subcomplexes. In some cases these components have been assembled into coat lattice models based on electron density maps, protein contacts determined from proteomics and biochemistry, and molecular modeling. The clathrin/adaptin (Fotin et al., 2004), COPII (Fath et al., 2007), and COPI (Lee and Goldberg, 2010) complexes have an outer lattice made of coat proteins anchored to the membrane via adaptor proteins. Complexes of other membrane coats have not yet been determined at atomic resolution, though portions of some, e.g., sections of the Nup84 complex of the NPC, have been modeled (Kampmann and Blobel, 2009).
These coat proteins consist of one or more domains of β-propeller and α-solenoid–like (Fig. 3). The β-propellers have been determined or predicted to consist of four to eight blades of four β-strands arranged in a barrel (Chaudhuri et al., 2008; Smith, 2008), whereas the α-solenoids consist of a varying number of α-helices in a more or less compact arrangement (Kobe and Kajava, 2000; Karpenahalli et al., 2007). Intriguingly, the order of the domains tends to be conserved, with one β-propeller domain preceding the α-solenoid domain (Devos et al., 2004, 2006); some sequences, such as β′-COP, have two β-propeller domains (Lee and Goldberg, 2010), and additional coat proteins, e.g., IF122, are predicted to also have this configuration (Jékely and Arendt, 2006).
As an aside, a consideration of the α-solenoid–like proteins illustrates general problems with assigning evolutionary relations between proteins of similar fold types. Thus, it is notable, and confusing, that the term α-solenoid is not used consistently (Murzin et al., 1995; Orengo et al., 1997; Groves and Barford, 1999; Kobe and Kajava, 2000; Andrade et al., 2001; Devos et al., 2004; Söding et al., 2005; Karpenahalli et al., 2007). Initially, α-solenoids were defined to consist of repeating helix-turn-helix motifs, with these units arranged into a continuous, rather noncompact superhelix (Kobe and Kajava, 2000). Since then, (i) additional structures with irregular helical repeats have been classified as α-solenoids; (ii) the SCOP database completely omits the term α-solenoid, coming closest to the original definition with “alpha-alpha-superhelix” fold (Murzin et al., 1995); and (iii) the CATH database applies the term α-solenoid architecture to a compact irregular stacking of helices (exemplified by peridinin chlorophyll protein; Protein Data Bank accession no. 1PPR) and uses the term α-horseshoe for folds that are sometimes classified as α-solenoids (Orengo et al., 1997). Moreover, the relationship between and even within the various α-solenoid superfamilies (e.g., TPR, ARM, and HEAT) can be difficult to detect due to limitations in the current sequence and structure comparison methods, such as DALI (Holm and Rosenström, 2010) and HHpred (Söding et al., 2005). However, it is clear from studies combining both inspection and computation that even highly dissimilar protein structures can be related. For example, according to SCOP, the ARM-repeat and TPR-like superfamilies belong to the same “fold” (i.e., α-alpha superhelix). These difficulties in classifying α-helical folds illustrate a general point about relatively high plasticity of protein structures (Petrey and Honig, 2009) and led us to apply the α-solenoid term relatively loosely to α-helical proteins that form a superhelical structure and are not clearly members of other canonical α-helical folds; α-solenoids could also be called α-helical scaffolding proteins or stacked α-helical proteins. Thus, divergent evolution and common function are not automatically implied for all α-solenoids. Instead, convergent evolution needs to be considered as an explicit alternative to the divergent evolution on a case-by-case basis, as illustrated by the study of the Nup84 complex of the NPC (Devos et al., 2004).
Despite apparent conservation of fold types in the coating complexes there is generally no significant similarity at the amino acid sequence level between components of the different complexes. Even at the structural level, domains of the same fold type can differ in size and shape, including the number and packing of secondary structure segments (Fig. 3). For example, COPI, COPII, clathrin/adaptin, and the Nup84 complex contain β-propeller domains with seven blades. In a few cases the seventh blade is borrowed from a neighboring protein; in Sec13 the seventh blade comes from Nup145 and Sec31 in the Nup84 complex and COPII, respectively, whereas the seventh blade of Seh1 comes from Nup85 in the Nup84 complex (Fath et al., 2007; Hsia et al., 2007; Debler et al., 2008; Brohawn et al., 2009). Other putative coatomers, specifically several components of the IFT complexes, are predicted to contain β-propellers with four to eight blades (unpublished data). The α-solenoid–like domains of coating complexes can vary even more significantly, from relatively compact structures of Nup85 and Nup84 to the highly extended “coils” of ∼1,700 residues in the heavy chain of clathrin (Fig. 3).
A coat lattice is defined by its vertices and connecting edges. The packing of coat proteins into a lattice varies greatly between coat complexes (Fig. 3). Although the edges invariably involve α-solenoid–like domains, vertices can be composed of β-propeller (COPI, COPII) or α-solenoid–like domains (clathrin). For example, the repeat unit of clathrin is a triskelion, consisting of three clathrin heavy chain edges that meet at the vertex through their C-terminal α-solenoid domains. In contrast, the COPII repeat unit consists of four Sec13/Sec31 edges that meet at the vertex through the N-terminal β-propeller domains of Sec31. In some respects, the COPI lattice may be intermediary between clathrin and COPII lattices, and was modeled to consist of three edges like clathrin, but with the vertex corresponding to the packing of β-propellers, like COPII (Lee and Goldberg, 2010). Most significantly, a single lattice architecture can stabilize membranes of differing curvature and hence vesicles of different sizes, as illustrated by the varied clathrin/adaptin vesicles (Fig. 3); here the different curvatures are achieved by varying packing angles between the edges of neighboring triskelions, while maintaining each individual triskelion structure, including the vertices, essentially unchanged (Fotin et al., 2004). In contrast, expansion of the COPII cage from the cuboctahedron to the icosidodecahdron is associated with the plasticity of the vertices rather than the edges, made possible by relatively limited physical contacts between the β-propeller domains forming these vertices (Stagg et al., 2008). Although the molecular details are still not known, it is clear that the Nup84 complex forms a pair of head-to-tail rings in each NPC rather than forming a homo-oligomeric cage (Alber et al., 2007a,b), again underscoring the plasticity available to this family of coating proteins. Evolutionarily the coat architecture benefits from the ability of α-solenoid–like domains to relatively easily add or delete helices or to change their packing (Kobe and Kajava, 2000; Devos et al., 2004; Karpenahalli et al., 2007; Forwood et al., 2010). Thus, it appears that the β-propeller and α-solenoid domains are permissive for evolving a large variety of geometries. This flexibility is consistent with these folds being involved in additional cellular processes; for example, β-propeller domains participate in G protein signal transduction and chromatin assembly (Chaudhuri et al., 2008; Smith, 2008).
The α-helical domains of several ESCRT and retromer components are of altogether distinct fold types as compared with each other and the α-solenoid–like protein coats (above; Fig. 3). Crystallographic studies revealed the BAR (Bin/Amphiphysin/Rvs-homology) domain as α-helix rich, with a core six-helix bundle (Masuda and Mochizuki, 2010). BAR domains form dimers with long extended curves and appear to be outstandingly suitable modules for enabling the formation of tubes or buds from a nearly flat membrane. The elongated, banana-shape structure has a positively charged concave surface of 22-nm diameter that is directly responsible for generating and/or sensing a highly curved membrane. The 18 known atomic structures reveal subtle variations in dimerization angles and helical kinks, adapting the BAR architecture to a variety of membrane-sculpting functions.
Six subcomplexes of the ESCRT complex were recently reviewed, relying largely on crystallographic and EM studies (Hurley, 2010). These subcomplexes vary in domain content, overall organization, and function (Ghazi-Tabatabai et al., 2008). For example, the yeast ESCRT-0 complex is a heterodimer of Hse1 and Vps27 that interact with each other via an elongated anti-parallel coiled-coil motif and swapped GAT domains. Hse1 also contains ubiquitin-binding VHS and UIM domains (8 helix bundles) as well as an SH3 domain (a β-barrel). Vps27 contains a membrane-binding FYVE domain that interacts with phosphatidylinositol 3-phosphate in the membrane (a zinc finger with two Zn2+-binding clusters, two double-stranded antiparallel β sheets, and a C-terminal α-helix), one VHS and two UIM domains, and a clathrin-binding domain. The C termini of both proteins are unstructured and contain a number of interaction motifs for other partners. Although comparatively little is known about ESCRT and retromer quaternary structures, components of both the BAR and ESCRT represent intricate, essentially nonrepetitive fold structures dedicated to forming a very particular set of shapes.
From where did the membrane compartment–forming machineries evolve?
Of the three known membrane-bending mechanisms, only the ESCRT system can be unequivocally traced back to prokaryotes (Samson et al., 2008), and this is restricted to the membrane deformation ESCRT III subunits and Vps4, the associated ATPase. However, we can likely infer that the protocoatomer and BAR domain proteins arose from prokaryotic precursors, but these have yet to be identified due to weak sequence similarities. Higher order structural data may reveal these relationships, but even a structure-based argument is complicated by the ubiquity of β-propeller, α-solenoid–like, and α-helical bundle (BAR-like) domains across both eukaryotes and prokaryotes (Kobe and Kajava, 2000; Karpenahalli et al., 2007; Chaudhuri et al., 2008; Smith, 2008). Nevertheless, characteristic combinations of these domains uniquely define membrane-deforming complexes, so the absence of such combinations from the vast majority of prokaryotes argues for an origin approximately coincident with the first eukaryotic common ancestor (FECA).
Recent findings have blurred the distinction between true coat complexes that build membrane-deforming lattices and tethering complexes, traditionally considered to be involved in vesicle docking. The HOPS/CORVET complex is classed as a tethering complex, but possesses β/α-domain subunits, recognizes curved membranes, acts in late endocytic targeting, is subject to sophisticated kinase-mediated control (Nickerson et al., 2009; Cabrera et al., 2010), and is a near-universal eukaryotic feature (Koumandou et al., 2007). It is unknown if HOPS/CORVET forms a lattice contributing to formation of transport intermediates, or is restricted to participation in the later stages of vesicle transport. Similarly, IFT complexes are found in essentially all eukaryotic flagellates and form raft-like structures transporting molecules along the flagellar membrane; the structural parallels of several IFT subunits to protocoatomer proteins are striking (Jékely and Arendt, 2006; Satir et al., 2008), but it is unclear if the IFT rafts act to coat and deform membrane or act in some other manner. It has yet to be determined if the newly characterized SEA complex, with some structural similarities to the HOPS/CORVET complex, is involved in tethering, coat formation, or another function (Dokudovskaya et al., 2011). Remarkably, since LECA, there has apparently been little innovation of new vesicle coat and membrane deformation systems (Dacks and Field, 2007).
In addition to movement between endomembrane compartments, eukaryotes require membrane deformation for multiple functions, including cell division and replication of both endosymbiotic and autogenously derived organelles. Also, several compartments require precise delineation of membrane subdomains; for example, the Golgi complex and fenestrations within ER membrane networks. There are several examples of specific budding processes taking place at specialized subdomains, including the ER and Golgi complex (Ladinsky et al., 1999; Budnik and Stephens, 2009), but a potent example is the nuclear pore complex (Fig. 1; Alber et al., 2007a,b).
Both the nuclear envelope and phagocytosis are ancient eukaryotic features (Cavalier-Smith, 2009, 2010; DeGrasse et al., 2009), and established in the transition between FECA and LECA. However, there are no molecular components specific to either the nuclear envelope or the phagocytic system with clear prokaryotic origin. The presence of internal membranes associated with the nucleoid in the Planctomycete bacterium Gemmata obscuriglobus, a structure segregating DNA from the remaining cytoplasm, stands as a good example of an apparently nuclear-like compartment in a prokaryote (Fuerst, 2005). More recently, parallels between the nucleoid and the nucleus were possibly extended by detection of possible nucleoid-associated proteins with a putative protocoatomer β-α architecture in these bacteria (Santarella-Mellwig et al., 2010). However significant to eukaryotic origins this proves, our view of the origins of endomembrane coats is incomplete, as protocoatomer proteins are not found in the majority of prokaryotes including Archaea. If major components of an ancestral membrane deformation machine were carried over from prokaryotes, then the absence of a conventional phagocytic/endocytic system from most prokaryotes is problematic, as many models assume that phagocytosis accounts for the initial endosymbiotic event (Embley and Martin, 2006). Further, the eukaryotic flagellum is also recognized as a feature of the very earliest eukaryotes and probably predates LECA, but again lacks an obvious prokaryotic molecular signature. Thus, where the β-α coating module originated from preLECA (and preFECA) is still unclear, and given its tremendous structural plasticity may even have been repurposed from a protein originally used for some prokaryotic function now lost to eukaryotes.
Why did membrane trafficking machinery evolve?
To grow and develop, all cells require their membranes to be dynamic and moldable. At its most basic, these features are required for the post-mitotic separation of daughter cells at cytokinesis. In addition, cells also benefit from packaging their interior into more efficient and/or functionally specialized compartments. Cell division represents a fundamental rationale for bending membrane, as even the most primitive cell has a requirement to replicate. Unsurprisingly then, these processes retain significant molecular conservation between prokaryotes and eukaryotes, which were overlooked by earlier studies due to very low sequence similarities. The evolutionary relationships were instead exposed by the structures of several prokaryotic and eukaryotic players, including FtsZ, the prokaryotic tubulin orthologue essential for the scission steps in cytokinesis, FtsA/MreB, an actin orthologue (Graumann, 2007), and bacterial dynamin-like proteins (BDLP; Low and Löwe, 2006), for which the ancestral function was likely also in cytokinesis (Miyagishima et al., 2008). All of these examples share less than 20% sequence identity with their eukaryotic orthologues but are structurally quite similar. Significantly, there is apparently more than one cytokinesis mechanism present in prokaryotes. For example, in the Archaeon Sulpholobus ESCRT III complex (Vps2 and Vps4) orthologues are essential for membrane scission (Graumann, 2007; Lindås et al., 2008; Samson et al., 2008). Interestingly, ESCRT- and FtsZ-mediated cytokinesis systems seem to be mutually exclusive. Significantly, ESCRT retains the function in cytokinesis in eukaryotes, being targeted to and extending the cleavage furrow in metazoan cells (Makarova et al., 2010). Therefore, the roles of DLPs, FtsA/actin, FtsZ/tubulin, and ESCRT components in the endomembrane system may represent acquisition of expanded functions (Low and Löwe, 2006), in addition to partly (or wholly) retaining the original roles in cytokinesis.
There remain two dominant roles for the original eukaryote-specific function of the endomembrane system; phagocytosis and establishment of the nucleus (Cavalier-Smith, 2002; Martin and Koonin, 2006). Both roles require compartmentalization of the cellular interior, and there are compelling arguments for each as the first to have arisen. Specifically, the selective advantage that phagocytosis offers a primitive eukaryote is significant, as it would be able to engulf its competitors and is a presumed precondition for acquisition of endosymbionts. Conversely, segregation of genetic material from the cytoplasm separates transcription and translation, protecting the genetic material and facilitating increasingly sophisticated mechanisms for gene control. Thus, regardless of the precise order of events at the dawn of the eukaryotes, there is compelling evidence for why eukaryotes became compartmentalized.
Why has one system come to dominate the eukaryotic cell?
Structurally, the membrane coat repertoire is dominated by one class, namely the β-propeller/α-solenoid–like proteins (Fig. 2). Like the scales of our ancestral primitive fish have evolved into teeth, horns, feathers, and fur, so the original “protocoatomer” has diverged widely into many complexes, including some such as the NPC with roles seemingly far removed from the original. Why did these systems come to dominate the eukaryotic cell, instead of other complexes that are clearly capable of deforming membranes? As the major known role for these proteins is forming scaffolds, we presume there to be some aspect of their molecular architecture especially well suited to adapting them to scaffolding a variety of different membranes. This selective advantage would allow expansion and diversification of this family, over and above the other membrane deformation systems.
BAR domain proteins assemble principally as dimers, and the individual subunits are comprised of helical bundles assembled into a banana shape (Fig. 3). Although BAR domain curvature can vary somewhat, the architecture of the BAR domain is relatively invariant (Masuda and Mochizuki, 2010). For ESCRT, only a single set of coat proteins is encoded in the genome (Leung et al., 2008), and hence only a single architecture is possible (Fig. 3). Therefore, for both BAR and ESCRT, there appears to be limited scope for structural variation; they are locked into a single molecular architecture. By contrast, the β/α coat proteins are more like a molecular LEGOTM (Fig. 3). Both α-solenoids and β-propellers are repetitive, making addition or removal of elements potentially facile in evolutionary terms with significant variation in primary structure allowed without disrupting the overall conformation of the protein, but instead varying the number of repeats. The α-solenoid is highly variable in length, with for example clathrin being approximately twice the molecular weight of Nup133 (Fig. 3). Though the β-propeller is inherently more constrained than the α-solenoid, the four to eight blades provide multiple sites for protein–protein interactions, and allow incorporation of additional loops between each compact β-sheet blade (Fig. 3).
This is consistent with comparative genomics allowing us to propose a configuration for the membrane-trafficking system in LECA (Dacks and Field, 2007; Field and Dacks, 2009). First, the full ESCRT I/II/III system was present but subsequent retention of the complex was then variable between lineages, with evidence for secondary loss of several of the ESCRT III and III-associated subunits (Leung et al., 2008). Second, although BAR domains are universally represented and present in LECA, many organisms have very small families with Trypanosoma for example having a single retromer-associated BAR domain protein (Koumandou et al., 2011). Lastly, the β-propeller/α-solenoid–like coat family contrasts sharply with BAR and ESCRT in their extensive representation; clathrin, COPI, COPII, IFT, the NPC, and HOPS/CORVET are all present in nearly every eukaryotic lineage we have examined, and therefore were established as independent complexes before LECA. This suggests that post-FECA organisms found in the protocoatomer family an evolutionary flexible template for development of the multiple mechanisms and compartments required to build a highly sophisticated and adaptable endomembrane system.
Conclusions
The emergence of eukaryotes is probably the most dramatic evolutionary transition after the origin of life, and represents a revolution in cellular architecture, gene expression mechanisms, and many other features, but with obvious continuity with the prokaryotic ancestors. The last eukaryotic common ancestor (LECA), which probably lived about one billion years ago, was a very sophisticated organism and bore many innovations specific to eukaryotes. Much of the innovation therefore predates LECA, and must be placed within the transitional period between FECA and LECA. Representing the most extreme bottleneck possible, LECA is the sole representative known to have survived from the transitional period. Basic metabolic pathways and biosynthetic mechanisms are excellent examples of persistent prokaryotic contributions, whereas membrane transport and the construction of intracellular compartments probably represents the major eukaryotic innovation, and facilitated massive increases to cellular complexity and functional flexibility. Few systems seem directly derived from prokaryotes, with the exception of essential roles in cytokinesis and perhaps endosymbiotic organelle replication. The majority of intracellular membranes are likely a result of evolutionary expansion of an ancestral protocoatomer complex and also additional paralogous families, which occurred pre-LECA. We suggest that the complex diverged massively to produce nuclear pore complexes, intraflagellar transport, and a great proportion of the endomembrane system. Structural flexibility, allowing incorporation of small, functionally significant variations within these proteins may explain how these proteins came to dominate the endomembrane system. Significantly, this provides a potent example of natural selection operating to select for functionality, and hence fitness, within the internal workings of the cell, rather than, as is more traditionally regarded, between organisms or populations.
The progress in the last decade in our understanding of the origins of many of the structures present within eukaryotes has been astonishing. These cellular features have left virtually no geological record, yet genome sequencing together with molecular modeling and solution of the molecular and atomic structures of key components has given us many of the necessary tools and evidence to reconstruct a probable trajectory of eukaryotic cellular evolution. Direct analysis of trafficking systems in diverse but evolutionarily key organisms is essential in providing an unbiased view of how present diversity arose, and may identify novel components absent from more traditional model organisms, further increasing our appreciation of the range of diversity produced by evolution, together with providing an improved grasp of the cell biology of many pathogens and ecologically important organisms. Similarly, structural biology efforts will take phylogenetic comparisons to the atomic level, revealing structural similarities that elude sequence-based methods. Taken together, these approaches should continue to revolutionize our view of how the eukaryotic cell came to be.
Acknowledgments
We thank Joel B. Dacks for many fruitful discussions on eukaryotic origins and trafficking, and Jeremy Phillips for help preparing Figure 3. Finally, we apologize to the very great many people whose primary work we were unable to cite due to limitations in space.
Work in the authors’ laboratories is supported by the following agencies; the Wellcome Trust, the Medical Research Council (UK), and the National Institutes of Health (Bethesda, MD).