

MBE Advance Access originally published online on August 7, 2008
Molecular Biology and Evolution 2008 25(11):2255-2267; doi:10.1093/molbev/msn175

© The Author 2008. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Evolutionary Diversification of the Sm Family of RNA-Associated Proteins

Douglas G. Scofield1 and Michael Lynch

Department of Biology, Indiana University

E-mail: dgscofield@ucla.edu.


    Abstract
 TOP
 Abstract
 Introduction
 Phylogenetic Relationships among...
 Sm Protein Diversification in...
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
The Sm family of proteins is closely associated with RNA metabolism throughout all life. These proteins form homomorphic and heteromorphic rings consisting of six or seven subunits with a characteristic central pore, the presence of which is critical for binding U-rich regions of single-stranded RNA. Eubacteria and Archaea typically carry one or two forms of Sm proteins and assemble one homomorphic ring per Sm protein. Eukaryotes typically carry 16 or more Sm proteins that assemble to form heteromorphic rings which lie at the center of a number of critical RNA-associated small nuclear ribonucleoproteins (snRNPs). High Sm protein diversity and heteromorphic Sm rings are features stretching back to the origin of eukaryotes; very deep phylogenetic divisions among existing Sm proteins indicate simultaneous evolution across essentially all existing eukaryotic life. Two basic forms of heteromorphic Sm rings are found in eukaryotes. Fixed Sm rings are highly stable and static and are assembled around an RNA cofactor. Flexible Sm rings also stabilize and chaperone RNA but assemble in the absence of an RNA substrate and, more significantly, associate with and dissociate from RNA substrates more freely than fixed rings. This suggests that the conformation of flexible Sm rings might be modified in some specific manner to facilitate association and dissociation with RNA. Diversification of eukaryotic Sm proteins may have been initiated by gene transfers and/or genome clashes that accompanied the origin of the eukaryotic cell itself, with further diversification driven by a greater need for steric specificity within increasingly complex snRNPs.

Key Words: Sm protein • RNA processing • snRNP • multimeric proteins • trans-splicing • spliceosome


    Introduction
 
Members of the Sm family of proteins, which encompasses the Sm and Sm-like (Lsm) proteins (Séraphin 1995), are common participants in RNA metabolism in Eubacteria (Valentin-Hansen et al. 2004), Archaea (Salgado-Garrido et al. 1999; Mura et al. 2001), and eukaryotes (Mattaj and Derobertis 1985). Sm proteins primarily occur as small (~9–29 kDa) stand-alone proteins lacking other domains (Anantharaman et al. 2002; for an exception see Pillai et al. 2003) that assemble to form characteristic homomorphic or heteromorphic rings containing six or seven proteins. Members of the family are characterized by the conserved bipartite Sm domain or "Sm fold" which functions, at least in part, in binding to neighboring Sm proteins within such rings (Box 1 and Hermann et al. 1995; Séraphin 1995; Khusial et al. 2005). One highly conserved characteristic of Sm rings is the direct interaction of the central pore of the ring with short uracil-rich stretches of RNA, in both prokaryotes (Box 1 and Törö et al. 2001; Schumacher et al. 2002; Mura, Kozhukhovsky et al. 2003; Thore et al. 2003) and eukaryotes (Branlant et al. 1982; Liautard et al. 1982; Urlaub et al. 2001; Khusial et al. 2005). The Sm family in eukaryotes has undergone considerable diversification, with a variety of heteromorphic Sm rings participating within many RNA-processing pathways and snRNP complexes (Anantharaman et al. 2002; Khusial et al. 2005; Wilusz CJ and Wilusz J 2005). Some multidomain proteins involved in RNA processing carry divergent Sm domains but have not been found within Sm rings (Albrecht and Lengauer 2004; Albrecht et al. 2004; Anantharaman and Aravind 2004; Fleischer et al. 2006; Tritschler et al. 2007); here we focus on Sm protein members of Sm rings.


Box 1. Sm protein structure
Box 1 FIG.—Sm protein structure. (A) Sequence alignment of Sm proteins SmD1 and SmD3 in human, Drosophila melanogaster, Schizosaccharomyces pombe, and Arabidopsis thaliana. RG-rich regions characteristic of the 3'-ends of these particular Sm proteins have been removed. (B) Human SmD1 protein structural model (from PDB ID 1B34, Kambach, Walke, Young, et al. 1999). Green, light blue, brown, and red regions of the model correspond to the Sm motif and RNA-binding site sequences as shown in (A).

Sm proteins are small proteins characterized by the presence of the highly conserved Sm fold containing two conserved motifs, each of which consists largely of β-strands and an embedded RNA-binding site. The motifs are separated by a "linker" region that varies in length among different Sm proteins (A; Hermann et al. 1995; Khusial et al. 2005). When a single Sm protein is folded, both RNA-binding sites are found within loops located in close proximity on one side of the protein (B). When an entire Sm ring is assembled, these loops are oriented toward the central pore of the ring (Kambach, Walke, Young, et al. 1999).

 


Box 2. Sm proteins in prokaryotes
Box 2 FIG.—Prokaryotic Sm rings. (A) and (B) Archaeon Archaeoglobus fulgidus, two distinct Sm proteins: AF-Sm1, PDB 1I5L, complexed with oligo-U RNA (Törö et al. 2001), and AF-Sm2, PDB 1LJO (Törö et al. 2002). (C) Eubacterium Escherichia coli Hfq, PDB 1HK9 (Sauter et al. 2003). (D) Eubacterium Staphylococcus aureus Hfq, PDB 1KQ2, complexed with oligo-U RNA (Schumacher et al. 2002). Monomers are colored individually for clarity but are identical within each ring. All structures are from the Protein Data Bank (Berman et al. 2000) and were rendered with Protein Workshop (Moreland et al. 2005). Rings are not all shown at the same scale.

In prokaryotes, Sm proteins form six- or seven-membered homomeric rings with each member being an identical copy of the protein (Box 2 fig.). For example, Hfq from Escherichia coli forms 6-meric rings (Sauter et al. 2003), whereas the archaeon Archaeoglobus fulgidus has two Sm proteins: Sm1, which forms a homomorphic 7-meric ring, and Sm2, which forms a homomorphic ring containing either six (via crystallography, Törö et al. 2002) or seven (via electron microscopy when complexed with RNA, Achsel et al. 2001) Sm2 proteins. Thus, apart from differences that may be artifacts arising from methods of structural determination, the diversity of Sm rings in prokaryotes is determined on a one-to-one basis by the diversity of Sm proteins (Achsel et al. 2001; Törö et al. 2002).

Several solved structures (e.g., Box 2 fig. A and D) show prokaryotic Sm rings complexed with U-rich RNA encircling one face of the central pore of the ring (PDB IDs: 1I5L, Törö et al. 2001; 1LOJ, Mura, Kozhukhovsky et al. 2003; 1M8V, Thore et al. 2003), with one structure showing RNA encircling as well as penetrating the central pore (1KQ2, Schumacher et al. 2002). Strong affinity for U-rich RNA oligomers has been confirmed in the archaeon A. fulgidus (Törö et al. 2001), and affinity for A-rich regions, including the poly-A tails of transcripts, has been shown for the Sm protein Hfq in the eubacterium E. coli (Hajnsdorf and Régnier 2000; Schumacher et al. 2002).

Functional roles include participation in a variety of RNA-processing steps. In E. coli, Hfq primarily functions as an RNA chaperone, by binding to A/U–rich regions with sufficient strength to destabilize nearby (transient) secondary structures and thereby promoting correct RNA–RNA complex formation; such actions are implicated in regulation of translation (reviewed in Schumacher et al. 2002).

All evidence to date indicates that though prokaryotes may vary in the number of Sm proteins encoded by their genomes (Sun et al. 2002; Mura, Phillips et al. 2003), a one-to-one in vivo relationship exists between prokaryotic Sm proteins and homomorphic Sm rings (Achsel et al. 2001; Törö et al. 2002). If this is indeed true, then, barring the unlikely discovery of an Sm ring assembly pathway in prokaryotes, Sm proteins within a multi-Sm prokaryote must be considerably more self-selective than eukaryotic Sm proteins, for which empirical data show rather promiscuous binding between Sm folds (Plessel et al. 1997; Collins et al. 2003). This self-selectivity may be enhanced by having one ring assemble spontaneously in vivo, whereas another assembles in the presence of RNA, as is the case with Sm1 and Sm2, respectively, in A. fulgidus (Achsel et al. 2001). Supporting predictions include the exclusive assembly of homomeric Sm rings from mixtures of different Sm proteins from the same prokaryote, enhanced self-assembly kinetics when co-occurring Sm proteins form 7-meric rings versus rings with different numbers of monomers, and reduced selectivity within in vitro and in vivo mixtures of Sm proteins from different prokaryotes, particularly when the Sm proteins originate from more distantly related prokaryotes, for example, Archaea and Eubacteria.

 

Though the evolutionary diversification of Sm proteins and Sm rings in eukaryotes clearly reflects a general increase in RNA-processing complexity in eukaryotes (Anantharaman et al. 2002), there are several dramatic discontinuities in comparison to prokaryotes. These include the formation of heteromorphic, 7-meric Sm rings; the highly stable, static association of Sm rings with snRNA cofactors; the use of a dedicated pathway for assembling Sm rings with snRNA cofactors; and the central participation of Sm–snRNA complexes within heterogeneous small nuclear ribonucleoproteins (snRNPs) having a variety of eukaryote-specific RNA-processing functions.

The existing "diversification–duplication" model for the evolution of the Sm protein family in eukaryotes proposes two primary steps early in the evolution of eukaryotes (Salgado-Garrido et al. 1999). The protein family first diverged to form a single heterogeneous Sm ring, followed by a large-scale duplication that allowed for the assembly of a second but related Sm ring; these two rings are the "canonical" Sm ring at the heart of the U1, U2, U4, and U5 spliceosomal snRNPs and the Lsm ring at the heart of the U6 spliceosomal snRNP. Subsequent smaller duplications led to the creation of other Sm proteins that participate in other Sm rings.

In this study, we consider the pattern of evolutionary diversification of Sm proteins in eukaryotes. We first examine the phylogenetic structure of Sm proteins across a wide variety of prokaryotes and eukaryotes and identify major evolutionary trends in the family. We find that eukaryotic Sm diversity reached much of its present breadth very early in eukaryotic evolution. We then review Sm structure, diversity and function in prokaryotes and eukaryotes, as well as the pathways for Sm ring assembly in eukaryotes. Based on the functional review and the results of the phylogenetic analysis, we partition Sm rings in eukaryotes into two classes: fixed Sm rings and flexible Sm rings, corresponding roughly to the rather informal nomenclature for "Sm-type" and "Lsm-type" rings. Our fixed and flexible Sm ring class designations reflect not only broad differences in the duration of the Sm ring–RNA cofactor association but also presumed differences in Sm ring–RNA-binding patterns and Sm ring structural stability that underlie differences in Sm ring–RNA associations. Finally, we consider some possible mechanisms for Sm ring evolution in eukaryotes. The analyses presented here provide insights into the evolution of this distinctive, ubiquitous, and critical family of RNA-associated proteins.


    Phylogenetic Relationships among Sm Proteins
 
We examined the evolutionary relationships among Sm proteins across all life by assembling a phylogenetic tree containing 202 Sm proteins from GenBank records. Prokaryotic Sm proteins include those representing Archaea (figures in brackets indicate the number of Sm proteins in each taxon where greater than one): Archaeoglobus fulgidus [2], Thermoplasma volcanium GSS1 [2], Thermoplasma acidophilum DSM 1728 [2], Aeropyrum pernix K1 [2], Pyrobaculum aerophilum IM2 [3], and Methanopyrus kandleri AV19; and Eubacteria: Escherichia coli O157:H7, Staphylococcus aureus ssp. aureus JH1, Clostridium botulinum B strain Eklund 17B, Thermobifida fusca YX, α-proteobacterium HTCC2255 [2], and γ-proteobacterium HTCC2207. We also broadly sampled Sm proteins from eukaryotes, including the diplomonad Giardia lamblia, the parabasalid Trichomonas vaginalis G3, the apicomplexan Plasmodium falciparum 3D7, the cryptomonad Guillardia theta, the amoebozoan Entamoeba histolytica, the chlorarachniophyte Bigelowiella natans, the kinetoplastid Trypanosoma cruzi strain CL Brener, the plant Arabidopsis thaliana, the microsporidian Encephalitozoon cuniculi, the fungus Saccharomyces cerevisiae, and animals, including Drosophila melanogaster, the urochordate Ciona intestinalis, and human. Sm proteins were initially identified by GenBank annotations, with the sequence set incrementally expanded via PSI-Blast (Altschul et al. 1997). For species having insufficient GenBank coverage for Sm proteins, the excellent spliceosomal protein sequence database provided by Barbosa-Morais et al. (2006) was used; see their paper for further details of their sources. Multiple sequence alignment was performed using MUSCLE with default settings (Edgar 2004), with a neighbor-joining tree constructed from this alignment using ClustalW with gaps included and 1000 bootstraps (Chenna et al. 2003). Expanded information for the 202 sequences used may be found in supplementary table 1 (Supplementary Material online).
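The bootstrap step mentioned above can be illustrated with a minimal sketch. It is not the ClustalW implementation; it only shows the general idea of a bootstrap replicate, in which alignment columns are resampled with replacement before a tree is rebuilt from each replicate. The function name, the toy alignment, and the sequence labels are hypothetical and do not come from the study's data set.

```python
import random


def bootstrap_columns(alignment, rng):
    """Return one bootstrap replicate of an alignment.

    `alignment` maps a sequence name to its aligned sequence (gaps included).
    Columns are drawn with replacement, so the replicate has the same number
    of columns as the input and every column is a copy of an original column.
    """
    names = list(alignment)
    length = len(alignment[names[0]])
    # All rows of an alignment must share one length.
    if any(len(alignment[name]) != length for name in names):
        raise ValueError("sequences are not aligned to a common length")
    picks = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(alignment[name][i] for i in picks) for name in names}


# Hypothetical toy alignment, for illustration only.
toy = {"seqA": "MKL-V", "seqB": "MRLAV", "seqC": "MKIAV"}
replicate = bootstrap_columns(toy, random.Random(0))
print(replicate)
```

In a full analysis, a tree would be built from each of the 1000 replicates, and the support for a clade would be the percentage of replicate trees that contain it.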

Although the extreme age and high degree of diversification of the Sm protein family have resulted in relatively high uncertainty at more basal nodes within clades, some features are readily apparent in figure 1 and its more detailed version in supplementary figure 1 (Supplementary Material online): 1) the basal positions of archaeal and eubacterial Sm proteins, with archaeal proteins scattered throughout the tree and eubacterial proteins clustering together in a single clade; 2) the extreme depth of the fundamental splits among individual eukaryotic Sm proteins; and 3) the subsequent evolution within individual Sm protein clades following, with some variation, the course of eukaryotic diversification. In this tree, there is relatively little support for any one configuration of deeper branches above clades for individual Sm proteins. The interrelationships among some Sm protein clades shift somewhat with different compositions of the data set, though some neighbor clade pairs remain relatively robust, for example, Lsm8 + Lsm1, Lsm5 + SmE, and Lsm3 + SmD2 (data not shown). In this tree, some individual proteins are out of place, such as Lsm8, Lsm6, and SmB in S. cerevisiae. This is not surprising, given the great age of individual proteins in this family, but some of these placements may reflect species-specific evolutionary differences in RNA processing; for example, S. cerevisiae contains an unusually depauperate set of introns (Fink 1987).


 
FIG. 1.— Evolution of Sm family proteins throughout the three domains of life. Bootstrap support "***"≥90%, "**"≥70%, "*"≥50%. A fully annotated version of this figure is available as supplementary figure 1 (Supplementary Material online).
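The support annotation used in the caption above can be written as a small lookup. This is only a sketch of the stated thresholds; the function name is hypothetical.

```python
def support_stars(bootstrap_percent):
    """Map a bootstrap percentage to the star code used in the figure caption."""
    if bootstrap_percent >= 90:
        return "***"
    if bootstrap_percent >= 70:
        return "**"
    if bootstrap_percent >= 50:
        return "*"
    return ""  # nodes below 50% carry no annotation


print(support_stars(84))
```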

 

    Sm Protein Diversification in Eukaryotes
 
Eukaryotic Sm proteins have undergone considerable diversification in comparison to their prokaryotic ancestors and play a central role in many eukaryote-specific snRNPs. Nearly all in vivo Sm rings have been found to be 7-meric, with each of the 7 members being a distinct Sm protein (Fig. 2A). Exceptions to date include two functional complexes containing 6 distinct Sm proteins: the Sm ring selectively binding U8 snoRNA in Xenopus (Tomasevic and Peculis 2002) and the Sm ring binding snR5 RNA in Saccharomyces (Fernandez et al. 2004). Eukaryotic Sm proteins are involved in snRNA stability (Mayes et al. 1999; Liu et al. 2004), interaction with nuclear import factors during snRNP maturation and recycling of spliceosomal multi-snRNP complexes (Palacios et al. 1997; Chan et al. 2003; Liu et al. 2004; Narayanan et al. 2004), stabilization of spliceosome–mRNA complexes (Zhang et al. 2001), and modification of a variety of RNA substrates (Pillai et al. 2003).


 
FIG. 2.— Eukaryotic Sm ring and Sm-anchored snRNP. (A) Human fixed class Sm ring anchoring spliceosomal snRNPs (see text). Reprinted from Kambach, Walke, Young, et al. (1999). ©1999 with permission from Elsevier. (B) Human U1 snRNP. Shown are the fixed Sm ring with its central pore; the U1 snRNA passing through the central pore of the ring with loops I, II, III, and IV and the Sm site in blue, red, green, orange, and yellow, respectively; and the locations of the 70K, U1-A, and B + C set of U1-associated proteins (Stark and Lührmann 2006). Figure kindly provided by H. Stark (Max Planck Institute for Biophysical Chemistry, Göttingen, Germany); labels added by authors.

 
In this section, we will define and use our fixed and flexible class designations where appropriate. We will consider these class designations more fully in the Discussion.

Fixed Class (Sm-Type) Sm Rings
Fixed class eukaryotic Sm rings form stable, long-term associations with RNA substrates, around which they are assembled via a dedicated pathway. Fixed class Sm rings form largely passive scaffolds around which RNA and protein cofactors assemble in a spatially explicit manner. RNA substrates of fixed Sm rings contain an "Sm site," which consists of PuAU4–6GPu flanked by two stem–loop structures (Branlant et al. 1982; Liautard et al. 1982; Urlaub et al. 2001; Khusial et al. 2005). Electron microscopy and UV cross-linking studies of the human U1 snRNP suggest that, rather than encircle one face of the Sm ring as in prokaryotes, the Sm site of U1 snRNA circles through the central pore (Fig. 2B and Stark et al. 2001; Urlaub et al. 2001; Stark and Lührmann 2006). The binding configuration of other fixed Sm ring–RNA associations is currently unknown.
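The Sm site consensus given above (a purine, A, four to six U, G, a purine) can be expressed as a pattern for scanning an RNA sequence. The following is a minimal sketch only: it checks the linear consensus and does not test for the flanking stem–loop structures, the function name is hypothetical, and the example sequence is invented rather than taken from any snRNA.

```python
import re

# PuAU(4-6)GPu: purine, A, four to six U, G, purine.
SM_SITE = re.compile(r"[AG]AU{4,6}G[AG]")


def find_sm_sites(rna):
    """Return (start, matched_text) for each non-overlapping consensus match.

    Input is upper-cased and T is read as U so DNA-style records can be scanned.
    Flanking stem-loops, which are part of a real Sm site, are NOT checked.
    """
    seq = rna.upper().replace("T", "U")
    return [(m.start(), m.group()) for m in SM_SITE.finditer(seq)]


# Invented example sequence, for illustration only.
print(find_sm_sites("CCAAUUUUUGACC"))
```

A hit from this pattern is a candidate only; divergent Sm sites such as those described below for U7 and trypanosome U2 snRNAs would not necessarily match it.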

The "canonical" fixed class eukaryotic Sm ring is found at the center of the U1, U2, U4, and U5 spliceosomal snRNPs (Fig. 2A). This Sm ring contains monomers in the order SmD1/SmD2, SmF/SmE/SmG, and SmD3/SmB, where the subgroups indicate spontaneously forming dimers and a trimer that are then assembled to form the complete ring (Kambach, Walke, and Nagai 1999; Kambach, Walke, Young, et al. 1999; Raker et al. 1999). This Sm ring is assembled around its various snRNA substrates in the cytoplasm via the Survival of Motor Neurons (SMN) pathway (Meister et al. 2002; Yong, Wan, and Dreyfuss 2004) with components that selectively bind both Sm proteins (Pu et al. 1999; Friesen et al. 2001) and snRNA (Pellizzoni, Yong, and Dreyfuss 2002; Yong, Pellizzoni, et al. 2002; Golembe, Yong, and Dreyfuss 2005). These Sm-snRNA pre-snRNPs are transported into the nucleus complexed with the SMN protein (Fischer et al. 1991; Palacios et al. 1997; Massenet et al. 2002; Narayanan et al. 2004) and are then matured within Cajal bodies (Jády et al. 2003; Kiss 2004; Cioce and Lamond 2005; Liu, Murphy, et al. 2006; Matera and Shpargel 2006; Stanek and Neugebauer 2006; Tycowski et al. 2006). The high selectivity of the SMN pathway ensures correct association of Sm site–containing snRNA with the appropriate fixed Sm ring, but it is not foolproof. RNAs associated with the primate virus Herpesvirus saimiri can outcompete host snRNAs for fixed Sm rings on this same pathway and thereby gain intracellular stability (Golembe, Yong, Battle, et al. 2005).

There are taxon- and tissue-specific variations in the composition of the fixed Sm ring in spliceosomal snRNPs. In trypanosomes, some spliceosomal snRNPs contain variant Sm rings. Trypanosome U2 snRNAs have a divergent Sm site in comparison both to other spliceosomal snRNAs in trypanosomes and to U2 snRNAs in other eukaryotes, and the Sm ring assembled around this Sm site is also divergent (Wang et al. 2006). The SmD3/SmB dimer is replaced by the novel Sm proteins Sm16.5K/Sm15K, the closest sequence homologs of which are SmD3 and SmB, respectively (Wang et al. 2006). In the trypanosome U4 snRNP, SmD3 alone is replaced by a novel Sm protein not found in the U2 snRNP (Tkacz et al. 2007). In mammals, there are three variants of SmB (SmB, SmB', and SmN) that may all substitute for SmB in spliceosomal snRNPs; they are interrelated by alternative splicing and gene duplication (McAllister et al. 1988; McAllister et al. 1989; Chu and Elkon 1991; Griffith et al. 1992; Gray et al. 1999). SmN is most highly expressed in neural tissue, particularly postnatal brain, and its underexpression is associated with Prader–Willi syndrome (Gray et al. 1999; Nicholls and Knepper 2001).

There are other snRNPs that contain fixed Sm rings assembled and matured via the SMN pathway and Cajal bodies. The U7 snRNP (Schümperli and Pillai 2004) performs 3'-end processing of histone mRNAs and is highly conserved among metazoans. The U7 snRNA contains a divergent Sm site bound by an Sm ring similar in composition to that for the spliceosomal snRNPs but with Sm proteins Lsm10/Lsm11 substituted for SmD1/SmD2, with these new Sm proteins derived from SmD1 (or SmD3) and SmD2, respectively. Lsm11, in contrast to the largely passive role played by all other known Sm proteins found in Sm rings, contains an additional domain that is directly involved in histone mRNA processing (Pillai et al. 2003). The divergent Sm site in U7 snRNA is required for the SMN pathway to assemble the divergent Sm ring (Kolev and Steitz 2006), and Cajal bodies are involved in U7 snRNP maturation in the nucleus (Stanek and Neugebauer 2006).
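The variant fixed rings described in this section can be summarized as position-wise substitutions into the canonical ring. The sketch below is a bookkeeping aid only, under the assumption that each substitute occupies the ring position of the protein it replaces; the text states which proteins are replaced but the positional equivalence is our simplification, and the names `CANONICAL_RING` and `substitute` are hypothetical.

```python
# Canonical fixed ring in the order given in the text:
# SmD1/SmD2 dimer, SmF/SmE/SmG trimer, SmD3/SmB dimer.
CANONICAL_RING = ("SmD1", "SmD2", "SmF", "SmE", "SmG", "SmD3", "SmB")


def substitute(ring, replacements):
    """Return a new ring with the named members swapped for their substitutes."""
    missing = set(replacements) - set(ring)
    if missing:
        raise KeyError(f"not in ring: {sorted(missing)}")
    return tuple(replacements.get(protein, protein) for protein in ring)


# U7 snRNP: Lsm10/Lsm11 substituted for SmD1/SmD2.
U7_RING = substitute(CANONICAL_RING, {"SmD1": "Lsm10", "SmD2": "Lsm11"})

# Trypanosome U2 snRNP: Sm16.5K/Sm15K replace SmD3/SmB.
TRYPANOSOME_U2_RING = substitute(
    CANONICAL_RING, {"SmD3": "Sm16.5K", "SmB": "Sm15K"}
)

print(U7_RING)
print(TRYPANOSOME_U2_RING)
```

Writing the rings this way makes it easy to see that both variants retain the SmF/SmE/SmG trimer and differ from the canonical ring in a single dimer.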

The telomerase snRNP replicates chromosome ends in eukaryotes (Collins 2006), and there is evidence suggestive of metazoan telomerase snRNP hosting a fixed Sm ring at its core. Both human and yeast telomerase snRNA contain an Sm site around which Sm proteins assemble, yet curiously, in both humans and yeast only two Sm proteins have yet been identified: SmB and SmD3 in humans (Fu and Collins 2006), and SmD1 and SmD3 in yeast (Seto et al. 1999). Given the composition of other Sm rings, it seems reasonable to expect that additional Sm proteins will eventually be found in these complexes. There is a great deal of divergence in maturation pathways among eukaryotic telomerase snRNPs; for example, ciliate telomerase does not contain Sm proteins (Collins 2006). Yeast telomerase snRNPs have been shown to be assembled in the cytoplasm and imported into the nucleus (Ferrezuelo et al. 2002; Teixeira et al. 2002), and in humans, telomerase components are found in Cajal bodies (Fu and Collins 2006), suggesting that maturation pathways in at least some metazoans are similar to those used by spliceosomal and U7 snRNPs.

A wide variety of eukaryotes process mono- and/or polycistronic transcripts via trans-splicing, in which a fragment of a "leader" RNA is spliced onto each cistron from a specialized splice leader (SL) snRNA found within a specialized SL snRNP (Hastings 2005). All SL snRNAs examined to date contain an Sm site (Mandelboim et al. 2003; Zeiner et al. 2004; Zhang et al. 2007) to which is bound an Sm ring that contains many if not all of the same Sm proteins found in the fixed Sm ring at the center of other spliceosomal snRNPs (Bruzik et al. 1988; Thomas et al. 1988; Palfi et al. 2000; Tkacz et al. 2007). A very curious consequence of trans-splicing is that the life cycle of the SL snRNP containing the SL snRNA is unusually short in comparison to that of other snRNPs anchored by fixed Sm rings (MacMorris et al. 2007). We will consider SL snRNPs further in the Discussion.

Flexible Class (Lsm-Type) Sm Rings
Flexible class eukaryotic Sm rings differ in several respects from fixed class Sm rings, the primary difference being a more fluid association with RNA substrates. Flexible Sm rings can assemble spontaneously and are stable in the absence of RNA (Achsel et al. 1999; Zaric et al. 2005). RNA substrates of flexible Sm rings lack Sm sites but do have U-rich tracts (Will and Luhrmann 2001). When attached to an RNA substrate, a flexible Sm ring stabilizes and chaperones RNA as does a fixed Sm ring, yet a flexible Sm ring may be more easily associated and dissociated from RNA in some presumably specific manner. A flexible Sm ring may also play a somewhat more active role, in that its presence or absence may signal a transition in the life cycle of an snRNP or RNA substrate.

A characteristic member of the flexible class of Sm rings is the "Lsm ring" at the center of the U6 spliceosomal snRNP, containing the seven proteins Lsm2 through Lsm8 (Achsel et al. 1999; Mayes et al. 1999). Several aspects of the assembly and maturation of the U6 snRNP and its flexible Sm ring differ from those for other spliceosomal snRNPs. In addition to assembling spontaneously in the absence of RNA (Achsel et al. 1999; Zaric et al. 2005), the U6-associated flexible ring appears to be transported into the nucleus as an assembled unit (Will and Luhrmann 2001). The U6 snRNA is transcribed by RNA pol III and is never exported from the nucleus (Kunkel et al. 1986), and assembly of the final U6 snRNP appears to take place entirely within the nucleus (Will and Luhrmann 2001). Though in at least some taxa proteins chaperone pre-U6 snRNP components, for example, the La protein with newly transcribed U6 snRNA in yeast (Xue et al. 2000), both ring assembly and ring–snRNA association steps in U6 snRNP maturation occur without use of the cytoplasmic SMN pathway characteristic of fixed Sm rings.

As details begin to emerge, additional functional differences serve to distinguish the flexible Sm ring at the heart of the U6 snRNP from the fixed Sm ring anchoring the other spliceosomal snRNPs (Karaduman et al. 2006). The flexible Sm ring binds to a U-rich region at the 3'-end of U6 snRNA (Achsel et al. 1999). The extensive base pairing of U4 and U6 snRNAs at the center of the U4/U6 di-snRNP is facilitated by conformational changes in U6 snRNA induced by the ring (Karaduman et al. 2006), and formation of the catalytic center of the spliceosome is critically dependent upon its subsequent dissociation from U6 snRNA (Chan et al. 2003). Following dissociation of the spliceosome, the nuclear retention of U6 snRNA and nuclear regeneration of U6 snRNP are both dependent upon the presence of the flexible Sm ring (Verdone et al. 2004; Spiller et al. 2007).

Other flexible Sm rings have similarly transient associations with their RNA substrates. In yeast, a flexible Sm ring containing the proteins Lsm1 through Lsm7 assembles in the absence of RNA (Zaric et al. 2005) and functions in cytoplasmic mRNA degradation (He and Parker 2000; Tharun et al. 2000; Bergman et al. 2007). The same flexible Sm ring at the center of U6 snRNP (Lsm2 through Lsm8) also plays a role in mRNA degradation within the nucleus (Kufel et al. 2004).

Flexible Sm rings are also common participants in RNA editing pathways, where the fluidity of their association with RNA substrates may be beneficial. Lsm2 through Lsm7, for example, associate with several RNAs: with snR5, a box H/ACA snoRNA for guiding site-specific modifications of rRNA (Fernandez et al. 2004); with pre-RNase P RNA (Salgado-Garrido et al. 1999); and perhaps with all rRNA, tRNA, and certain U3 snoRNA precursors (Kufel et al. 2002; Kufel, Allmag, Petfalski, et al. 2003; Kufel, Allmag, Verdone, et al. 2003). A flexible Sm ring composed of a possibly different set of Sm proteins is involved in binding U8 snoRNA, which edits rRNA in Xenopus oocytes (Tomasevic and Peculis 2002). It is likely that additional experimental evidence will reveal yet more flexible Sm rings and more variations among species.


    Discussion
 
The basal locations of prokaryotic Sm proteins within the tree provide clear evidence that current eukaryotic Sm proteins are derived from those in prokaryotes (Fig. 1). Since it was first proposed, the diversification–duplication model has been the primary model for the evolution of the Sm protein family in eukaryotes (Salgado-Garrido et al. 1999). Our results provide support for some aspects of this model, but there are a number of reasons to believe the picture is considerably more complicated. Consistent with the diversification–duplication model, our phylogenetic analysis revealed that, except for a few more recently derived proteins such as SmY and SmN, the establishment of nearly all existing Sm proteins occurred prior to the last eukaryotic common ancestor (Fig. 1). However, weak support for deeper nodes within the tree makes specific assignment of neighbor relationships among Sm protein clades difficult. A few neighbor clades are robust and largely consistent with the diversification–duplication model (Lsm7 + SmG, Lsm8 + Lsm1, Lsm5 + SmE, Lsm3 + SmD2), but other clades are nested, and neighbor relationships among several clades shift with different data set compositions. It is difficult to reconcile this with the occurrence of a single large-scale duplication event in the family; several partial duplications involving one or a few proteins seem more likely.

The primary novelty in eukaryotic Sm rings is provided by the heteromorphic nature of the Sm ring itself. Although a homomorphic Sm ring as found within prokaryotes can provide spatial specificity for binding and interactions with the pore and faces only within the rotational span of a monomer (Fig. 3A), a heteromorphic Sm ring provides spatial specificity throughout the entire rotational sweep (Fig. 3B). This ensures precise conformational specificity of bound RNA and protein components within the snRNP. This specificity is important for the snRNA, as demonstrated by site-specific pairings of Sm-site uracils in the snRNA with Sm ring pore residues (Hartmuth et al. 1999; Urlaub et al. 2001; Wang et al. 2006) and interactions between snRNA stem–loop secondary structures and faces of the Sm ring (Stark et al. 2001), and for other protein–RNA and protein–protein interactions within the snRNP (Fig. 2B and Urlaub et al. 2000; Dybkov et al. 2006; Stark and Lührmann 2006). The maintenance of consistent three-dimensional structure and snRNA–protein membership of spliceosomal and other snRNPs is certainly dependent upon the heterogeneous nature of the eukaryotic Sm ring. An additional benefit of a heterogeneous Sm ring is that individual Sm proteins are free to develop functional side chains without repetition around the ring (Pillai et al. 2003).


FIG. 3.— Steric specificity for seven-membered Sm rings. (A) Homomorphic ring typical of prokaryotes. (B) Heteromorphic ring typical of eukaryotes.

 
With this view, substitutions of Sm proteins within Sm rings central to other snRNPs (e.g., the U7 snRNP, Pillai et al. 2001, 2003) reflect variations in spatial and conformational relationships among the differing RNA and protein cofactors and in the interactions of these partners with pore and faces of the Sm ring itself. The use of the "canonical" eukaryotic Sm ring (Fig. 2A and B) in six spliceosomal snRNPs (U1, U2, U4, U5, U11, and U12 of the minor spliceosome) that are themselves heterogeneous in form and function (Collins and Penny 2005; Russell et al. 2006; Will and Lührmann 2006) indicates that the relationship between snRNP and Sm ring is not exclusive. This suggests a common time of origin for these spliceosomal snRNPs, perhaps coincident with this form of fixed Sm ring.

In support of the necessity of steric specificity in Sm rings, we expect altered spatial conformations of snRNA and proteins within snRNPs anchored by nonstandard Sm rings, that snRNPs anchored by nonstandard Sm rings are more likely to attract novel proteins, that snRNPs containing nonstandard Sm rings may have limited functionality, and that smaller "minimal functional" snRNPs would be formed when anchored by standard Sm rings.

What might have been some of the early events leading from homomorphic Sm rings in prokaryotes to heteromorphic Sm rings in eukaryotes? The initial diversification may not have originated in prokaryotes; none of the proteins in the Archaea we examined with multiple (and self-selective) Sm proteins (P. aerophilum, A. pernix, T. volcanium, T. acidophilum, A. fulgidus) fell near each other in our tree (supplementary Fig. 1, Supplementary Material online). The "seeding" of Sm protein diversity may have occurred via contact between long-isolated Sm proteins that would have lost the "niche-exclusionary" ability to avoid binding with each other's Sm folds. Such contact could have occurred via lateral transfer among Archaea or between Archaea and Eubacteria or could have coincided with genome clashes and/or gene transfers proposed to have accompanied the endosymbiotic origin of the eukaryotic cell itself (Koonin 2006; Martin and Koonin 2006). The evolved self-affinity necessary for the formation of the original homomorphic rings would still be in place, but competition from a novel Sm fold would have resulted in the formation of heteromorphic Sm rings. Such heteromorphic rings could have remained functional, for two reasons: 1) the novel rings would have continued to bind U-rich RNA, much as this ability is maintained in novel in vitro Sm complexes in yeast (Collins et al. 2003) and 2) because of low rotational specificity in the original homomorphic rings, any protein–protein or protein–RNA interactions required for function likely did not require the entire homomorphic ring and would have been able to occur against the original monomers (Mikulecky et al. 2004). This could be tested by examining the in vitro behavior of novel mixtures of Sm proteins from closely and distantly related prokaryotes.

Once established, a heteromorphic Sm ring could have provided a template for further diversification, by allowing for greater steric specificity and thus greater complexity of Sm ring–associated snRNPs. Novel substitutions can occur with comparatively high frequency in heteromorphic Sm rings, as evidenced by clade-, species-, and even tissue-specific substitutions within eukaryotic Sm rings.

Finally, the heptameric structure of Sm rings could have accelerated the process of diversification. Consider a homomorphic heptamer, A7. A single substitution creates a 6 + 1 ring, A6B. For this structure to be stabilized, neighbor–neighbor relationships should evolve some specificity, but 7 is prime; there is no small multiple that eases this transition. In a hexamer, this could occur in pairs (AB)3 or (with an additional Sm protein) in triplets (ABC)2. Polymerization of such small subunits into novel hexamers and octamers has been observed in vitro for eukaryotic Sm proteins (Zaric et al. 2005). Thus, the most stable seven-member heteromorphic ring may be one that is entirely heteromorphic, with no repeated subunits.
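
The arithmetic behind this argument can be sketched by listing, for a given ring size n, the repeating arrangements that use more than one but fewer than n distinct subunits. This is an illustrative sketch only: the function name and the letter labels are assumptions introduced here, and only strictly periodic arrangements are considered.

```python
from string import ascii_uppercase


def intermediate_patterns(n: int) -> list:
    """Periodic arrangements of an n-member ring built from a repeat unit
    of more than one but fewer than n distinct subunits."""
    patterns = []
    for unit in range(2, n):
        if n % unit == 0:  # the repeat unit must tile the ring exactly
            patterns.append(f"({ascii_uppercase[:unit]}){n // unit}")
    return patterns


for n in (6, 7, 8):
    print(n, intermediate_patterns(n))
# 6 ['(AB)3', '(ABC)2']
# 7 []
# 8 ['(AB)4', '(ABCD)2']
```

Because 7 has no divisors other than 1 and 7, the list for a heptamer is empty: the only strictly periodic seven-member rings are the fully homomorphic ring and the ring with seven distinct subunits.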

Why do spliceosomal snRNPs contain two distinct Sm rings? The flexible Sm ring at the heart of the U6 spliceosomal snRNP is highly conserved in eukaryotes (Séraphin 1995; Mayes et al. 1999; Salgado-Garrido et al. 1999; Liu et al. 2004). There may be a functional basis derived from the formation of the U4/U6 di-snRNP, which then forms a tri-snRNP with the U5 snRNP in which RNA–RNA, RNA–protein and protein–protein contacts are all important (Vidal et al. 1999; Chan et al. 2003; Karaduman et al. 2006; Liu, Rauhut, et al. 2006). Especially during the formation of the U4/U6 di-snRNP, proper interactions may depend upon each snRNP containing entirely nonoverlapping sets of Sm proteins. The stabilization of these interactions was an early event in the evolution of the spliceosome (Collins and Penny 2005).

Fixed and Flexible Sm Rings
We have proposed nomenclature for classes of Sm rings, fixed (abbreviated "Fix") and flexible ("Flex"), as replacements for the informal classes of Sm-type and Lsm-type, respectively. This nomenclature is both reflective of the different functional roles played by each class and evocative of the manner in which the rings associate with and dissociate from their RNA cofactors (table 1). Our nomenclature is also free from potential confusions that may arise when an Sm ring such as that at the center of the metazoan U7 snRNP is composed of both "Sm"-prefixed and "Lsm"-prefixed protein monomers (Pillai et al. 2003). Some of these characteristics are shared by Sm proteins in Archaea, with one ring forming only in the presence of RNA and one forming spontaneously (Achsel et al. 2001). Proteins involved in fixed and flexible rings do not form monophyletic groups (Fig. 1); thus, it may be these functional classes for Sm rings, broadly defined and suitably diversified, which have been maintained throughout the evolution of the Sm protein family, rather than the specific protein composition of the rings.


Table 1 Characteristics distinguishing fixed from flexible Sm rings

 
One currently somewhat speculative characteristic that distinguishes fixed from flexible eukaryotic Sm rings is the manner in which each class binds to associated RNA (table 1). The U1 snRNA appears to pass through the central pore of the fixed Sm ring at the center of the U1 snRNP (Fig. 2B, see also Stark et al. 2001; Urlaub et al. 2001; Stark and Lührmann 2006). Such a configuration would likely confer additional stability onto the fixed Sm ring–snRNA association, while increasing the cost of dissociation. In comparison, flexible Sm rings might be more easily dissociated from their RNA cofactors if the RNA circles around one face of the central pore without passing through, as appears to be common in prokaryotic Sm–RNA associations.

The Sm ring that anchors the SL snRNP in organisms that use trans-splicing is an anomalous fixed class ring with rapid association and dissociation from its RNA substrate. The steps required for this are 1) assembly of a fixed Sm ring around SL snRNA, 2) maturation of the SL snRNP, 3) participation of the SL snRNP in trans-splicing, 4) removal of non-Sm proteins from the SL snRNP, and 5) dissociation of the fixed Sm ring from the "spent" SL snRNA. Details of most of these steps are unknown (MacMorris et al. 2007). In trypanosomes, the fixed Sm ring at the center of the SL snRNP appears to be identical in protein composition to the one at the center of the U1, U4, and U5 snRNPs, so some particular feature of the SL snRNP allows this rapid turnover. In trypanosomes and other organisms with trans-splicing, there may be a dedicated pathway that recognizes spent SL snRNPs; though it seems likely to exist, evidence of such a pathway has not yet been found.

SL snRNPs represent a sort of functional intermediate between the longer term associations of fixed Sm rings and the more transient associations typical of flexible Sm rings. For trans-splicing to evolve, the stumbling blocks that a stable fixed Sm ring poses to rapid dissociation would have had to be removed. The use of a fixed Sm ring in SL snRNPs may indicate that the SL snRNP is derived from one of the other spliceosomal snRNPs and that trans-splicing is itself derived from ancestral cis-splicing, perhaps multiple times in several eukaryotic lineages (Nilsen 2001; Hastings 2005). The maintenance of the fixed Sm ring at the center of all SL snRNPs may represent simply a contingent characteristic or may indicate the need to have a fixed Sm ring at the heart of an SL snRNP, perhaps to provide greater stability during interactions and reconfigurations involving other spliceosomal snRNPs.

Recently, a number of multidomain RNA-associated proteins from a wide range of eukaryotes have been found to contain divergent but identifiable Sm domains (Albrecht and Lengauer 2004; Albrecht et al. 2004; Tadauchi et al. 2004; Fleischer et al. 2006; Yang et al. 2006; Tritschler et al. 2007). Unlike Lsm11, which contains an additional domain functional in processing of histone mRNAs (Pillai et al. 2003), none of these proteins has been observed in any form of Sm ring. However, they may yet be found in Sm rings or may interact via their Sm domains to form novel multimeric complexes (Fleischer et al. 2006).

In conclusion, the evolutionary diversification of Sm rings coincided with the evolution of eukaryotes. We do not find much specific support for the diversification–duplication model for the development of separate fixed and flexible Sm rings. Fixed Sm rings such as those anchoring the majority of spliceosomal snRNPs and the U7 snRNP are stable, passive, noncatalytic, spatially specific protein scaffolds around which RNA and proteins that are active within snRNPs can organize. Flexible Sm rings share many of these characteristics but can be more freely associated and dissociated from RNA substrates. As further details concerning the diversity of RNA processing in eukaryotes are found, it is likely that additional forms of both fixed and flexible rings will be discovered.


    Supplementary Material
Supplementary Table 1 and Figure 1 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
This work was supported by National Science Foundation grants DBI-0434671 to D.G.S. and MCB-0342431 to M.L. D.G.S. thanks D. A. Campbell for helpful discussions and two anonymous reviewers for helpful comments that greatly improved the manuscript.


    Footnotes
 
1 Present address: Department of Ecology and Evolutionary Biology, University of California, Los Angeles.

Laura Katz, Associate Editor


    References

    Achsel T, Brahms H, Kastner B, Bachi A, Wilm M, Lührmann R. A doughnut-shaped heteromer of human Sm-like proteins binds to the 3'-end of U6 snRNA, thereby facilitating U4/U6 duplex formation in vitro. EMBO J (1999) 18:5789–5802.

    Achsel T, Stark H, Lührmann R. The Sm domain is an ancient RNA-binding motif with oligo(U) specificity. Proc Natl Acad Sci USA (2001) 98:3685–3689.

    Albrecht M, Golatta M, Wullner U, Lengauer T. Structural and functional analysis of ataxin-2 and ataxin-3. Eur J Biochem (2004) 271:3155–3170.

    Albrecht M, Lengauer T. Novel Sm-like proteins with long C-terminal tails and associated methyltransferases. FEBS Lett (2004) 569:18–26.

    Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 25:3389–3402.

    Anantharaman V, Aravind L. Novel conserved domains in proteins with predicted roles in eukaryotic cell-cycle regulation, decapping and RNA stability. BMC Genomics (2004) 5:45.

    Anantharaman V, Koonin EV, Aravind L. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic Acids Res (2002) 30:1427–1464.

    Barbosa-Morais NL, Carmo-Fonseca M, Aparício S. Systematic genome-wide annotation of spliceosomal proteins reveals differential gene family expansion. Genome Res (2006) 16:66–77.

    Bergman N, Moraes KCM, Anderson JR, Zaric B, Kambach C, Schneider RJ, Wilusz CJ, Wilusz J. Lsm proteins bind and stabilize RNAs containing 5' poly(A) tracts. Nat Struct Mol Biol (2007) 14:824–831.

    Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res (2000) 28:235–242.

    Branlant C, Krol A, Ebel JP, Lazar E, Haendler B, Jacob M. U2 RNA shares a structural domain with U1, U4, and U5 RNAs. EMBO J (1982) 1:1259–1265.

    Bruzik JP, Vandoren K, Hirsh D, Steitz JA. Trans splicing involves a novel form of small nuclear ribonucleoprotein particles. Nature (1988) 335:559–562.

    Chan SP, Kao DI, Tsai WY, Cheng SC. The Prp19p-associated complex in spliceosome activation. Science (2003) 302:279–282.

    Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res (2003) 31:3497–3500.

    Chu JL, Elkon KB. The small nuclear ribonucleoproteins, SmB and B', are products of a single gene. Gene (1991) 97:311–312.

    Cioce M, Lamond AI. Cajal bodies: a long history of discovery. Annu Rev Cell Dev Biol (2005) 21:105–131.

    Collins BM, Cubeddu L, Naidoo N, Harrop SJ, Kornfeld GD, Dawes IW, Curmi PMG, Mabbutt BC. Homomeric ring assemblies of eukaryotic Sm proteins have affinity for both RNA and DNA: crystal structure of an oligomeric complex of yeast SmF. J Biol Chem (2003) 278:17291–17298.

    Collins K. The biogenesis and regulation of telomerase holoenzymes. Nat Rev Mol Cell Biol (2006) 7:484–494.

    Collins L, Penny D. Complex spliceosomal organization ancestral to extant eukaryotes. Mol Biol Evol (2005) 22:1053.

    Dybkov O, Will CL, Deckert J, Behzadnia N, Hartmuth M, Lührmann R. U2 snRNA-protein contacts in purified human 17S U2 snRNPs and in spliceosomal A and B complexes. Mol Cell Biol (2006) 26:2803–2816.

    Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res (2004) 32:1792–1797.

    Fernandez CF, Pannone BK, Chen XG, Fuchs G, Wolin SL. An Lsm2-Lsm7 complex in Saccharomyces cerevisiae associates with the small nucleolar RNA snR5. Mol Biol Cell (2004) 15:2842–2852.

    Ferrezuelo F, Steiner B, Aldea M, Futcher B. Biogenesis of yeast telomerase depends on the importin Mtr10. Mol Cell Biol (2002) 22:6046–6055.

    Fink GR. Pseudogenes in yeast. Cell (1987) 49:5–6.

    Fischer U, Darzynkiewicz E, Tahara SM, Dathan NA, Lührmann R, Mattaj IW. Diversity in the signals required for nuclear accumulation of U snRNPs and variety in the pathways of nuclear transport. J Cell Biol (1991) 113:705–714.

    Fleischer TC, Weaver CM, McAfee KJ, Jennings JL, Link AJ. Systematic identification and functional screens of uncharacterized proteins associated with eukaryotic ribosomal complexes. Genes Dev (2006) 20:1294–1307.

    Friesen WJ, Massenet S, Paushkin S, Wyce A, Dreyfuss G. SMN, the product of the spinal muscular atrophy gene, binds preferentially to dimethylarginine-containing protein targets. Mol Cell (2001) 7:1111–1117.

    Fu D, Collins K. Human telomerase and Cajal body ribonucleoproteins share a unique specificity of Sm protein association. Genes Dev (2006) 20:531–536.

    Golembe TJ, Yong J, Dreyfuss G. Specific sequence features, recognized by the SMN complex, identify snRNAs and determine their fate as snRNPs. Mol Cell Biol (2005) 25:10989–11004.

    Golembe TJ, Yong JS, Battle DJ, Feng WQ, Wan LL, Dreyfuss G. Lymphotropic Herpesvirus saimiri uses the SMN complex to assemble Sm cores on its small RNAs. Mol Cell Biol (2005) 25:602–611.

    Gray TA, Smithwick MJ, Schaldach MA, Martone DL, Graves JA, McCarrey JR, Nicholls RD. Concerted regulation and molecular evolution of the duplicated SNRPB'/B and SNRPN loci. Nucleic Acids Res (1999) 27:4577–4584.

    Griffith AJ, Schmauss C, Craft J. The murine gene encoding the highly conserved Sm B protein contains a nonfunctional alternative 3' splice site. Gene (1992) 114:195–201.

    Hajnsdorf E, Régnier P. Host factor Hfq of Escherichia coli stimulated elongation of poly(A) tails by poly(A) polymerase I. Proc Natl Acad Sci USA (2000) 97:1501–1505.

    Hartmuth K, Raker VA, Huber J, Branlant C, Lührmann R. An unusual chemical reactivity of Sm site adenosines strongly correlates with proper assembly of core U snRNP particles. J Mol Biol (1999) 285:133–147.

    Hastings KEM. SL trans-splicing: easy come or easy go? Trends Genet (2005) 21:240–247.

    He WH, Parker R. Functions of Lsm proteins in mRNA degradation and splicing. Curr Opin Cell Biol (2000) 12:346–350.

    Hermann H, Fabrizio P, Raker VA, Foulaki K, Hornig H, Brahms H, Lührmann R. snRNP Sm proteins share two evolutionarily conserved sequence motifs which are involved in Sm protein-protein interactions. EMBO J (1995) 14:2076–2088.

    Jády BE, Darzacq X, Tucker KE, Matera AG, Bertrand E, Kiss T. Modification of Sm small nuclear RNAs occurs in the nucleoplasmic Cajal body following import from the cytoplasm. EMBO J (2003) 22:1878–1888.

    Kambach C, Walke S, Nagai K. Structure and assembly of the spliceosomal small nuclear ribonucleoprotein particles. Curr Opin Struct Biol (1999) 9:222–230.

    Kambach C, Walke S, Young R, Avis JM, De la Fortelle E, Raker VA, Lührmann R, Li J, Nagai K. Crystal structures of two Sm protein complexes and their implications for the assembly of the spliceosomal snRNPs. Cell (1999) 96:375–387.

    Karaduman R, Fabrizio P, Hartmuth K, Urlaub H, Lührmann R. RNA structure and RNA-protein interactions in purified yeast U6 snRNPs. J Mol Biol (2006) 356:1248–1262.

    Khusial P, Plaag R, Zieve GW. LSm proteins form heptameric rings that bind to RNA via repeating motifs. Trends Biochem Sci (2005) 30:522–528.

    Kiss T. Biogenesis of small nuclear RNPs. J Cell Sci (2004) 117:5949–5951.

    Kolev NG, Steitz JA. In vivo assembly of functional U7 snRNP requires RNA backbone flexibility within the Sm-binding site. Nat Struct Mol Biol (2006) 13:347–353.

    Koonin EV. The origin of introns and their role in eukaryogenesis: a compromise solution to the introns-early versus introns-late debate? Biol Direct (2006) 1:22.

    Kufel J, Allmang C, Petfalski E, Beggs J, Tollervey D. Lsm proteins are required for normal processing and stability of ribosomal RNAs. J Biol Chem (2003) 278:2147–2156.

    Kufel J, Allmang C, Verdone L, Beggs J, Tollervey D. A complex pathway for 3' processing of the yeast U3 snoRNA. Nucleic Acids Res (2003) 31:6788–6797.

    Kufel J, Allmang C, Verdone L, Beggs JD, Tollervey D. Lsm proteins are required for normal processing of pre-tRNAs and their efficient association with La-homologous protein Lhp1p. Mol Cell Biol (2002) 22:5248–5256.

    Kufel J, Bousquet-Antonelli C, Beggs JD, Tollervey D. Nuclear pre-mRNA decapping and 5' degradation in yeast require the Lsm2-8p complex. Mol Cell Biol (2004) 24:9646–9657.

    Kunkel GR, Maser RL, Calvet JP, Pederson T. U6 small nuclear RNA is transcribed by RNA polymerase III. Proc Natl Acad Sci USA (1986) 83:8575–8579.

    Liautard JP, Sri-Widada J, Brunel C, Jeanteur P. Structural organization of ribonucleoproteins containing small nuclear RNAs from HeLa cells: proteins interact closely with a similar structural domain of U1, U2, U4 and U5 small nuclear RNAs. J Mol Biol (1982) 162:623–643.

    Liu JL, Murphy C, Buszczak M, Clatterbuck S, Goodman R, Gall JG. The Drosophila melanogaster Cajal body. J Cell Biol (2006) 172:875–884.

    Liu Q, Liang XH, Uliel S, Belahcen M, Unger R, Michaeli S. Identification and functional characterization of Lsm proteins in Trypanosoma brucei. J Biol Chem (2004) 279:18210–18219.

    Liu S, Rauhut R, Vornlocher H-P, Lührmann R. The network of protein-protein interactions within the human U4/U6.U5 tri-snRNP. RNA (2006) 12:1418–1430.

    MacMorris M, Kumar M, Lasda E, Larsen A, Kraemer B, Blumenthal T. A novel family of C. elegans snRNPs contains proteins associated with trans-splicing. RNA (2007) 13:511–520.

    Mandelboim M, Barth S, Biton M, Liang XH, Michaeli S. Silencing of Sm proteins in Trypanosoma brucei by RNA interference captured a novel cytoplasmic intermediate in spliced leader RNA biogenesis. J Biol Chem (2003) 278:51469.

    Martin W, Koonin EV. Introns and the origin of nucleus-cytosol compartmentalization. Nature (2006) 440:41–45.

    Massenet S, Pellizzoni L, Paushkin S, Mattaj IW, Dreyfuss G. The SMN complex is associated with snRNPs throughout their cytoplasmic assembly pathway. Mol Cell Biol (2002) 22:6533–6541.

    Matera AG, Shpargel KB. Pumping RNA: nuclear bodybuilding along the RNP pipeline. Curr Opin Cell Biol (2006) 18:317–324.

    Mattaj IW, Derobertis EM. Nuclear segregation of U2 snRNA requires binding of specific snRNP proteins. Cell (1985) 40:111–118.

    Mayes AE, Verdone L, Legrain P, Beggs JD. Characterization of Sm-like proteins in yeast and their association with U6 snRNA. EMBO J (1999) 18:4321–4331.

    McAllister G, Amara SG, Lerner MR. Tissue-specific expression and cDNA cloning of small nuclear ribonucleoprotein-associated polypeptide N. Proc Natl Acad Sci USA (1988) 85:5296–5300.

    McAllister G, Robyshemkovitz A, Amara SG, Lerner MR. cDNA sequence of the rat U snRNP-associated protein N: description of a potential Sm epitope. EMBO J (1989) 8:1177–1181.

    Meister G, Eggert C, Fischer U. SMN-mediated assembly of RNPs: a complex story. Trends Cell Biol (2002) 12:472–478.

    Mikulecky PJ, Kaw MK, Brescia CC, Takach JC, Sledjeski DD, Feig AL. Escherichia coli Hfq has distinct interaction surfaces for DsrA, rpoS and poly(A) RNAs. Nat Struct Mol Biol (2004) 11:1206–1214.

    Moreland J, Gramada A, Buzko O, Zhang Q, Bourne P. The Molecular Biology Toolkit (MBT): a modular platform for developing molecular visualization applications. BMC Bioinformatics (2005) 6:21.

    Mura C, Cascio D, Sawaya MR, Eisenberg DS. The crystal structure of a heptameric archaeal Sm protein: implications for the eukaryotic snRNP core. Proc Natl Acad Sci USA (2001) 98:5532–5537.

    Mura C, Kozhukhovsky A, Gingery M, Phillips M, Eisenberg D. The oligomerization and ligand-binding properties of Sm-like archaeal proteins (SmAPs). Protein Sci (2003) 12:832–847.

    Mura C, Phillips M, Kozhukhovsky A, Eisenberg D. Structure and assembly of an augmented Sm-like archaeal protein 14-mer. Proc Natl Acad Sci USA (2003) 100:4539–4544.

    Narayanan U, Achsel T, Lührmann R, Matera AG. Coupled in vitro import of U snRNPs and SMN, the spinal muscular atrophy protein. Mol Cell (2004) 16:223–234.

    Nicholls RD, Knepper JL. Genome organization, function and imprinting in Prader-Willi and Angelman syndromes. Annu Rev Genomics Hum Genet (2001) 2:153–175.

    Nilsen TW. Evolutionary origin of SL-addition trans-splicing: still an enigma. Trends Genet (2001) 17:678.

    Palacios I, Hetzer M, Adam SA, Mattaj IW. Nuclear import of U snRNPs requires importin β. EMBO J (1997) 16:6783–6792.

    Palfi Z, Lucke S, Lahm H-W, Lane WS, Kruft V, Bragado-Nilsson E, Séraphin B, Bindereif A. The spliceosomal snRNP core complex of Trypanosoma brucei: cloning and functional analysis reveals seven Sm protein constituents. Proc Natl Acad Sci USA (2000) 97:8967–8972.

    Pellizzoni L, Yong J, Dreyfuss G. Essential role for the SMN complex in the specificity of snRNP assembly. Science (2002) 298:1775–1779.

    Pillai RS, Grimmler M, Meister G, Will CL, Lührmann R, Fischer U, Schümperli D. Unique Sm core structure of U7 snRNPs: assembly by a specialized SMN complex and the role of a new component, Lsm11, in histone RNA processing. Genes Dev (2003) 17:2321–2333.

    Pillai RS, Will CL, Lührmann R, Schümperli D, Muller B. Purified U7 snRNPs lack the Sm proteins D1 and D2 but contain Lsm10, a new 14 kDa Sm D1-like protein. EMBO J (2001) 20:5470–5479.

    Plessel G, Lührmann R, Kastner B. Electron microscopy of assembly intermediates of the snRNP core: morphological similarities between the RNA-free (E.F.G) protein heteromer and the intact snRNP core. J Mol Biol (1997) 265:87–94.

    Pu WT, Krapivinsky GB, Krapivinsky L, Clapham DE. pICln inhibits snRNP biogenesis by binding core spliceosomal proteins. Mol Cell Biol (1999) 19:4113–4120.

    Raker VA, Hartmuth K, Kastner B, Lührmann R. Spliceosomal U snRNP core assembly: Sm proteins assemble onto an Sm site RNA nonanucleotide in a specific and thermodynamically stable manner. Mol Cell Biol (1999) 19:6554–6565.

    Russell AG, Charette JM, Spencer DF, Gray MW. An early evolutionary origin for the minor spliceosome. Nature (2006) 443:863–866.

    Salgado-Garrido J, Bragado-Nilsson E, Kandels-Lewis S, Séraphin B. Sm and Sm-like proteins assemble in two related complexes of deep evolutionary origin. EMBO J (1999) 18:3451–3462.

    Sauter C, Basquin J, Suck D. Sm-like proteins in Eubacteria: the crystal structure of the Hfq protein from Escherichia coli. Nucleic Acids Res (2003) 31:4091–4098.

    Schumacher MA, Pearson RF, Moller T, Valentin-Hansen P, Brennan RG. Structures of the pleiotropic translational regulator Hfq and an Hfq-RNA complex: a bacterial Sm-like protein. EMBO J (2002) 21:3546–3556.

    Schümperli D, Pillai R. The special Sm core structure of the U7 snRNP: far-reaching significance of a small nuclear ribonucleoprotein. Cell Mol Life Sci (2004) 61:2560–2570.

    Séraphin B. Sm and Sm-like proteins belong to a large family: identification of proteins of the U6 as well as the U1, U2, U4 and U5 snRNPs. EMBO J (1995) 14:2089–2098.

    Seto AG, Zaug AJ, Sobel SG, Wolin SL, Cech TR. Saccharomyces cerevisiae telomerase is an Sm small nuclear ribonucleoprotein particle. Nature (1999) 401:177–180.

    Spiller MP, Boon KL, Reijns MAM, Beggs JD. The Lsm2-8 complex determines nuclear localization of the spliceosomal U6 snRNA. Nucleic Acids Res (2007) 35:923–929.

    Stanek D, Neugebauer KM. The Cajal body: a meeting place for spliceosomal snRNPs in the nuclear maze. Chromosoma (2006) 115:343–354.

    Stark H, Dube P, Lührmann R, Kastner B. Arrangement of RNA and proteins in the spliceosomal U1 small nuclear ribonucleoprotein particle. Nature (2001) 409:539–542.

    Stark H, Lührmann R. Cryo-electron microscopy of spliceosomal components. Annu Rev Biophys Biomol Struct (2006) 35:435–457.

    Sun X, Zhulin I, Wartell RM. Predicted structure and phyletic distribution of the RNA-binding protein Hfq. Nucleic Acids Res (2002) 30:3662–3671.

    Tadauchi T, Inada T, Matsumoto K, Irie K. Posttranscriptional regulation of HO expression by the Mkt1-Pbp1 Complex. Mol Cell Biol (2004) 24:3670–3681.

    Teixeira MT, Forstemann K, Gasser SM, Lingner J. Intracellular trafficking of yeast telomerase components. EMBO Rep (2002) 3:652–659.

    Tharun S, He WH, Mayes AE, Lennertz P, Beggs JD, Parker R. Yeast Sm-like proteins function in mRNA decapping and decay. Nature (2000) 404:515–518.

    Thomas JD, Conrad RC, Blumenthal T. The C. elegans trans-spliced leader RNA is bound to Sm and has a trimethylguanosine cap. Cell (1988) 54:533–539.

    Thore S, Mayer C, Sauter C, Weeks S, Suck D. Crystal structures of the Pyrococcus abyssi Sm core and its complex with RNA: common features of RNA binding in Archaea and Eukarya. J Biol Chem (2003) 278:1239–1247.[Abstract/Free Full Text]

    Tkacz ID, Lustig Y, Stern MZ, Biton M, Salmon-Divon M, Das A, Bellofatto V, Michaeli S. Identification of novel snRNA-specific Sm proteins that bind selectively to U2 and U4 snRNAs in Trypanosoma brucei. RNA (2007) 13:30–43.[Abstract/Free Full Text]

    Tomasevic N, Peculis BA. Xenopus LSm proteins bind U8 snoRNA via an internal evolutionarily conserved octamer sequence. Mol Cell Biol (2002) 22:4101–4112.[Abstract/Free Full Text]

    Törö I, Basquin J, Teo-Dreher H, Suck D. Archaeal Sm proteins form heptameric and hexameric complexes: crystal structures of the Sm1 and Sm2 proteins from the hyperthermophile Archaeoglobus fulgidus. J Mol Biol (2002) 320:129–142.[CrossRef][Web of Science][Medline]

    Törö I, Thore S, Mayer C, Basquin J, Séraphin B, Suck D. RNA binding in an Sm core domain: x-ray structure and functional analysis of an archaeal Sm protein complex. EMBO J (2001) 20:2293–2303.[CrossRef][Web of Science][Medline]

    Tritschler F, Eulalio A, Truffault V, Hartmann MD, Helms S, Schmidt S, Coles M, Izaurralde E, Weichenrieder O. A divergent Sm fold in EDC3 proteins mediates DCP1 binding and P-body targeting. Mol Cell Biol (2007) 27:8600–8611.

    Tycowski KT, Kolev NG, Conrad NK, Fok V, Steitz JA. The ever-growing world of small nuclear ribonucleoproteins. In: The RNA world—Gesteland RF, Cech TR, Atkins JF, eds. (2006) Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press. 327–368.

    Urlaub H, Hartmuth K, Kostka S, Grelle G, Lührmann R. A general approach for identification of RNA-protein cross-linking sites within native human spliceosomal small nuclear ribonucleoproteins (snRNPs): analysis of RNA-protein contacts in native U1 and U4/U6.U5 snRNPs. J Biol Chem (2000) 275:41458–41468.

    Urlaub H, Raker VA, Kostka S, Lührmann R. Sm protein-Sm site RNA interactions within the inner ring of the spliceosomal snRNP core structure. EMBO J (2001) 20:187–196.

    Valentin-Hansen P, Eriksen M, Udesen C. The bacterial Sm-like protein Hfq: a key player in RNA transactions. Mol Microbiol (2004) 51:1525–1533.

    Verdone L, Galardi S, Page D, Beggs JD. Lsm proteins promote regeneration of pre-mRNA splicing activity. Curr Biol (2004) 14:1487–1491.

    Vidal VPI, Verdone L, Mayes AE, Beggs JD. Characterization of U6 snRNA-protein interactions. RNA (1999) 5:1470–1481.

    Wang PP, Palfi Z, Preusser C, Lucke S, Lane WS, Kambach C, Bindereif A. Sm core variation in spliceosomal small nuclear ribonucleoproteins from Trypanosoma brucei. EMBO J (2006) 25:4513–4523.

    Will CL, Lührmann R. Spliceosomal UsnRNP biogenesis, structure and function. Curr Opin Cell Biol (2001) 13:290–301.

    Will CL, Lührmann R. Spliceosome structure and function. In: The RNA world—Gesteland RF, Cech TR, Atkins JF, eds. (2006) Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press. 369–400.

    Wilusz CJ, Wilusz J. Eukaryotic Lsm proteins: lessons from bacteria. Nat Struct Mol Biol (2005) 12:1031–1036.

    Xue DH, Rubinson DA, Pannone BK, Yoo CJ, Wolin SL. U snRNP assembly in yeast involves the La protein. EMBO J (2000) 19:1650–1660.

    Yang WH, Yu JH, Gulick T, Bloch KD, Bloch DB. RNA-associated protein 55 (RAP55) localizes to mRNA processing bodies and stress granules. RNA (2006) 12:547–554.

    Yong JS, Pellizzoni L, Dreyfuss G. Sequence-specific interaction of U1 snRNA with the SMN complex. EMBO J (2002) 21:1188–1196.

    Yong JS, Wan LL, Dreyfuss G. Why do cells need an assembly machine for RNA-protein complexes? Trends Cell Biol (2004) 14:226–232.

    Zaric B, Chami M, Remigy H, Engel A, Ballmer-Hofer K, Winkler FK, Kambach C. Reconstitution of two recombinant LSm protein complexes reveals aspects of their architecture, assembly, and function. J Biol Chem (2005) 280:16066–16075.

    Zeiner GM, Foldynova S, Sturm NR, Lukes J, Campbell DA. SmD1 is required for spliced leader RNA biogenesis. Eukaryot Cell (2004) 3:241–244.

    Zhang D, Abovich N, Rosbash M. A biochemical function for the Sm complex. Mol Cell (2001) 7:319–329.

    Zhang H, Hou YB, Miranda L, Campbell DA, Sturm NR, Gaasterland T, Lin SJ. Spliced leader RNA trans-splicing in dinoflagellates. Proc Natl Acad Sci USA (2007) 104:4618–4623.

Accepted for publication July 12, 2008.

