Skip Navigation


MBE Advance Access originally published online on May 23, 2007
Molecular Biology and Evolution 2007 24(8):1832-1842; doi:10.1093/molbev/msm101
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/8/1832    most recent
msm101v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Khan, H.
Right arrow Articles by Archibald, J. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Khan, H.
Right arrow Articles by Archibald, J. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Plastid Genome Sequence of the Cryptophyte Alga Rhodomonas salina CCMP1319: Lateral Transfer of Putative DNA Replication Machinery and a Test of Chromist Plastid Phylogeny

Hameed Khan*, Natalie Parks*, Catherine Kozera{dagger}, Bruce A. Curtis{dagger}, Byron J. Parsons{dagger}, Sharen Bowman{dagger} and John M. Archibald*

* Genome Atlantic and the Canadian Institute for Advanced Research, Program in Evolutionary Biology, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
{dagger} The Atlantic Genome Centre, Halifax, Nova Scotia, Canada

Email: jmarchib{at}dal.ca.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
Cryptophytes are a group of unicellular algae with chlorophyll c–containing plastids derived from the uptake of a secondary (i.e., eukaryotic) endosymbiont. Biochemical and molecular data indicate that cryptophyte plastids are derived from red algae, yet the question of whether or not cryptophytes acquired their red algal plastids independent of those in heterokont, haptophyte, and dinoflagellate algae is of long-standing debate. To better understand the origin and evolution of the cryptophyte plastid, we have sequenced the plastid genome of Rhodomonas salina CCMP1319: at 135,854 bp, it is the largest secondary plastid genome characterized thus far. It also possesses interesting features not seen in the distantly related cryptophyte Guillardia theta or in other red secondary plastids, including pseudogenes, introns, and a bacterial-derived gene for the tau/gamma subunit of DNA polymerase III (dnaX), the first time putative DNA replication machinery has been found encoded in any plastid genome. Phylogenetic analyses indicate that dnaX was acquired by lateral gene transfer (LGT) in an ancestor of Rhodomonas, most likely from a firmicute bacterium. A phylogenomic survey revealed no additional cases of LGT, beyond a noncyanobacterial type rpl36 gene similar to that recently characterized in other cryptophytes and haptophytes. Rigorous concatenated analysis of 45 proteins encoded in 15 complete plastid genomes produced trees in which the heterokont, haptophyte, and cryptophyte (i.e., chromist) plastids were monophyletic, and heterokonts and haptophytes were each other's closest relatives. However, statistical support for chromist monophyly disappears when amino acids are recoded according to their chemical properties in order to minimize the impact of composition bias, and a significant fraction of the concatenate appears consistent with a sister-group relationship between cryptophyte and haptophyte plastids.

Key Words: cryptophytes • plastid • chromists • chromalveolates • secondary endosymbiosis


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
Plastids (chloroplasts) evolved from free-living cyanobacteria within the confines of a nonphotosynthetic host eukaryote (Gray and Doolittle 1982Go). The "primary" endosymbiotic origin of plastids is widely believed to have occurred only once (Moreira, Le Guyader, and Phillippe 2000; Palmer 2003; Keeling 2004; Rodriguez-Ezpeleta et al. 2005GoGoGoGo) and 3 modern-day eukaryotic lineages—red algae, green algae (including land plants), and glaucophytes—harbor plastids stemming directly from this landmark event (Palmer 2003; Keeling 2004GoGo). More recently, the plastids of red and green algae have spread laterally across the eukaryotic tree via "secondary endosymbiosis," i.e., the uptake of a photosynthetic eukaryote by an unrelated nonphotosynthetic host (Delwiche 1999; Archibald and Keeling 2002, 2005; Palmer 2003; Keeling 2004GoGoGoGoGo). Unlike primary plastids, which are surrounded by 2 membranes, secondary plastids are characterized by the presence of 3 or 4 bounding membranes, a feature that complicates the import of nucleus-encoded, plastid-targeted proteins in secondary plastid-containing organisms (Cavalier-Smith 1999; McFadden 1999; Soll and Schleiff 2004GoGoGo). Some of the most abundant and ecologically significant eukaryotic phototrophs on Earth acquired their plastids secondarily, yet many of the details surrounding the pattern and process of secondary endosymbiosis remain unclear.

Determining the number of endosymbioses that gave rise to the known spectrum of red algal–derived plastids has been particularly challenging, in large part due to the tremendous morphological, biochemical, and molecular diversity exhibited by the organisms bearing them. These include heterokonts (e.g., diatoms and kelp), haptophytes, dinoflagellates, and (probably) apicomplexan parasites, the latter group containing a highly derived nonphotosynthetic organelle whose precise origin has been difficult to discern (reviewed by Delwiche 1999; Archibald and Keeling 2002, 2005; Palmer 2003; Keeling 2004GoGoGoGoGo). The cryptophytes are a ubiquitous group of flagellated unicells that also possess secondary plastids of red algal origin. Together with the chlorarachniophytes, an unrelated group of algae with green algal secondary plastids (Ishida, Green, and Cavalier-Smith 1999; Keeling 2004; Gilson et al. 2006; Rogers et al. 2007GoGoGoGo), cryptophytes are unlike other secondary plastid–containing algae in that the primary endosymbiont nucleus—the "nucleomorph"—persists in the remnant cytosol of the engulfed algal cell between the inner and outer pairs of plastid membranes (Gilson, Maier, and McFadden 1997; Archibald 2007GoGo). Nucleomorph (Douglas et al. 2001Go) and plastid (Douglas and Penny 1999Go) genome sequence data from the model cryptophyte Guillardia theta convincingly show a red algal ancestry for the cryptophyte plastid, although the precise relationship between cryptophyte plastids and other red secondary plastids is not known.

Based on the unique membrane topology of their plastids and the shared presence of the photosynthetic pigment chlorophyll c2, Cavalier-Smith (1982, 1986)GoGo placed the cryptophytes, heterokonts, and haptophytes together in the kingdom Chromista, hypothesizing that photosynthesis evolved in these organisms only once as a result of a single endosymbiosis in their common ancestor. More specifically, cryptophytes have been argued to be the deepest branching of the 3 groups, with the haptophytes and heterokonts having diverged from one another more recently (Cavalier-Smith 2003Go). The so-called "chromalveolate" hypothesis goes 1 step further in postulating that the endosymbiosis that gave rise to the chromist plastid occurred even earlier, in a common ancestor these organisms shared with dinoflagellates and apicomplexans (Cavalier-Smith 1999, 2004GoGo). The main rationale behind this idea is to invoke the fewest number of secondary endosymbioses possible, given that each plastid acquisition requires extensive nucleus-to-nucleus gene transfers and the evolution of a protein targeting system (Cavalier-Smith 1999; McFadden 1999GoGo). However, inferring the minimum number of secondary endosymbioses has important implications for how we interpret the evolutionary history of a significant fraction of eukaryotic biodiversity. This is because many dinoflagellates and heterokonts are aplastidic and/or nonphotosynthetic, and the ciliates, which together with dinoflagellates and apicomplexans comprise the alveolates, are an entirely nonphotosynthetic lineage. If correct, the chromalveolate hypothesis demands that all dinoflagellates, heterokonts, and ciliates evolved from plastid-bearing ancestors (for discussion see Bachvaroff, Sanchez Puerta, and Delwiche 2005; Bodyl 2005GoGo). The recently sequenced macronuclear genome of the ciliate Tetrahymena thermophila provided no evidence for a photosynthetic ancestry in ciliates (Eisen et al. 2006), although plastid-associated genes were found in the genome of the aplastidic heterokont pathogen Phytophthora (Tyler et al. 2006Go), suggesting that outright organelle loss is possible.

Early molecular data brought to bear on the origin(s) of chromist plastids failed to provide support for their common ancestry. Phylogenies of RuBisCO and plastid small subunit ribosomal DNA (SSU rDNA) placed cryptophytes, heterokonts, and haptophytes as independent lineages within red algae (e.g., Medlin et al. 1995; Daugbjerg and Andersen 1997; Oliveira and Bhattacharya 2000GoGoGo) and were interpreted as evidence that their plastids were acquired independent of one another. Early nuclear SSU rDNA phylogenies failed to unite the host components of these lineages (Bhattacharya, Helmchen, and Melkonian 1995Go), consistent with separate plastid origins. More recent analyses using larger concatenated datasets have provided evidence both for (Yoon et al. 2002; Rogers et al. 2007GoGo) and against (Martin et al. 2002Go) the hypothesis that chromist plastids are monophyletic. Analysis of endosymbiotic replacements involving nucleus-encoded, plastid-targeted proteins have provided strong evidence in favor of chromist and chromalveolate monophyly (Fast et al. 2001; Harper and Keeling 2003GoGo; Patron, Rogers, and Keeling 2004Go), although the significance of such replacements has been questioned (Bodyl 2005Go).

A consistent trend in molecular phylogenies that do recover chromist monophyly is the basal position of cryptophytes with respect to the other chlorophyll c-containing lineages (Yoon et al. 2002; Bachvaroff, Sanchez Puerta, and Delwiche 2005; Rogers et al. 2007GoGoGo). In contrast, a recent comprehensive survey of plastid genomes revealed an extremely rare lateral gene transfer (LGT) involving a noncyanobacterial rpl36 gene in the plastid genome of cryptophytes and haptophytes but not heterokonts, suggesting that the former 2 lineages could be each other's closest relatives (Rice and Palmer 2006Go). On balance, the evolutionary history of chromist plastids is controversial, and attempts to resolve the issue have been hampered by the limited amount of genome sequence data available from diverse members of the 3 groups. Combined with the considerable evolutionary distance between chromist plastids and those of red algae, such limited taxon sampling has made phylogenetic inferences especially prone to artifact.

Here we present the completely sequenced plastid genome of the cryptophyte Rhodomonas salina CCMP1319 and analyze it in the context of 15 other genomes, including 5 other chromist plastids and 4 red algae. The R. salina genome is unique in containing a laterally transferred dnaX gene encoding the tau/gamma subunit of bacterial DNA polymerase III and, unlike previously sequenced red secondary plastids, contains pseudogenes and introns. Concatenated phylogenies inferred from a dataset of 45 plastid proteins and subsets thereof provide insight into the complex nature of the phylogenetic signal in support of a common ancestry of haptophyte and heterokont plastids to the apparent exclusion of cryptophytes.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
Genome Sequencing, Assembly, and Annotation
Six fosmid library clones with inserts derived from the R. salina plastid genome were identified in a previous study (Khan et al. 2007Go). These were subcloned and sequenced to ~8x coverage using ET terminator chemistry and MegaBace capillary DNA sequencers as described therein. Based on synteny shared with the plastid genome of G. theta (Douglas and Penny 1999Go), exact match and degenerate primers were designed to genomic regions between psaB and rpl13, and long-range PCR was used to amplify the remaining regions of the R. salina genome. High-fidelity Taq polymerase was used, and PCR products were purified using the MinElute Gel Extraction Kit (Qiagen Sciences, Valencia, CA). PCR products were cloned using the Topo TA Cloning Kit (Invitrogen) according to the manufacturer's protocol, and at least 3 independent clones were sequenced per amplicon. Plasmids were isolated using the Fastplasmid Mini Kit (Eppendorf AG, Hamburg, Germany) and sequenced using the CEQ Dye Terminator Cycle Sequencing (DCTS) kit (Beckman Coulter, Inc., Fullerton, CA) and run on Beckman Coulter CEQ8000 capillary DNA sequencers. Sequences were assembled into contigs using Staden (Staden 1996Go). Protein genes were annotated using NCBI ORF-finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.html) and BLASTX and BLASTN searches at NCBI (http://www.ncbi.nlm.nih.gov/). Ribosomal RNA genes were annotated via comparison to previously published rRNA sequences, and tRNA genes were identified using tRNAScan (http://www.genetics.wustl.edu/eddy/tRNAscan-SE/). The circular genome map was constructed using CIRDNA (http://bioweb.pasteur.fr/seqanal/interfaces/cirdna.html). The complete R. salina genome is available from GenBank (EF508371 [GenBank] ).

dnaX Amplification, Cloning, and Sequencing
Cryptophyte cell cultures were obtained from the Provasoli-Guillard National Center for Culture of Marine Phytoplankton (CCMP) and the Roscoff Culture Collection (RCC) and grown under conditions described previously (Khan et al. 2007Go). Based on the R. salina dnaX sequence and the groEL and psb28 genes flanking it, degenerate PCR primers were designed to amplify dnaX-coding regions from additional Rhodomonas species (R. baltica RCC350, R. salina CCMP1170, and Rhodomonas sp. CCMP1178). PCR products were cloned and sequenced as described above.

Phylogenetic Analyses
A concatenated dataset of 45 plastid-encoded protein sequences common among 15 algae and 2 cyanobacteria (Synechocystis sp. PCC6803 and Prochlorococcus marinus subsp. pastoris str. CCMP1986) were aligned using ClustalX (Chenna et al. 2003Go) and MacClade 4.06 (Maddison and Maddison 2003Go). The 45 genes included in the analysis were atpA, atpB, atpE, atpF, atpH, petA, petB, petD, petG, psaA, psaB, psaC, psaJ, psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbN, psbT, rpl2, rpl14, rpl16, rpl20, rpoA, rpoB, rps2, rps3, rps4, rps7, rps8, rps11, rps12, rps14, rps18, rps19, ycf3, ycf4, and psbZ. PhyML 2.4 (Guindon and Gascuel 2003Go) and IQPNNI (Vinh le and Von Haeseler 2004Go) were used to perform ML analysis on individual proteins and on the concatenated set of 9,081 amino acid positions using the cpREV, WAG, and JTT amino acid substitution matrices with a gamma distribution approximated by 4 or 8 categories to model site rate heterogeneity. Statistical support for individual branches was examined by bootstrapping (100 replicates). To reduce systematic errors associated with saturation and homoplasy, the fastest-evolving sites were determined using Tree-Puzzle version 5.2 (Strimmer and von Haeseler 1996Go). Trees were constructed from the concatenated alignment with sites corresponding to rates 6, 7, and 8 removed, recognizing that site rate assignments can vary slightly depending on the tree topology considered. Bayesian analyses were performed using PhyloBayes with the site-heterogeneous CAT model as described in Lartillot, Brinkmann, and Philippe (2007)Go using 4 gamma-distributed rate categories. Invariant sites were removed prior to analysis. We also used Tree-Puzzle to perform a site-by-site likelihood analysis of all 9,081 amino acid positions in the full dataset given 2 tree topologies, one in which cryptophytes and haptophytes were each other's closest relatives, the other where haptophytes and heterokonts shared a common branch. Differences in log likelihood were plotted in order to assess support for the 2 trees. These and other trees were compared to one another using the site likelihoods calculated above and "approximately unbiased" (AU) tests of significance. AU tests were performed using CONSEL 0.1 (Shimodaira and Hasegawa 2001Go) with default scaling and replicate values. Protein sequences were "recoded" according to Hrdy et al. (2004)Go. Hydrophobic (MILV) and aromatic (FYW) amino acids were combined into a single category, and cysteine was coded as "missing data." This allowed use of PAUP (Swofford 2002Go) to perform ML analyses on the recoded alignment using the general-time-reversible (GTR) matrix with 4 rate categories. The frequency of recoded amino acids, as well as the proportion of invariable sites parameter was estimated from the data.

The R. salina plastid-encoded dnaX protein was used as a query to identify and retrieve a diverse set of dnaX and replication factor C proteins from public databases. Sequences were aligned using MacClade 4.06 (Maddison and Maddison 2003Go), manually adjusted, and analyzed using PhyML and IQPNNI as described above. The dnaX-only alignment contained 72 sequences and 240 unambiguously aligned residues. AU tests were performed as described above.


    Results and Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
Rhodomonas salina Plastid Genome Sequence
The complete R. salina plastid genome is 135,854 bp (fig. 1), the largest secondary plastid genome characterized thus far. The overall G+C content is ~34%, and like other chromist and red algal plastid genomes (Kowallik et al. 1995; Douglas and Penny 1999; Ohta et al. 2003; Puerta, Bachvaroff, and Delwiche 2005GoGoGoGo), the R. salina genome possesses highly similar inverted rDNA cistrons with a G+C content of ~50%. The R. salina genome is predicted to encode 183 genes, including rRNAs and 31 tRNA genes. The tRNA (M) CAU (not present in the cryptophyte G. theta) is found in R. salina, and tRNA (S) GGA is substituted with tRNA (S) GCU in G. theta. The ochre termination codon TAA is used in R. salina 79% of the time, with amber (TAG) and opal (TGA) codons being used 18% and 3%, respectively. In 9 cases a valine start codon (GTG) is used rather than methionine (chll, rbcS, rpl23, rpl24, rps8, rps13, ycf27, ORF99, and dnaB). Five instances of overlapping genes are found in R. salina, many of which are also found in other chromist plastid genomes. The psbD-psbC overlap found in R. salina exists in all sequenced chromist genomes, although the amount of overlap varies. Overlaps involving atpD-atpF and rpl4-rpl23 are common to heterokonts and cryptophytes, but not haptophytes. Single nucleotide overlaps between rpl16-rpl29 and orf142-orf146 are present in R. salina, the former being found only in G. theta.


Figure 1
View larger version (34K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Plastid genome map of Rhodomonas salina CCMP1319. Genes present on the outside of the circle are transcribed clockwise. Annotated genes are colored according to the functional categories shown in the inset box.

 
Two very short regions of the R. salina genome could not be sequenced despite repeated attempts using varied sequencing parameters and progressively smaller subcloned fragments as template. These areas presumably correspond to regions forming extensive secondary structure. Using restriction enzyme digestion, the sizes of the gaps were determined to be ~10 bp (between rps10-petF) and ~100 bp (between ftsH25-ycf33) in size. Given their small size and the fact that all the genes present in G. theta (except for hypothetical ORFs) are accounted for in R. salina, it is highly unlikely that they harbor additional genes.

Overall, the R. salina plastid genome shows a high degree of synteny with the previously sequenced genome of the cryptophyte G. theta (Douglas and Penny 1999Go). There are 12 genes present in R. salina that are absent in G. theta, and 8 genes (mostly hypothetical ORFs) are present in G. theta and not in R. salina (supplementary table 1). HlpA, a histone-like protein previously found only in the G. theta plastid genome, is also in R. salina and the red algae Cyanidioschyzon merolae (Ohta et al. 2003Go) and Galdieria sulphuraria, but absent in haptophytes and heterokonts and nucleus-encoded in apicomplexans (Hall et al. 2002; Nierman et al. 2005GoGo; Pain et al. 2005). Additional genes present in cryptophytes but absent in heterokonts and/or haptophytes include cpeB, ilvB, ilvH, and infB (supplementary table 1). Almost all of the genes encoded in the genome of the haptophyte Emiliania huxleyi (Puerta, Bachvaroff, and Delwiche 2005Go) are present in the R. salina and G. theta genomes, while several genes present in heterokont plastids are not found in haptophytes and cryptophytes.

An intriguing feature of the R. salina plastid genome is the presence of 2 group II introns, none of which are present in G. theta (Douglas and Penny 1999Go) or any other chromist plastids (Oudot-Le Secq et al. 2007Go). A previous study identified a putative "twintron" in the groEL gene in another Rhodomonas species (Maier et al. 1995Go), although the R. salina groEL gene presented here lacks an intron. One of the R. salina introns resides within the photosystem gene psbN, while the second intron, located between ycf37 and ycf12 (fig. 1), appears to be degenerate (the evolution of the R. salina introns will be presented in detail elsewhere). Surprisingly, the R. salina plastid genome encodes remnants of 3 subunits of light-independent protocholorophyllide reductase (LIPOR; {Psi}chlL, {Psi}chlN, {Psi}chlB; fig. 1), an enzyme involved in the light-independent synthesis of chlorophyll (Shi and Shi 2006Go). These genes are not found in the G. theta plastid genome nor in any secondary plastids of red algal origin, but are present in the red alga Porphyra purpurea (Reith and Munholland 1995Go), cyanobacteria (Kaneko et al. 1996Go), glaucophytes (Stirewalt 1995), and numerous organisms belonging to the green algal lineage, including land plants (Martin et al. 1998, 2002GoGo) (supplementary table 1). A comparison of the salient features of the R. salina plastid genome compared to those of other chromists and red algae is presented in supplementary table 1.

Lateral Gene Transfer
The most unexpected finding in the R. salina plastid genome is the presence of a gene with strong similarity to dnaX, which encodes the tau/gamma components of bacterial DNA polymerase (Blinkova et al. 1993; Dallmann and McHenry 1995GoGo). Although plastid DNA polymerases have been purified and characterized enzymatically (e.g., Gaikwad, Hop, and Mukherjee 2002Go), the process of plastid DNA replication is very poorly understood, and the enzymes directly involved are invariably nucleus-encoded. The R. salina gene represents the first instance of a putative DNA polymerase enzyme encoded in plastid DNA. Using degenerate PCR primers designed to the R. salina dnaX and to 2 genes flanking it (groEL and psb28; fig. 1), we successfully amplified dnaX from several additional Rhodomonas plastid genomes, including species CCMP1178, one of the deepest branching lineages in the Rhodomonas cluster (Lane et al. 2006Go).

Phylogenetic analysis reveals that the Rhodomonas dnaX genes were acquired by LGT. Analysis of dnaX proteins together with their closest eukaryotic homologs, i.e., components of replication factor C (Waga and Stillman 1998Go), clearly indicates that Rhodomonas dnaX is derived from the former and not the latter (fig. 2A). A more focused analysis of a larger set of dnaX homologs in isolation (fig. 2B) failed to definitively identify the donor of the gene, although in most of our analyses the Rhodomonas sequences branch (with weak to moderate support) with a subset of firmicutes, i.e., the parasitic mycoplasmas and related organisms. Phylogenies of the genes flanking dnaX clearly place R. salina within the chromist/red algal clade (data not shown), indicating that the LGT solely involved dnaX. Significantly, Rhodomonas dnaX appears distantly related to homologs in cyanobacteria, eliminating the (remote) possibility that the gene is an ancestral plastid gene. It is nevertheless intriguing that with most phylogenetic methods the cyanobacterial dnaX homologs branch modestly with the only other eukaryotic sequences in the dnaX cluster, the so-called "STICHEL" proteins of Oryza sativa and Arabidopsis thaliana. These nucleus-encoded, apparently land plant-specific proteins are ~800 amino acids larger than dnaX and are not believed to be involved in DNA replication, but are instead thought to play a role in plant cell morphogenesis (Ilgenfritz et al. 2003Go). Putative nuclear localization signals suggest that STICHEL is targeted to the nucleus, and not plastid-localized (Ilgenfritz et al. 2003Go). AU tests of significance further support the idea that Rhodomonas dnaX is not specifically related to cyanobacterial dnaX or to plant STICHEL proteins; such tests reject alternate trees in which the Rhodomonas sequences are placed sister to cyanobacteria (P < 0.05) and STICHEL (P < 0.05), as well as next to the Chlorobiales/Bacteroidetes clade (P < 0.01). Interestingly, a variety of other positions for Rhodomonas dnaX within the noncyanobacterial/plant portion of the tree (e.g., alternate placements within and between Firmicutes and Proteobacteria) are not significantly worse than the tree shown in figure 2B. It is thus possible that the relationship between the Rhodomonas dnaX sequences and those of mycoplasmas is the result of long-branch attraction or a shared compositional bias. The role of dnaX in the Rhodomonas plastid is not known, but an obvious possibility is that it functions in DNA replication, perhaps interacting with or replacing some of the nucleus-encoded proteins that must be imported into the organelle. Experiments to better understand the distribution of dnaX in cryptophytes are currently underway.


Figure 2
View larger version (29K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— Lateral gene transfer of dnaX in Rhodomonas. (A) PhyML phylogeny of dnaX proteins from 4 Rhodomonas species in the context of a range of bacterial dnaX proteins rooted with eukaryotic replication factor C homologs. Cryptophyte plastid-encoded dnaX homologs are highlighted. (B) Phylogeny constructed from an alignment of 72 dnaX homologs covering the known breadth of bacterial diversity including cyanobacterial sequences. The STICHEL proteins found in Arabidopsis and Oryza are also present (see text). Bootstrap values are shown where >50%. Scale bars indicate inferred number of substitutions per amino acid site.

 
LGT is increasingly recognized as a major factor in plant mitochondrial genome evolution (e.g., Bergthorsson et al. 2003, 2004; Davis and Wurdack 2004GoGoGo), yet LGTs involving plastid genes appear to be exceedingly rare. Indeed, a recent comprehensive survey of 204 plastid-encoded genes by Rice and Palmer (2006)Go revealed only 1 convincing case of LGT in addition to the previously characterized proteobacterial-derived large and small subunits of RuBisCO in red algal plastids and their derivatives (Delwiche and Palmer 1996Go). In this instance, a noncyanobacterial type rpl36 gene was identified in cryptophytes and haptophytes (Rice and Palmer 2006Go); the simplest interpretation is that the gene replaced the canonical cyanobacterial homolog in a common ancestor of cryptophytes and haptophytes after they diverged from heterokonts. The dnaX transfer presented here represents only the third plastid LGT to be documented and the second directly involving cryptophyte algae. The significance of this observation is not clear, although differences in the apparent frequency of mitochondrial and plastid LGTs have been linked to the presence/absence of DNA uptake systems (Rice and Palmer 2006Go). It is possible that cryptophyte plastids and, more generally, those of chromists and red algae are atypical in this regard. While single-gene phylogenies of all analyzable R. salina plastid proteins revealed no additional convincing cases of LGT beyond dnaX and rpl36 (data not shown), the presence of multiple apparently unrelated mobile genetic elements in different genomic contexts in the plastids of Rhodomonas species (Archibald Lab, unpublished) is consistent with the notion of enhanced DNA uptake in cryptophyte plastids.

Concatenated Protein Phylogenies and Chromist Plastid Evolution
While phylogenetic studies have demonstrably shown that cryptophyte, heterokont, and haptophyte plastids are derived from red algae (reviewed by Bhattacharya, Yoon, and Hackett 2003; Palmer 2003; Keeling 2004; Archibald and Keeling 2005GoGoGoGo), single- and multi-gene analyses of plastid proteins have produced conflicting results regarding their origin(s). With complete plastid genome sequences now available from 2 cryptophytes (this study; Douglas and Penny 1999Go), 3 heterokonts (Kowallik 1995; Oudot-Le Secq et al. 2007Go), a haptophyte (Puerta, Bachvaroff, and Delwiche 2005Go) and 4 red algae (Reith 1995; Glöckner, Rosenthal, and Valentin 2000; Ohta et al. 2003; Hagopian et al. 2004GoGoGo), we assembled a concatenate of 45 proteins encoded in 15 plastid genomes as well as homologs from 2 cyanobacteria (9,081 amino acids in total with no missing data). This dataset (and derivatives thereof) was subjected to a battery of phylogenetic analyses aimed at rigorously testing relationships within and between chromists and red algae. Due to the limited coding capacity of their plastid genomes (see Bachvaroff, Sanchez Puerta, and Delwiche 2005Go and references therein), sequences from chlorophyll-c-containing dinoflagellates were not included.

The 45-protein dataset was first analyzed using PhyML (Guindon and Gascuel 2003Go) and IQPNNI (Vinh le and Von Haeseler 2004Go), 2 methods that explore tree space using maximum likelihood (ML), as well as PhyloBayes, a method employing a site-heterogeneous model to account for position-specific characteristics of protein evolution (Lartillot and Philippe 2004; Lartillot, Brinkmann, and Philippe 2007GoGo). All 3 methods recovered a monophyletic chromist assemblage with high support (fig. 3A, Supplementary figs. S1A–C). Heterokonts and haptophytes consistently branched as each other's closest relatives, as has been observed in recent analyses (Yoon et al. 2002; Rogers et al. 2007GoGo), and the chromists branched with the Gracilaria/Porphyra clade of red algae to the exclusion of the cyanidiales (fig. 3A). More generally, the position of the glaucophyte Cyanophora paradoxa relative to green algae/land plants and red algae/chromists varied depending on the method used, branching weakly with green plastids in the PhyloBayes tree. Removal of the fastest-evolving sites (Materials and Methods) had no effect on the resulting tree topologies, although statistical support for chromist monophyly decreased from 93 to 77% with PhyML (fig. 3A). Significantly, the improved chromist taxon sampling used here produced trees largely unaffected by the inclusion of proteins involved in transcription and translation in the full concatenate, unlike previous results (Martin et al. 1998; Hagopian et al. 2004GoGo). Anomalous results were, however, obtained in analyses of transcription and translation proteins on their own. For example, PhyloBayes analysis of a 16-protein transcription/translation-only dataset resulted in a topology in which cryptophytes and the haptophyte Emiliania huxleyi branched as sister taxa with a posterior probability of 0.97 (Supplementary fig. S1N; see below). Exclusion of the cyanidiales, which represent long branches in our trees and have been shown to be problematic in concatenated analyses of nuclear genes (Rodriguez-Ezpeleta et al. 2005Go), did not impact the relative branching order of the 3 chromist groups or support for chromist monophyly (Supplementary figs. S1Q–1T).


Figure 3
View larger version (31K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 3.— Phylogeny of chromist plastids. (A) PhyloBayes tree constructed from a concatenate of 45 proteins (9,081 amino acids). PhyloBayes (PB) posterior probabilities are provided for all nodes, while PB and PhyML (PML) support is provided for significant nodes for all sites and with the fastest-evolving sites removed. (B) Protein maximum likelihood tree constructed using the full 45-protein dataset with amino acids recoded according to their biochemical characteristics (see text). Bootstrap values are provided. Scale bars indicate inferred number of substitutions per amino acid site.

 
We next tested the impact of protein composition bias on our concatenated phylogenies by recoding the amino acids into 4 groups with biochemically similar properties as done by Hrdy et al. (2004)Go, coding cysteine as missing data and combining hydrophobic (MILV) and aromatic (FYW) amino acids as a single category (Rodriguez-Ezpeleta et al. 2007Go). Analysis of the 45-protein recoded dataset produced strikingly different results (fig. 3B) from those described above. While the chromists and red algae still branched together, support for chromist monophyly disintegrated. Cryptophytes and haptophytes branched weakly as sister groups, with the heterokonts branching outside a cluster of these taxa together with Porphyra and Gracilaria. When the fastest-evolving sites were removed from the recoded dataset, chromist monophyly was once again recovered, although support was still weak (Supplementary fig. S1H). A consistent observation in both the standard and recoded analyses was the long branches observed in the heterokont sequences as well as those of the cyanidiales.

The dramatic differences seen in analyses performed with and without amino acid recoding led us to explore the impact of protein composition bias further. Specifically, we calculated GARP/FYMINK ratios (Foster and Hickey 1999Go) for each protein and each taxon and assessed the between-taxon variation for each. The results indicate that a subset of the proteins in our concatenate exhibit significant variation in GARP/FYMINK ratios, with the cyanidiales, heterokonts, and single haptophyte being the outliers in most cases (Supplementary fig. S2A). Photosynthesis- and nonphotosynthesis-related proteins were distributed more or less randomly when proteins were ranked from lowest to highest in terms of standard deviation (SD). We performed analyses on subsets of the concatenate: 9 proteins with SD > 0.4, the remaining 36 proteins, and 21 proteins with SD < 0.2 (Supplementary Figs. S3A–S3L). Interestingly, the proteins with the most inter-taxon variation in GARP/FYMINK ratio usually produced topologies incongruent with the full concatenate, often with cryptophytes and haptophytes as sister taxa and with the heterokonts branching elsewhere in the tree (e.g., Supplementary fig. S2B). PhyloBayes was the only method that still recovered chromist monophyly when the most compositionally varied proteins were analyzed in isolation (Supplementary fig. S3B). This is consistent with the fact that the method appears resistant to systematic artifacts such as long-branch attraction (Lartillot, Brinkmann, and Philippe 2007Go).

Rice and Palmer (2006) recently described the presence of an LGT-derived noncyanobacterial-type rpl36 gene in the plastids of cryptophytes and haptophytes, which was interpreted as evidence for the sisterhood of these 2 groups to the exclusion of heterokonts and alveolates. This relationship is inconsistent with the bulk of the plastid phylogenies presented in this study and elsewhere (Yoon et al. 2002; Hagopian et al. 2004; Rogers et al. 2007GoGoGo). Nevertheless, the sisterhood of cryptophytes and haptophytes is observed in some of our analyses (e.g., fig. 3B, Supplementary fig. S1N) and recent phylogenies of nucleus-encoded proteins have provided support for a specific relationship between these 2 groups (Harper, Waanders, and Keeling 2005; Hackett et al. 2007; Patron, Inagaki, and Keeling 2007GoGoGo). We used AU tests to further assess the relative branching order within and between chromists and red algae using the PhyloBayes tree shown in figure 3A as a reference. From a set of 315 alternate trees, only 3 topologies were not rejected at P < 0.05. Significantly, these include the tree shown in figure 3B and 2 trees in which the red algae G. tenuistitipata and P. purpurea were placed as sister to the cryptophytes and the haptophytes and heterokonts were monophyletic. Our genome-wide single-gene/protein analyses of the R. salina genome revealed that, of the 103 proteins analyzed, a full 38% produced phylogenies in which cryptophytes and haptophytes were sister taxa, consistent with the results of Rice and Palmer (2006)Go. Statistical support for this relationship was highly variable (<20%–100%; average = 48%), as expected from single-gene phylogenies inferred from anciently diverged sequences.

In an effort to shed light on the apparently contradictory scenarios for the evolutionary relationship amongst the 3 chromist lineages, we performed a site-by-site likelihood analysis of all 9,081 amino acids present in the 45-protein concatenate under 2 different topologies, one in which haptophytes and heterokonts were each others’ closest relatives (i.e., the topology shown in fig. 3A), the other where cryptophytes and haptophytes branch together, as predicted by the rpl36 LGT (Rice and Palmer 2006Go). The differences in log likelihood ({Delta}lnL) for each site were then plotted and ordered according to their inferred evolutionary rate (fig. 4). In total, 5,413 sites support a haptophyte-heterokont relationship, while 3,668 sites favor a cryptophyte-haptophyte relationship (fig. 4; the majority of the sites in the alignment [3,370] were invariant with very small {Delta}lnL values and a negligible impact on support for one topology over the other). When considering sites belonging to site rate categories 2–8, 1,994 sites support a haptophyte-cryptophyte relationship, while 3,718 sites support a haptophyte-heterokont relationship. This result is consistent with the full-dataset concatenated phylogenies (figs. 3A, Supplementary figs. S1 and S2) that support a haptophyte-heterokont relationship. The only 2 groups of amino acid sites that support a haptophyte-cryptophyte relationship over heterokonts-haptophytes fall into site rate categories 7 and 8, i.e., the fast-evolving sites. As expected, removal of the fast-evolving sites from the full-concatenated dataset resulted in increased bootstrap support for the monophyly of haptophytes and heterokonts (Supplementary fig. S1), but decreased support for the node uniting chromists. The majority of the sites that support a haptophyte-heterokont relationship in site rate categories 2–8 have {Delta}lnL values between 0 and –0.1 (3,063 sites), the same being the case for the haptophyte+cryptophyte relationship (1,342 sites between 0 and +0.1). Unexpectedly, the total number of sites with more extreme {Delta}lnL values (i.e., > +0.1 or < –0.1) in support of the 2 topologies is approximately equal: 652 sites for haptophytes-cryptophytes, 655 for heterokonts-haptophytes (fig. 4). This indicates that the high statistical support for the haptophyte-heterokont relationship seen in the concatenated phylogenies is the result of the combined signal from a proportionately large number of amino acid positions (3,063) that do not strongly discriminate between the 2 topologies. The total number of sites that strongly support haptophytes+heterokonts versus cryptophytes+haptophytes is very similar.


Figure 4
View larger version (40K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 4.— Site-by-site maximum likelihood analysis of all 9,081 residues in the 45-protein dataset given 2 competing tree topologies. The trees differ only in the relative branching order of cryptophytes, haptophytes and heterokonts. Differences in site log likelihoods given the 2 trees ({Delta}lnL) are plotted and ranked according to the topology they support and their inferred rate of evolution. Only 3 sites in the alignment fell into site rate category 1 and possessed very small {Delta}lnL values: they are therefore included in the set of invariant sites in this graph.

 
As discussed above, 38% of the individual proteins analyzed in this study showed cryptophytes and haptophytes as each other's closest relatives. In order to identify individual proteins in support of the alternative topologies shown in figure 4, we performed a likelihood analysis of each of the 45 proteins analyzed in the full dataset in isolation. Consistent with the results described above, and a similar analysis performed by Rice and Palmer (2006)Go, 28 out of 45 proteins (62%) supported a haptophyte-heterokont relationship (fig. S4). Nine genes/proteins (20%) possessed {Delta}lnL values in the range of +0.033 to –0.05 (atpF, petG, rpl20, rps18, psbN, psbF, psbK, psbL, rps11), indicating that they support neither topology very strongly. With the exception of rpl20, all of the above-mentioned proteins have relatively high GARP/FYMINK ratios (Supplementary fig. S2A), and most are less than 100 amino acids in size. On the one extreme, the photosystem genes psbB, psaA, and petD strongly support haptophytes+heterokonts, as was shown to be the case previously (Yoon et al. 2002Go). On the other, rps14, rpoB, and atpB strongly favor cryptophytes+haptophytes.


    Conclusion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
We have completely sequenced the plastid genome of the cryptophyte alga Rhodomonas salina and identified the first known case of a putative DNA polymerase enzyme encoded in plastid DNA. Phylogenies constructed from a large concatenated set of plastid proteins provide support for the monophyly of chromist plastids and consistently favor a relationship between heterokonts and haptophytes to the exclusion of cryptophytes. However, we have also shown that support for the chromist plastid monophyly and heterokont-haptophyte sisterhood vanishes when the data is recoded to minimize the impact of amino acid composition bias, leaving open the possibility that an as yet undetermined systematic bias is responsible for the strongly supported trees presented herein. We can thus neither confirm nor refute a specific relationship between the plastids of cryptophytes and haptophytes as suggested by the shared presence of an LGT-derived rpl36 gene (Rice and Palmer 2006Go) and recent large-scale analyses of nuclear genes (Hackett et al. 2007; Patron, Inagaki, and Keeling 2007GoGo). Additional plastid genome sequences from diverse haptophytes, heterokonts, and cryptophytes will be required to further break up the long terminal branches that presently characterize chromist plastid phylogenies. It is also important to consider that sequences from the fourth chlorophyll c–containing algal lineage, the dinoflagellates, were not included in our analysis, and complete plastid genomes are currently only available from a small fraction of the known diversity of red algae (Yoon et al. 2006Go). The impact of these missing data on the relative branching order of the 3 chromist lineages and, more generally, support for chromist monophyly is presently unclear. Together with trees inferred from concatenated nuclear gene sequences from chromists, dinoflagellates, and apicomplexans, analysis of a further expanded plastid protein dataset should make it possible to assess the congruence between the host cell and plastid components of these complex organisms and definitively test the hypothesis of a single endosymbiotic origin of plastids in chromist and chromalveolate taxa (Cavalier-Smith 1999Go).


    Suppementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
Supplementary figures and table are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 
We are grateful to Marie-Pierre Oudot-Le Secq and Beverly R. Green for providing access to the T. pseudonana and P. tricornutum plastid genomes before public release and to Jessica Leigh, Andrew Roger, Susan Douglas, and Gabino Sanchez Perez for help with analyses and for comments on the manuscript. Three anonymous reviewers are also thanked for helpful suggestions. This work was supported by Genome Atlantic and an operating grant awarded to J.M.A. from NSERC. J.M.A. is a Scholar of the Canadian Institute for Advanced Research, Program in Evolutionary Biology.


    Footnotes
 
Geoffrey McFadden, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Conclusion
 Suppementary Material
 Acknowledgements
 References
 

    Archibald JM. Nucleomorph genomes: structure, function, origin and evolution. Bioessays (2007) 4:392–402.

    Archibald JM, Keeling PJ. Recycled plastids: a green movement in eukaryotic evolution. Trends Genet (2002) 18:577–584.[CrossRef][Web of Science][Medline]

    Archibald JM, Keeling PJ. On the origin and evolution of plastids. In: Microbial phylogeny and evolution—Saap J, ed. (2005) New York: Oxford University Press. 238–260.

    Bachvaroff TR, Sanchez Puerta MV, Delwiche CF. Chlorophyll c-containing plastid relationships based on analyses of a multigene data set with all four chromalveolate lineages. Mol Biol Evol. (2005) 22:1772–1782.[Abstract/Free Full Text]

    Bergthorsson U, Adams KL, Thomason B, Palmer JD. Widespread horizontal transfer of mitochondrial genes in flowering plants. Nature (2003) 424:197–201.[CrossRef][Medline]

    Bergthorsson U, Richardson AO, Young GJ, Goertzen LR, Palmer JD. Massive horizontal transfer of mitochondrial genes from diverse land plant donors to the basal angiosperm Amborella. Proc Natl Acad Sci USA (2004) 101:17747–17752.[Abstract/Free Full Text]

    Bhattacharya D, Helmchen T, Melkonian M. Molecular evolutionary analyses of nuclear-encoded small subunit ribosomal RNA identify an independent rhizopod lineage containing the Euglyphidae and the Chlorarachniophyta. J Eukaryot Microbiol. (1995) 42:64–68.[Medline]

    Bhattacharya D, Yoon HS, Hackett JD. Photosynthetic eukaryotes unite: endosymbiosis connects the dots. Bioessays (2003) 26:50–60.[CrossRef][Web of Science]

    Blinkova A, Hervas C, Stukenberg PT, Onrust R, O'Donnell ME, Walker JR. The Escherichia coli DNA polymerase III holoenzyme contains both products of the dnaX gene, tau and gamma, but only tau is essential. J Bacteriol (1993) 175:6018–6027.[Abstract/Free Full Text]

    Bodyl A. Do plastid-related characters support the chromalveolate hypothesis? J Phycol (2005) 41:712–719.[CrossRef][Web of Science]

    Cavalier-Smith T. The origins of plastids. Biol J Linn Soc. (1982) 17:289–306.[CrossRef][Web of Science]

    Cavalier-Smith T. The kingdom Chromista: origin and systematics. Progr Phycol Res. (1986) 4:309–347.

    Cavalier-Smith T. Principles of protein and lipid targeting in secondary symbiogenesis: euglenoid, dinoflagellate, and sporozoan plastid origins and the eukaryote family tree. J Eukaryot Microbiol. (1999) 46:347–366.[CrossRef][Web of Science][Medline]

    Cavalier-Smith T. Genomic reduction and evolution of novel genetic membranes and protein-targeting machinery in eukaryote-eukaryote chimaeras (meta-algae). Philos Trans R Soc Lond B Biol Sci. (2003) 358:109–133. discussion 133–104.[Abstract/Free Full Text]

    Cavalier-Smith T. Chromalveolate diversity and cell megaevolution: interplay of membranes, genomes and cytoskeleton. In: Organelles, genomes and eukaryotic evolution—Hirt RP, Horner D, eds. (2004) London: Taylor and Francis. 75–108.

    Chenna R, Sugawara H, Koike T, Lopez R, Gibson TJ, Higgins DG, Thompson JD. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. (2003) 31:3497–3500.[Abstract/Free Full Text]

    Dallmann HG, McHenry CS. DnaX complex of Escherichia coli DNA polymerase III holoenzyme. Physical characterization of the DnaX subunits and complexes. J Biol Chem. (1995) 270:29563–29569.[Abstract/Free Full Text]

    Daugbjerg N, Andersen RA. Phylogenetic analyses of the rbcL sequences from haptophytes and heterokont algae suggest their chloroplasts are unrelated. Mol Biol Evol. (1997) 14:1242–1251.[Abstract]

    Davis CC, Wurdack KJ. Host-to-parasite gene transfer in flowering plants: phylogenetic evidence from Malpighiales. Science (2004) 305:676–678.[Abstract/Free Full Text]

    Delwiche CF. Tracing the thread of plastid diversity through the tapestry of life. Am Nat. (1999) 154(Suppl):S164–S177.[CrossRef][Web of Science][Medline]

    Delwiche CF, Palmer JD. Rampant horizontal transfer and duplication of rubisco genes in eubacteria and plastids. Mol Biol Evol. (1996) 13:873–882.[Abstract]

    Douglas SE, Penny SL. The plastid genome from the cryptomonad alga, Guillardia theta: complete sequence and conserved synteny groups confirm its common ancestry with red algae. J Mol Evol. (1999) 48:236–244.[CrossRef][Web of Science][Medline]

    Douglas SE, Zauner S, Fraunholz M, Beaton M, Penny S, Deng L, Wu X, Reith M, Cavalier-Smith T, Maier UG. The highly reduced genome of an enslaved algal nucleus. Nature (2001) 410:1091–1096.[CrossRef][Medline]

    Eisen JA, Coyne RS, Wu M, et al, (53 co-authors). Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote. PLoS Biol. (2006) 4:e286. (53 co-authors).[CrossRef][Medline]

    Fast NM, Kissinger JC, Roos DS, Keeling PJ. Nuclear-encoded, plastid-targeted genes suggest a single common origin for apicomplexan and dinoflagellate plastids. Mol Biol Evol. (2001) 18:418–426.[Abstract/Free Full Text]

    Foster PG, Hickey DA. Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions. J Mol Evol. (1999) 48:284–290.[CrossRef][Web of Science][Medline]

    Gaikwad A, Hop DV, Mukherjee SK. A 70-kDa chloroplast DNA polymerase from pea (Pisum sativum) that shows high processivity and displays moderate fidelity. Mol Genet Genomics (2002) 267:45–56.[CrossRef][Web of Science][Medline]

    Gilson PR, Maier UG, McFadden GI. Size isn't everything: lessons in genetic miniaturisation from nucleomorphs. Curr Opin Genet Dev. (1997) 7:800–806.[CrossRef][Web of Science][Medline]

    Gilson PR, Su V, Slamovits CH, Reith ME, Keeling PJ, McFadden GI. From the Cover: Complete nucleotide sequence of the chlorarachniophyte nucleomorph: Nature's smallest nucleus. Proc Natl Acad Sci USA (2006) 103:9566–9571.[Abstract/Free Full Text]

    Glöckner G, Rosenthal A, Valentin K. The structure and gene repertoire of an ancient red algal plastid genome. J Mol Evol. (2000) 51:382–390.[Web of Science][Medline]

    Gray MW, Doolittle WF. Has the endosymbiont hypothesis been proven? Microbiol Rev. (1982) 46:1–42.[Free Full Text]

    Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. (2003) 52:696–704.[Abstract/Free Full Text]

    Hackett JD, Yoon HS, Li S, Reyes-Prieto A, Rümmele SE, Bhattacharya D. Phylogenomic analysis supports the monophyly of cryptophytes and haptophytes and the association of ‘Rhizaria’ with chromalveolates. Mol Biol Evol. (2007) doi:10.1093/molbev/msm089.

    Hagopian JC, Reis M, Kitajima JP, Bhattacharya D, de Oliveira MC. Comparative analysis of the complete plastid genome sequence of the red alga Gracilaria tenuistipitata var. liui provides insights into the evolution of rhodoplasts and their relationship to other plastids. J Mol Evol. (2004) 59:464–477.[CrossRef][Web of Science][Medline]

    Hall N, Pain A, Berriman M, et al, (80 co-authors). Sequence of Plasmodium falciparum chromosomes 1, 3–9 and 13. Nature (2002) 419:527–531. (80 co-authors).[CrossRef][Medline]

    Harper JT, Keeling PJ. Nucleus-encoded, plastid-targeted glyceraldehyde-3-phosphate dehydrogenase (GAPDH) indicates a single origin for chromalveolate plastids. Mol Biol Evol. (2003) 20:1730–1735.[Abstract/Free Full Text]

    Harper JT, Waanders E, Keeling PJ. On the monophyly of chromalveolates using a six-protein phylogeny of eukaryotes. Int J Syst Evol Microbiol. (2005) 55:487–496.[Abstract/Free Full Text]

    Hrdy I, Hirt RP, Dolezal P, Bardonova L, Foster PG, Tachezy J, Embley TM. Trichomonas hydrogenosomes contain the NADH dehydrogenase module of mitochondrial complex I. Nature (2004) 432:618–622.[CrossRef][Medline]

    Ilgenfritz H, Bouyer D, Schnittger A, Mathur J, Kirik V, Schwab B, Chua NH, Jurgens G, Hulskamp M. The Arabidopsis STICHEL gene is a regulator of trichome branch number and encodes a novel protein. Plant Physiol (2003) 131:643–655.[Abstract/Free Full Text]

    Ishida K, Green BR, Cavalier-Smith T. Diversification of a chimaeric algal group, the chlorarachniophytes: phylogeny of nuclear and nucleomorph small-subunit rRNA genes. Mol Biol Evol (1999) 16:321–331.[Web of Science]

    Kaneko T, Sato S, Kotani H, et al, (24 co-authors). Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions (supplement). DNA Res. (1996) 3:185–209. (24 co-authors).[CrossRef][Medline]

    Keeling PJ. Diversity and evolutionary history of plastids and their hosts. Am J Bot. (2004) 91:1481–1493.[Abstract/Free Full Text]

    Khan H, Kozera C, Curtis BA, Bussey JT, Theophilou S, Bowman S, Archibald JM. Retrotransposons and tandem repeat sequences in the nuclear genomes of cryptomonad algae. J Mol Evol. (2007) 64:223–236.[CrossRef][Web of Science][Medline]

    Kowallik KV, Stoebe B, Schaffran I, Kroth-Pancic P, Freier U. The chloroplast genome of a chlorophyll a+c-containing alga, Odontella sinensis. Plant Mol Biol Rep (1995) 13:336–342.[CrossRef][Web of Science]

    Kowallik KV, Stoebe B, Schaffran I, Frier U. The chloroplast genome of chlorophyll a+c-containing alga Odontella sinensis. Plant Mol Biol Rep (1995) 13:336–342.[CrossRef][Web of Science]

    Lane CE, Khan H, MacKinnon M, Fong A, Theophilou S, Archibald JM. Insight into the diversity and evolution of the cryptomonad nucleomorph genome. Mol Biol Evol. (2006) 23:856–865.[Abstract/Free Full Text]

    Lartillot N, Brinkmann H, Philippe H. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogenious model. BMC Evol Biol. (2007) 7:S4.[CrossRef][Medline]

    Lartillot N, Philippe H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol Biol Evol. (2004) 21:1095–1109.[Abstract/Free Full Text]

    Maddison W, Maddison D. MacClade (2003) Sunderland, Massachusetts: Sinauer Associates.

    Maier UG, Rensing SA, Igloi GL, Maerz M. Twintrons are not unique to the Euglena chloroplast genome: structure and evolution of a plastome cpn60 gene from a cryptomonad. Mol Gen Genet. (1995) 246:128–131.[CrossRef][Web of Science][Medline]

    Martin W, Rujan T, Richly E, Hansen A, Cornelsen S, Lins T, Leister D, Stoebe B, Hasegawa M, Penny D. Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc Natl Acad Sci USA (2002) 99:12246–12251.[Abstract/Free Full Text]

    Martin W, Stoebe B, Goremykin V, Hansmann S, Hasegawa M, Kowallik KV. Gene transfer to the nucleus and the evolution of chloroplasts. Nature (1998) 393:162–165.[CrossRef][Medline]

    McFadden GI. Plastids and protein targeting. J. Eukaryot. Microbiol. (1999) 46:339–346.[Web of Science][Medline]

    Medlin LK, Cooper A, Hill C, Wrieden S, Wellbrock U. Phylogenetic position of the Chromista plastids based on small subunit rRNA coding regions. Curr Genet. (1995) 28:560–565.[CrossRef][Web of Science][Medline]

    Moreira D, Le Guyader H, Phillippe H. The origin of red algae and the evolution of chloroplasts. Nature (2000) 405:69–72.[CrossRef][Medline]

    Nierman WC, Pain A, Anderson MJ, et al, (98 co-authors). Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature (2005) 438:1151–1156. (98 co-authors).[CrossRef][Medline]

    Ohta N, Matsuzaki M, Misumi O, Miyagishima SY, Nozaki H, Tanaka K, Shin IT, Kohara Y, Kuroiwa T. Complete sequence and analysis of the plastid genome of the unicellular red alga Cyanidioschyzon merolae. DNA Res. (2003) 10:67–77.[Abstract]

    Oliveira MC, Bhattacharya D. Phylogeny of the Bangiophycidae (Rhodophyta) and the secondary endosymbiotic origin of algal plastids. Am J Bot (2000) 87:482–492.[Abstract/Free Full Text]

    Oudot-Le Secq M-P, Grimwood J, Shapiro H, Armbrust EV, Bowler C, Green BR. Chloroplast genomes of the diatoms Phaeodactylum tricornutum and Thalassiosira pseudonana: comparison with other plastid genomes of the red lineage. Mol Genet Genom (2007) 277:427–439.[CrossRef][Web of Science][Medline]

    Pain A, Renauld H, Berriman M, et al, (50 co-authors). Genome of the host-cell transforming parasite Theileria annulata compared with T. parva. Science (2005) 309:131–133. (50 co-authors).[Abstract/Free Full Text]

    Palmer JD. The symbiotic birth and spread of plastids: how many times and whodunnit? J. Phycol (2003) 39:4–11.[CrossRef][Web of Science]

    Patron NJ, Inagaki Y, Keeling PJ. Multiple gene phylogenies support the monophyly of cryptomonad and haptophyte host lineages. Curr Biol. (2007) 17:887–891.[CrossRef][Web of Science][Medline]

    Patron NJ, Rogers MB, Keeling PJ. Gene replacement of fructose-1,6-bisphosphate aldolase supports the hypothesis of a single photosynthetic ancestor of chromalveolates. Eukaryot Cell (2004) 3:1169–1175.[Abstract/Free Full Text]

    Puerta MV, Bachvaroff TR, Delwiche CF. The complete plastid genome sequence of the haptophyte Emiliania huxleyi: a comparison to other plastid genomes. DNA Res. (2005) 12:151–156.[Abstract]

    Reith ME, Munholland J. Complete nucleotide sequence of the Porphyra purpurea chloroplast genome. Plant Mol Bio Rep (1995) 13:333–335.[CrossRef][Web of Science]

    Rice DW, Palmer JD. An exceptional horizontal gene transfer in plastids: gene replacement by a distant bacterial paralog and evidence that haptophyte and cryptophyte plastids are sisters. BMC Biol. (2006) 4:31.[CrossRef][Medline]

    Rodriguez-Ezpeleta N, Brinkmann H, Burey SC, Roure B, Burger G, Loffelhardt W, Bohnert HJ, Philippe H, Lang BF. Monophyly of primary photosynthetic eukaryotes: green plants, red algae, and glaucophytes. Curr Biol. (2005) 15:1325–1330.[CrossRef][Web of Science][Medline]

    Rodriguez-Ezpeleta N, Philippe H, Brinkmann H, Becker B, Melkonian M. Phylogenetic analyses of nuclear, mitochondrial and plastid multi-gene datasets support the placement of Mesostigma in the Streptophyta. Mol Biol Evol. (2007) 24:723–731.[Abstract/Free Full Text]

    Rogers MB, Gilson PR, Su V, McFadden GI, Keeling PJ. The complete chloroplast genome of the chlorarachniophyte Bigelowiella natans: evidence for independent origins of chlorarachniophyte and euglenid secondary endosymbionts. Mol Biol Evol. (2007) 24:54–62.[Abstract/Free Full Text]

    Shi C, Shi X. Characterization of three genes encoding the subunits of light-independent protochlorophyllide reductase in Chlorella protothecoides CS-41. Biotechnol Prog (2006) 22:1050–1055.[CrossRef][Medline]

    Shimodaira H, Hasegawa M. CONSEL: for assessing the confidence in of phylogenetic tree selection. Bioinformatics (2001) 17:1246–1247.[Abstract/Free Full Text]

    Soll J, Schleiff E. Protein import into chloroplasts. Nat Rev Mol Cell Biol. (2004) 5:198–208.[CrossRef][Web of Science][Medline]

    Staden R. The Staden sequence analysis package. Molecular Biotechnology (1996) 5:233–241.[Web of Science][Medline]

    Stirewalt V, Michalowski CB, Loffelhardt W, Bohnert HJ, Bryant DA. Nucleotide sequence of the cyanelle genome from Cyanophora paradoxa. Plant Molecular Biology Reporter (1995) 13:327–332.[CrossRef][Web of Science]

    Strimmer K, von Haeseler A. Quartet puzzling: a quartet maximum likelihood method for reconstructing tree topologies. Mol Biol Evol. (1996) 13:964–969.[Web of Science]

    Swofford DL. Phylogenetic analysis using parsimony (*and other methods), version 4.0B10 PPC (2002) Sunderland, MA: Sinauer Associates.

    Tyler BM, Tripathy S, Zhang X, et al, (53 co-authors). Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis. Science (2006) 313:1261–1266. (53 co-authors).[Abstract/Free Full Text]

    Vinh le S, Von Haeseler A. IQPNNI: moving fast through tree space and stopping in time. Mol Biol Evol. (2004) 21:1565–1571.[Abstract/Free Full Text]

    Waga S, Stillman B. The DNA replication fork in eukaryotic cells. Annu Rev Biochem (1998) 67:721–751.[CrossRef][Web of Science][Medline]

    Yoon HS, Hackett JD, Pinto G, Bhattacharya D. The single, ancient origin of chromist plastids. Proc Natl Acad Sci USA (2002) 99:15507–15512.[Abstract/Free Full Text]

    Yoon HS, Müller KM, Sheath RG, Ott FD, Bhattacharya D. Defining the major lineages of red algae (Rhodophyta). J Phycol (2006) 42:482–492.[CrossRef][Web of Science]

Accepted for publication May 17, 2007.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Gen Biol EvolHome page
F. Burki, Y. Inagaki, J. Brate, J. M. Archibald, P. J. Keeling, T. Cavalier-Smith, M. Sakaguchi, T. Hashimoto, A. Horak, S. Kumar, et al.
Large-Scale Phylogenomic Analyses Reveal That Two Enigmatic Protist Lineages, Telonemia and Centroheliozoa, Are Related to Photosynthetic Chromalveolates
Gen Biol Evol, October 19, 2009; 2009(0): 231 - 238.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Turmel, M.-C. Gagnon, C. J. O'Kelly, C. Otis, and C. Lemieux
The Chloroplast Genomes of the Green Algae Pyramimonas, Monomastix, and Pycnococcus Shed New light on the Evolutionary History of Prasinophytes and the Origin of the Secondary Chloroplasts of Euglenids
Mol. Biol. Evol., March 1, 2009; 26(3): 631 - 648.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. Frommolt, S. Werner, H. Paulsen, R. Goss, C. Wilhelm, S. Zauner, U. G. Maier, A. R. Grossman, D. Bhattacharya, and M. Lohr
Ancient Recruitment by Chromists of Green Algal Genes Encoding Enzymes for Carotenoid Biosynthesis
Mol. Biol. Evol., December 1, 2008; 25(12): 2653 - 2667.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Khan and J. M. Archibald
Lateral transfer of introns in the cryptophyte plastid genome
Nucleic Acids Res., May 1, 2008; 36(9): 3043 - 3053.
[Abstract] [Full Text] [PDF]


Home page
Eukaryot CellHome page
A. Fong and J. M. Archibald
Evolutionary Dynamics of Light-Independent Protochlorophyllide Oxidoreductase Genes in the Secondary Plastids of Cryptophyte Algae
Eukaryot. Cell, March 1, 2008; 7(3): 550 - 553.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/8/1832    most recent
msm101v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Khan, H.
Right arrow Articles by Archibald, J. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Khan, H.
Right arrow Articles by Archibald, J. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?