MBE Advance Access originally published online on April 27, 2005
Molecular Biology and Evolution 2005 22(7):1539-1542; doi:10.1093/molbev/msi151
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Letter |
Reciprocal Fusions of Two Genes in the Formaldehyde Detoxification Pathway in Ciliates and Diatoms
Department of Ecology and Evolutionary Biology, Princeton University
E-mail: nick{at}genome.stanford.edu.
| Abstract |
|---|
|
|
|---|
During the course of a pilot genome project for the ciliate Oxytricha trifallax, we discovered a fusion gene never before described in any taxa. This gene, FSF1, encodes a putative fusion protein comprising an entire formaldehyde dehydrogenase (FALDH) homolog at one end and an S-formylglutathione hydrolase (SFGH) homolog at the other, two proteins that catalyze serial steps in the formaldehyde detoxification pathway. We confirmed the presence of the Oxytricha fusion gene in vivo and detected transcripts of the full-length fusion gene. A survey of other large-scale sequencing projects revealed a similar fusion protein in a distantly related ciliate, Tetrahymena thermophila, and a possible fusion of these two genes in the diatoms Phaeodactylum tricornutum and Thalassiosira pseudonana, but in the reverse order, with the SFGH domain encoded upstream of the FALDH domain. Orthologs of these fusion proteins may be widespread within the ciliates and diatoms.
Key Words: fusion gene formaldehyde dehydrogenase S-formylglutathione hydrolase alcohol dehydrogenase III esterase D bikont phylogeny
| Introduction |
|---|
|
|
|---|
Development of the ciliate macronucleus involves widespread chromosome breakage followed by telomere addition (reviewed in Prescott 1994
|
|
FALDH and SFGH, the two putative proteins that these gene fusions merge, are both members of evolutionarily ancient protein families, with homologs of each protein present in a wide variety of prokaryotes and eukaryotes. While enzymes in both families are known to act on a broad range of substrates, much research has been devoted to their mutual involvement in the detoxification of intracellular formaldehyde (Harms et al. 1996
While no naturally occurring fusion of the FALDH and SFGH genes has been described prior to this report, a number of other protein-coding gene fusions have been observed in both prokaryotes (Suhre and Claverie 2004
) and eukaryotes. Fusions of genes involved in pyrimidine production and modification have recently been used to aid in studies of eukaryote evolution. The first three enzymes of the six-step pyrimidine biosynthetic pathway are fused at the genetic level in unikonts (animals, fungi, and amoebozoans) and exist as a fusion protein in these species (Nara, Hshimoto, and Aoki 2000
). The fifth and sixth genes in the same pathway, which code for orotate phosphoribosyltransferase (OPRT) and orotidine-5'-monophosphate decarboxylase (OMPDC), have fused into a separate multidomain protein (OPRT-OMPDC) in many eukaryotes (Nara, Hshimoto, and Aoki 2000
). The fusion of these two genes appears to have occurred independently in trypanosomatids, where OPRT comprises the C-terminal half of the protein (OMPDC-OPRT). These combinatorial fusions are akin to the reciprocal arrangements we report here for the FALDH and SFGH genes of ciliates and diatoms. While in both cases these arrangements most likely indicate independent fusion of coexpressed, functionally related proteins, it is possible that the constituent domains may have swapped positions following their initial fusions. Further analysis at the base of the ciliate and diatom trees may help determine if the diatom or ciliate genes fused independently or rearranged in one or the other lineage.
In later steps of pyrimidine synthesis, thymidylate synthase (TS) catalyzes the methylation of deoxyuridine monophosphate to form deoxythymidine monophosphate and dihydrofolate reductase (DHFR) catalyzes the reduction of 7,8-dihydrofolate, a by-product of the methylation reaction (Myllykallio et al. 2003
). The TS and DHFR genes are transcribed separately in unikonts and prokaryotes but have fused to encode a DHFR-TS protein in bikonts (plants and many protist species, including ciliates and diatoms). This gene fusion has provided evidence that bikonts form a single clade, which diverged early in eukaryote evolution (Stechmann and Cavalier-Smith 2002
; Stechmann and Cavalier-Smith 2003
).
With the help of the above fusion genes, the root of the eukaryotic tree has recently been suggested to be between unikonts and bikonts (reviewed in Baldauf 2003
). However, determining the evolutionary relationships among the many clades within these two divisions still remains a major challenge for evolutionary biologists. A thorough investigation of FALDH-SFGH and SFGH-FALDH gene fusions in a variety of protists may help define the origins of two major bikont clades.
| Methods |
|---|
|
|
|---|
We amplified and sequenced the FSF1 gene from O. trifallax (S. histriomuscorum) DNA (Chang et al. 2004
Searches of the Thalassiosira pseudonana genome were performed using the Joint Genome Institute T. pseudonana Blast server at http://aluminum.jgi-psf.org/prod/bin/runBlast.pl?db=thaps1. The T. pseudonana SFGH-FALDH fusion (SFF1) gene is located between base pairs 60683 and 63183 of scaffold 8 (Release Version 1) (Armbrust et al. 2004
). Online database searches using the T. pseudonana SFF1 gene performed using the National Center for Biotechnology Information Blast server (Altschul et al. 1997
) identified two overlapping EST clones from Phaeodactylum tricornutum. These sequences are listed in GenBank under accession numbers CD378851 and CD382924 (Scala et al. 2002
).
| Supplementary Material |
|---|
|
|
|---|
The sequences of the genes and peptides described in this paper are available at Molecular Biology and Evolution online (www.mbe.oupjournals.org).
| Acknowledgements |
|---|
|
|
|---|
We thank Wei-Jen Chang for his gift of the O. trifallax RNA and Aaron Turkewitz for assistance with the T. thermophila EST clone. Preliminary genomic sequence data for T. thermophila were obtained from The Institute for Genomic Research Web site at http://www.tigr.org. Thalassiosira pseudonana genome sequence data were produced by the U.S. Department of Energy Joint Genome Institute, http://www.jgi.doe.gov. This work was supported by National Institute of General Medical Sciences Grant GM59708 and National Science Foundation Grant EIA0121422 to L.F.L.
| Footnotes |
|---|
1 Present address: Department of Genetics, Stanford University School of Medicine.
Geoffrey McFadden, Associate Editor
| References |
|---|
|
|
|---|
Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:33893402.
Armbrust, E. V., J. A. Berges, C. Bowler et al. (42 co-authors). 2004. The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism. Science 306:7986.
Baldauf, S. L. 2003. The deep roots of eukaryotes. Science 300:17031706.
Blattner, F. R., G. Plunkett III, C. A. Bloch et al. (14 co-authors). 1997. The complete genome sequence of Escherichia coli K-12. Science 277:14531474.
Cavalcanti, A. R., N. A. Stover, L. Orecchia, T. G. Doak, and L. F. Landweber. 2004. Coding properties of Oxytricha trifallax (Sterkiella histriomuscorum) macronuclear chromosomes: analysis of a pilot genome project. Chromosoma 113:6976.[ISI][Medline]
Chang, W. J., N. A. Stover, V. M. Addis, and L. F. Landweber. 2004. A micronuclear locus containing three protein-coding genes remains linked during macronuclear development in the spirotrichous ciliate Holosticha. Protist 155:245255.[Medline]
Doak, T. G., A. R. Cavalcanti, N. A. Stover, D. M. Dunn, R. Weiss, G. Herrick, and L. F. Landweber. 2003. Sequencing the Oxytricha trifallax macronuclear genome: a pilot project. Trends Genet. 19:603607.[CrossRef][ISI][Medline]
Harms, N., J. Ras, W. N. Reijnders, R. J. van Spanning, and A. H. Stouthamer. 1996. S-Formylglutathione hydrolase of Paracoccus denitrificans is homologous to human esterase D: a universal pathway for formaldehyde detoxification? J. Bacteriol. 178:62966299.
Holmquist, B., and B. L. Vallee. 1991. Human liver class III alcohol and glutathione dependent formaldehyde dehydrogenase are the same enzyme. Biochem. Biophys. Res. Commun. 178:13711377.[CrossRef][Medline]
Jahn, C. L., and L. A. Klobutcher. 2002. Genome remodeling in ciliated protozoa. Annu. Rev. Microbiol. 56:48.
Marchler-Bauer, A., J. B. Anderson, C. DeWeese-Scott et al. (24 co-authors). 2003. CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res. 31:383387.
Myllykallio, H., D. Leduc, J. Filee, and U. Liebl. 2003. Life without dihydrofolate reductase FolA. Trends Microbiol. 11:220223.[ISI][Medline]
Nara, T., T. Hshimoto, and T. Aoki. 2000. Evolutionary implications of the mosaic pyrimidine-biosynthetic pathway in eukaryotes. Gene 257:209222.[CrossRef][ISI][Medline]
Prescott, D. M. 1994. The DNA of ciliated protozoa. Microbiol. Rev. 58:233267.
Prescott, D. M., J. D. Prescott, and R. M. Prescott. 2002. Coding properties of macronuclear DNA molecules in Sterkiella nova (Oxytricha nova). Protist 153:7177.[Medline]
Riley, J. L., and L. A. Katz. 2001. Widespread distribution of extensive chromosomal fragmentation in ciliates. Mol. Biol. Evol. 18:13721377.
Scala, S., N. Carels, A. Falciatore, M. L. Chiusano, and C. Bowler. 2002. Genome properties of the diatom Phaeodactylum tricornutum. Plant Physiol. 129:9931002.
Shaw, W. H., T. Arioli, and J. Plazinski. 1998. Cloning and sequencing of a S-formylglutathione hydrolase (FGH) gene from the cyanobacterium Anabaena azollae (Accession No AF035558) (PGR98-024). Plant Physiol. 116:868.
Stechmann, A., and T. Cavalier-Smith. 2002. Rooting the eukaryote tree by using a derived gene fusion. Science 297:8991.
. 2003. The root of the eukaryote tree pinpointed. Curr. Biol. 13:R665R666.[CrossRef][ISI][Medline]
Suhre, K., and J. M. Claverie. 2004. FusionDB: a database for in-depth analysis of prokaryotic gene fusion events. Nucleic Acids Res. 32:D273D276.
Tatusova, T. A., and T. L. Madden. 1999. BLAST 2 sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol. Lett. 174:247250.[CrossRef][ISI][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. F. Gonzalez, M. Proudfoot, G. Brown, Y. Korniyenko, H. Mori, A. V. Savchenko, and A. F. Yakunin Molecular Basis of Formaldehyde Detoxification: CHARACTERIZATION OF TWO S-FORMYLGLUTATHIONE HYDROLASES FROM ESCHERICHIA COLI, FrmB AND YeiG J. Biol. Chem., May 19, 2006; 281(20): 14514 - 14522. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Stover, C. J. Krieger, G. Binkley, Q. Dong, D. G. Fisk, R. Nash, A. Sethuraman, S. Weng, and J. M. Cherry Tetrahymena Genome Database (TGD): a new genomic resource for Tetrahymena thermophila research Nucleic Acids Res., January 1, 2006; 34(suppl_1): D500 - D503. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



