MBE Advance Access originally published online on March 10, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mol. Biol. Evol. 21(6):1081-1084. 2004
DOI: 10.1093/molbev/msh110
© 2004 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038
NUMTs in Sequenced Eukaryotic Genomes
Max-Planck-Institut für Züchtungsforschung, Köln, Germany
E-mail: leister{at}mpiz-koeln.mpg.de.
| Abstract |
|---|
|
|
|---|
Mitochondrial DNA sequences are frequently transferred to the nucleus giving rise to the so-called nuclear mitochondrial DNA (NUMT). Analysis of 13 eukaryotic species with sequenced mitochondrial and nuclear genomes reveals a large interspecific variation of NUMT number and size. Copy number ranges from none or few copies in Anopheles, Caenorhabditis, Plasmodium, Drosophila, and Fugu to more than 500 in human, rice, and Arabidopsis. The average size is between 62 (baker's yeast) and 647 bps (Neurospora), respectively. A correlation between the abundance of NUMTs and the size of the nuclear or the mitochondrial genomes, or of the nuclear gene density, is not evident. Other factors, such as the number and/or stability of mitochondria in the germline, or species-specific mechanisms controlling accumulation/loss of nuclear DNA, might be responsible for the interspecific diversity in NUMT accumulation.
Key Words: duplication gene transfer genome evolution mitochondria NUMT pseudogene
|
|
|---|
In eukaryotes, nuclear DNA exists that is homologous to mitochondrial DNA (mtDNA). These sequences, which originate from the invasion of nuclear DNA by mtDNA, are designated nuclear mtDNA (NUMT) (Lopez et al. 1994). NUMTs exhibit different degrees of homology to their mitochondrial counterparts; are variable in size; evenly distributed within and among chromosomes, and, in cases, are highly rearranged and/or fragmented (Zhang and Hewitt 1996; Ricchetti, Fairhead, and Dujon 1999; Woischnik and Moraes 2002). Available data indicate a predominant descent of NUMTs from nonhomologous recombination of nuclear DNA with mtDNA fragments leaking out of damaged mitochondria (Henze and Martin 2001; Mourier et al. 2001; Woischnik and Moraes 2002). Moreover, it is likely that the accumulation of NUMTs is a continuous evolutionary process (Mourier et al. 2001; Woischnik and Moraes 2002; Bensasson, Feldman, and Petrov 2003). However, the duplication of NUMTs preexisting in the nuclear genome should have also contributed to increase their number (Lopez et al. 1994; Bensasson, Zhang, and Hewitt 2000; Tourmen et al. 2002; Hazkani-Covo, Sorek, and Graur 2003). NUMTs have been so far detected in more than 82 species (Bensasson et al. 2001a): 30 in baker's yeast (Ricchetti, Fairhead, and Dujon 1999) and between 296 and 612 in the human genome (Mourier et al. 2001; Tourmen et al. 2002; Woischnik and Moraes 2002; Bensasson, Feldman, and Petrov 2003). In Caenorhabditis elegans, Plasmodium falciparum, Drosophila melanogaster, and other species they are rare or even absent (Bensasson et al., 2001a). Here, an updated inventory of NUMTs in the sequenced eukaryotic genomes and causes for the specificity of NUMT accumulation are discussed.
Both the mitochondrial and the nuclear genome sequence are known for 13 eukaryotic species. Employing BLASTN searches at a range of different threshold levels allowed us to identify diverged and/or small NUMTs, as well as conserved and/or long ones. The result is that dramatic differences in the content of NUMTs in the different genomes are evident (fig. 1a): at a threshold of 104 they range from less than ten in Fugu, Drosophila, Plasmodium, and Caenorhabditis to more than 500 in human, rice, and Arabidopsis. Between 10 and 100 NUMTs are present in rat, Ciona, Neurospora, and in the yeast species. No NUMTs at all have been detected in Anopheles. For N. crassa only a preliminary estimate is possible because sequence information of its mtDNA is still incomplete; nevertheless, between 11 (threshold <1050) and 22 (104) NUMTs exist in this fungal species (fig. 1a).
|
In human, mouse, and Ciona, most or almost all mtDNA sequences were transferred to the nucleus (table 1), indicating that in principle all mitochondrial sequences are transferable, as argued by Allen (1993). Size distributions of NUMTs in species with more than 10 copies are shown in figure 1b. The longest NUMTs are present in Neurospora, Arabidopsis, and Ciona, whereas the yeast species and rat contain, in average, the smallest ones. When the length of NUMTs is normalized based on the size of the mitochondrial chromosome, Ciona, human and mouse were the species with the longest NUMTs (fig. 1b).
|
Our data extend the previous observations of Bensasson et al. (2001a) concerning the NUMT content across species. The content is highly variable, ranging from 0 in Anopheles to more than 400 kbp in rice (table 1). Relative to the genome size, the two plant species contain the largest fraction of NUMTs (around 0.10% and 0.17% in rice and Arabidopsis, respectively). Organisms of related genera, such as different insect species (Sunnucks and Hales 1996; Bensasson et al. 2001a) or mammals as human, mouse, and rat (this study), differ substantially in their NUMT content (table 1 and fig. 1a). It is known that the fraction of noncoding nuclear DNA varies among eukaryotes, even among related species (Hartl 2000; Petrov 2001). If the frequency of NUMTs in noncoding regions of the genome were similar among species, their number should increase in species with more noncoding nuclear DNA, based on the assumption that transfers of mtDNA fragments into expressed regions of the genome is counterselected. This can explain the abundance of NUMTs in Homo sapiens but not their rarity or absence in species such as Caenorhabditis and Anopheles (table 1). Furthermore, the size of the mitochondrial chromosome does not correlate with NUMT's frequency or size distribution (fig. 1b and table 1).
Then what is the reason behind the variable abundance of NUMTs in different species? Two explanations can be suggested. (1) The frequency of DNA transfer from mitochondria to the nucleus differs between species. The mtDNA escape into the cytoplasm, and ultimately its transfer to the nucleus, can be influenced by the vulnerability of mitochondria to stress and other factors (Bensasson et al. 2001a; Woischnik and Moraes 2002), as well as by the number of mitochondria present in each cellparticularly of the germline. Accordingly, species-specific differences in the formation of the germline and/or the number of mitochondria per cell may account for some of the interspecific differences in NUMT abundance observed. The number of mitochondria per cell could, for instance, explain the low number of NUMTs in Plasmodiuman organism having only one mitochondrion per cell (Divo et al. 1985; Hopkins et al. 1999). Furthermore, the number of somatic cell divisions from zygote to meiosis (and the loss of the nuclear envelope during each division) should influence the frequency of mitochondrion-to-nucleus DNA transfer (Walbot and Evans 2003). This might be the reason for the high NUMT content in the plants rice and Arabidopsis. Also, the efficiency of nuclear import of mtDNA and/or of its integration into the nuclear genome might differ between species. (2) The rate of loss of NUMTs is different among species. The rate and spectrum of DNA loss from the nucleus might shape the accumulation and size pattern of NUMTs. A specific spectrum of DNA loss could favor the deletion of NUMTs while still allowing the accumulation of massive amounts of noncoding DNA elements with different size. This type of DNA loss could lead to genomes with a large fraction of noncoding DNA but only with few NUMTs. Vice versa, a different control on DNA loss would allow more compact genomes to accumulate many NUMTs (such as in Arabidopsis). It is well known that the rate of DNA loss varies substantially for different fragment sizes and among species (Petrov et al. 2000; Bensasson et al. 2001b; Devos, Brown, and Bennetzen 2002), and this could explain the absence of a strict correlation between the abundances of noncoding nuclear DNA and NUMTs.
In conclusion, the causes for the interspecific diversity of NUMTs with respect to both copy number and length distribution remain obscure. The analysis of additional eukaryotic genomes to be completely sequenced in the future, in combination with the experimental analysis of the rates of mtDNA migration to the nucleusparticularly in related species that differ dramatically in their NUMT contentsshould shed light onto the question how and to which extent eukaryotes deal with NUMTs and other pseudogenes in their genomes.
| Materials and Methods |
|---|
|
|
|---|
Sequence Analyses
Full-length mtDNA sequences were retrieved from the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/genomes/static/euk_o.html). Nuclear DNA sequences were obtained from the NCBI (http://www.ncbi.nlm.nih.gov/genomes/static/EG_T.html; Caenorhabditis elegans, Drosophila melanogaster, Anopheles gambiae, Homo sapiens, Mus musculus, Plasmodium falciparum, Rattus norvegicus, Schizosaccharomyces pombe), Munich Information Center for Protein Sequences (http://mips.gsf.de/proj/thal/db/index.html; Arabiopsis thaliana), Joint Genome Institute (http://www.jgi.doe.gov/genomes/index.html; Ciona intestinalis, Fugu rubripes), Saccharomyces Genome Data-base (http://www.yeastgenome.org/; Saccharomyces cerevisiae), Center for Genomics Research (http://www.broad.mit.edu/annotation/fungi/neurospora/; N. crassa), and The Institute for Genomic Research (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/; Oryza sativa).
NCBI-BLASTN (Altschul et al. 1990) was carried out locally with standard settings and thresholds ranging from 104 to <1050. Whole mitochondrial genomes were BLASTed either against draft nuclear genome sequences (human, mouse, rice, and rat) or, in all other cases, against complete genomes.
Numbers for total genome sizes and the amount of intergenic sequences were extracted from the Web pages listed above. Values of nonprotein-coding DNA listed in table 1 were extracted from Taft and Mattick (2003).
Web Site
When additional genome sequences become available, updated versions of figure 1 and table 1 will be made available at: http://www.mpiz-koeln.mpg.de/
leister/mbe_2004.html.
| Acknowledgements |
|---|
|
|
|---|
D.L. is supported by a Heisenberg stipend of the Deutsche Forschungsgemeinschaft (LE 1265/8). We thank Francesco Salamini for valuable comments on the manuscript.
| Footnotes |
|---|
William Martin, Associate Editor
| Literature Cited |
|---|
|
|
|---|
Allen, J. F. 1993. Control of gene expression by redox potential and the requirement for chloroplast and mitochondrial genomes. J. Theor. Biol. 165:609-631.[CrossRef][ISI][Medline]
Altschul, S. F., W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403-410.[CrossRef][ISI][Medline]
Bensasson, D., M. W. Feldman, and D. A. Petrov. 2003. Rates of DNA duplication and mitochondrial DNA insertion in the human genome. J. Mol. Evol. 57:343-354.[CrossRef][ISI][Medline]
Bensasson, D., D. A. Petrov, D. X. Zhang, D. L. Hartl, and G. M. Hewitt. 2001b. Genomic gigantism: DNA loss is slow in mountain grasshoppers. Mol. Biol. Evol. 18:246-253.
Bensasson, D., D. Zhang, D. L. Hartl, and G. M. Hewitt. 2001a. Mitochondrial pseudogenes: evolution's misplaced witnesses. Trends Ecol. Evol. 16:314-321.[CrossRef][Medline]
Bensasson, D., D. X. Zhang, and G. M. Hewitt. 2000. Frequent assimilation of mitochondrial DNA by grasshopper nuclear genomes. Mol. Biol. Evol. 17:406-415.
Devos, K. M., J. K. Brown, and J. L. Bennetzen. 2002. Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res. 12:1075-1079.
Divo, A. A., T. G. Geary, J. B. Jensen, and H. Ginsburg. 1985. The mitochondrion of Plasmodium falciparum visualized by rhodamine 123 fluorescence. J. Protozool. 32:442-446.[Medline]
Hartl, D. L. 2000. Molecular melodies in high and low C. Nat. Rev. Genet. 1:145-149.[ISI][Medline]
Hazkani-Covo, E., R. Sorek, and D. Graur. 2003. Evolutionary dynamics of large numts in the human genome: rarity of independent insertions and abundance of post-insertion duplications. J. Mol. Evol. 56:169-174.[CrossRef][ISI][Medline]
Henze, K., and W. Martin. 2001. How do mitochondrial genes get into the nucleus? Trends Genet. 17:383-387.[CrossRef][ISI][Medline]
Hopkins, J., R. Fowler, S. Krishna, I. Wilson, G. Mitchell, and L. Bannister. 1999. The plastid in Plasmodium falciparum asexual blood stages: a three-dimensional ultrastructural analysis. Protist 150:283-295.[Medline]
Lopez, J. V., N. Yuhki, R. Masuda, W. Modi, and S. J. O'Brien. 1994. Numt, a recent transfer and tandem amplification of mitochondrial DNA to the nuclear genome of the domestic cat. J. Mol. Evol. 39:174-190.[ISI][Medline]
Mourier, T., A. J. Hansen, E. Willerslev, and P. Arctander. 2001. The Human Genome Project reveals a continuous transfer of large mitochondrial fragments to the nucleus. Mol. Biol. Evol. 18:1833-1837.
Petrov, D. A. 2001. Evolution of genome size: new approaches to an old problem. Trends Genet. 17:23-28.[CrossRef][ISI][Medline]
Petrov, D. A., T. A. Sangster, J. S. Johnston, D. L. Hartl, and K. L. Shaw. 2000. Evidence for DNA loss as a determinant of genome size. Science 287:1060-1062.
Ricchetti, M., C. Fairhead, and B. Dujon. 1999. Mitochondrial DNA repairs double-strand breaks in yeast chromosomes. Nature 402:96-100.[CrossRef][Medline]
Sunnucks, P., and D. F. Hales. 1996. Numerous transposed sequences of mitochondrial cytochrome oxidase I-II in aphids of the genus Sitobion (Hemiptera: Aphididae). Mol. Biol. Evol. 13:510-524.[Abstract]
Taft, R. J., and J. S. Mattick. 2003. Increasing biological complexity is positively correlated with the relative genome-wide expansion of non-protein-coding DNA sequences. Genome Biology http://genomebiology.com/2003/5/I/PI.
Tourmen, Y., O. Baris, P. Dessen, C. Jacques, Y. Malthiery, and P. Reynier. 2002. Structure and chromosomal distribution of human mitochondrial pseudogenes. Genomics 80:71-77.[CrossRef][ISI][Medline]
Walbot, V., and M. M. Evans. 2003. Unique features of the plant life cycle and their consequences. Nat. Rev. Genet. 4:369-379.[CrossRef][ISI][Medline]
Woischnik, M., and C. T. Moraes. 2002. Pattern of organization of human mitochondrial pseudogenes in the nuclear genome. Genome Res. 12:885-893.
Zhang, D. X., and G. M. Hewitt. 1996. Nuclear integrations: challenges for mitochondrial DNA markers. Trends Ecol. Evol. 11:247-251.[CrossRef]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. N. Lough, L. M. Roark, A. Kato, T. S. Ream, J. C. Lamb, J. A. Birchler, and K. J. Newton Mitochondrial DNA Transfer to the Nucleus Generates Extensive Insertion Site Variation in Maize Genetics, January 1, 2008; 178(1): 47 - 55. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. U. Pontius, J. C. Mullikin, D. R. Smith, Agencourt Sequencing Team, K. Lindblad-Toh, S. Gnerre, M. Clamp, J. Chang, R. Stephens, B. Neelam, et al. Initial sequence and comparative analysis of the cat genome Genome Res., November 1, 2007; 17(11): 1675 - 1689. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wang, Y.-W. Wu, A. C.-C. Shih, C.-S. Wu, Y.-N. Wang, and S.-M. Chaw Transfer of Chloroplast Genomic DNA to Mitochondrial Genome Occurred At Least 300 MYA Mol. Biol. Evol., September 1, 2007; 24(9): 2040 - 2048. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Antunes, J. Pontius, M. J. Ramos, S. J. O'Brien, and W. E. Johnson Mitochondrial Introgressions into the Nuclear Genome of the Domestic Cat J. Hered., July 28, 2007; (2007) esm062v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Behura Analysis of Nuclear Copies of Mitochondrial Sequences in Honeybee (Apis mellifera) Genome Mol. Biol. Evol., July 1, 2007; 24(7): 1492 - 1505. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pamilo, L. Viljakainen, and A. Vihavainen Exceptionally High Density of NUMTs in the Honeybee Genome Mol. Biol. Evol., June 1, 2007; 24(6): 1340 - 1346. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Rodriguez, J. Albornoz, and A. Dominguez Cytochrome b Pseudogene Originated from a Highly Divergent Mitochondrial Lineage in Genus Rupicapra J. Hered., May 1, 2007; 98(3): 243 - 249. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Hazkani-Covo and D. Graur A Comparative Analysis of numt Evolution in Human and Chimpanzee Mol. Biol. Evol., January 1, 2007; 24(1): 13 - 18. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. E. Schlick, M. I. Jensen-Seaman, K. Orlebeke, A. E. Kwitek, H. J. Jacob, and J. Lazar Sequence analysis of the complete mitochondrial DNA in 10 commonly used inbred rat strains Am J Physiol Cell Physiol, December 1, 2006; 291(6): C1183 - C1192. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Kaplan and M. Linial ProtoBee: Hierarchical classification and annotation of the honey bee proteome Genome Res., November 1, 2006; 16(11): 1431 - 1438. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Welch, M. Z. Darnell, and D. E. McCauley Variable Populations Within Variable Populations: Quantifying Mitochondrial Heteroplasmy in Natural Populations of the Gynodioecious Plant Silene vulgaris Genetics, October 1, 2006; 174(2): 829 - 837. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Simons, M. Pheasant, I. V. Makunin, and J. S. Mattick Transposon-free regions in mammalian genomes Genome Res., February 1, 2006; 16(2): 164 - 172. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Y. Huang, N. Grunheit, N. Ahmadinejad, J. N. Timmis, and W. Martin Mutational Decay and Age of Chloroplast and Mitochondrial Genomes Transferred Recently to Angiosperm Nuclear Chromosomes Plant Physiology, July 1, 2005; 138(3): 1723 - 1733. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Noutsos, E. Richly, and D. Leister Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants Genome Res., May 1, 2005; 15(5): 616 - 628. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pons and A. P. Vogler Complex Pattern of Coalescence and Fast Evolution of a Mitochondrial rRNA Pseudogene in a Recent Radiation of Tiger Beetles Mol. Biol. Evol., April 1, 2005; 22(4): 991 - 1000. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Clifton, P. Minx, C. M.-R. Fauron, M. Gibson, J. O. Allen, H. Sun, M. Thompson, W. B. Barbazuk, S. Kanuganti, C. Tayloe, et al. Sequence and Comparative Analysis of the Maize NB Mitochondrial Genome Plant Physiology, November 1, 2004; 136(3): 3486 - 3503. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Richly and D. Leister NUPTs in Sequenced Eukaryotes and Their Genomic Organization in Relation to NUMTs Mol. Biol. Evol., October 1, 2004; 21(10): 1972 - 1980. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






