MBE Advance Access originally published online on October 20, 2006
Molecular Biology and Evolution 2007 24(1):13-18; doi:10.1093/molbev/msl149
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Letters |
A Comparative Analysis of numt Evolution in Human and Chimpanzee

* Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel
Department of Biology and Biochemistry, University of Houston
E-mail: dgraur{at}uh.edu.
| Abstract |
|---|
|
|
|---|
Mitochondrial DNA sequences are frequently transferred into the nuclear genome, giving rise to numts (nuclear DNA sequences of mitochondrial origin). So far, the evolutionary history of numts has largely been studied by using single genomes. Here, we present the first attempt to study numt evolution in a comparative manner by using a pairwise genomic alignment. The total number of numts was estimated to be 452 in human and 469 in chimpanzee. numts that were found in both genomes at identical loci were deemed to be orthologous; 391 numts (>80%) were classified as such. The preponderance of orthologous numts is due to the very short divergence time between the 2 hominoids. The rest of numts were deemed to be nonorthologous. Nonorthologous numts were subdivided into 1) ancestral numts that have lost an ortholog in one species through deletion (12 in human and 11 in chimpanzee), 2) new numts acquired by the insertion of a mitochondrial sequence after the divergence of the 2 species (34 in human and 46 in chimpanzee), and 3) paralogous numts created by the tandem duplication of a preexisting numt (2 in human). This approach also enabled us to reconstruct the numt repertoire in the common ancestor of humans and chimpanzees (409 numts). Our comparative approach is also useful in identifying the exact boundaries of numts.
Key Words: numts comparative evolution promiscuous DNA human genome chimpanzee genome mitochondrial DNA genome evolution pseudogenes
Mitochondrial DNA sequences are frequently transferred into the nuclear genome, giving rise to numts (nuclear DNA sequences of mitochondrial origin, Lopez et al. 1994
). numts have been described in more than 80 species (Bensasson et al. 2001
). For most species, the estimate of numt content and abundance is still incomplete. However, with fully sequenced genomes, it is possible to obtain an accurate estimate of numt abundance (Richly and Leister 2004
). There is no correlation between the fraction of noncoding DNA and numt abundance (Richly and Leister 2004
). The reason for the variation in numt abundance among genomes is not known. Conceptually, the differences might be due to 1) different rates of numt insertion, 2) different rates of numt deletion, and 3) different rates of numt postinsertional duplication.
All mammalian numt studied to date were found to be functionless, and it is thought that they became pseudogenized on arrival into the nucleus because of the differences between the nuclear and mitochondrial genetic codes (Gellissen and Michaelis 1987
; Perna and Kocher 1996
). In yeast, numts are transferred under natural conditions during the repair of double-strand breaks (Ricchetti et al. 1999
), and it was suggested that this is the cause for the ongoing colonization of different genomes by numts. The continuing process of numt integration into the nuclear genome is evidenced by the finding of numts that have been inserted into the human genome after the human–chimpanzee divergence (Ricchetti et al. 2004
). Some of these numts are variable with respect to genomic presence or absence, indicating that they have only arisen recently in the human population. Transposition of numts into genes has also been associated with human diseases (Willett-Brozick et al. 2001
; Turner et al. 2003
; Goldin et al. 2004
).
From human genome data, different estimates of the number of numts have been put forward in the literature (Mourier et al. 2001
; Tourmen et al. 2002
; Woischnik and Moraes 2002
; Bensasson et al. 2003
; Richly and Leister 2004
). Additionally, phylogenetic methods have been suggested for dating the insertion of numts into the nuclear genome (Mourier et al. 2001
; Woischnik and Moraes 2002
). Initial results indicated a fairly rapid process of numt insertion, however, some studies ignored the possibility of postinsertional nuclear duplication (e.g., Bensasson et al. 2000
) resulting in overestimation of numt insertion rates. Hazkani-Covo et al. (2003)
suggested a methodology for dating the insertion of numts into the nuclear genome by using a single nuclear genome sequence and a mitochondrial phylogenetic tree. This methodology had the advantage of being able to detect numt duplication events. We discovered that the rate of numt insertion on the branch leading to humans was much lower than previously reported (Mourier et al. 2001
; Woischnik and Moraes 2002
). Most numts turned out to be paralogs of preexisting numts, rather than new insertions.
Two numts are defined as orthologous if they are derived from a speciation event, but as paralogous if they are derived from a duplication event. So far, the evolutionary history of numts has largely been studied by means of paralogous comparisons within single genomes (Mourier et al. 2001
; Woischnik and Moraes 2002
; Hazkani-Covo et al. 2003
). The availability of closely related completely sequenced genomes has enabled us to use comparative methods to study directly orthologous numt evolution. We note that by using the methodology of Hazkani-Covo et al. (2003)
, the existence of orthologous numt in species other than humans was inferred indirectly. That inference, however, yielded a testable prediction. Thus, for example, a numt that was inferred to have been inserted in the common ancestor of human and chimpanzee should possess orthologs in both species. However, this prediction could be wrong if the mitochondrial phylogenetic tree is not the true tree. In addition, this methodology is only applicable to long numts that have sufficient phylogenetic signal. With 2 or more genomes, the presence of orthologous numts can be inferred directly, even when the numts are short.
In the following, we suggest a protocol based on genome alignment to estimate the number of numts in closely related species. We apply this approach to the genomes of human (Lander et al. 2001
) and chimpanzee (Pan troglodytes; Mikkelsen et al. 2005
), and use the alignment to identify evolutionary events that may have affected numt composition in each genome, as well as to reconstruct the numt makeup in the common ancestor of human and chimpanzee.
Because there are no hot spots for numt insertion (Zischler 2000
), the presence of a numt at a particular locus in both genomes was taken to imply orthology (fig. 1). Nonorthologous numts that are present in only one genome are further classified into insertions, partial or total deletions, or tandem duplications (fig. 1). Each such event can take place in either lineage. Nonorthologous numts are identified by a gap in the alignment. The distinction between insertions and deletions is based on the fact that there exists no known mechanism for the precise excision of numts. Thus, if the gap coincides precisely with the boundaries of the numt, an insertion is inferred. If the gap is smaller or larger than the numt in the other genome, we infer the occurrence of a partial or total deletion, respectively. Tandem numt duplications are characterized by adjacent homologous numts and a gap coinciding perfectly with the boundaries of the homolog from the other species. The assumptions used for numt classifications here were also used in PCR-based numt recognition (e.g., Lopez et al. 1994
; Zischler et al. 1998
; Herrnstadt et al. 1999
).
|
Our analyses were based on genomic sequences and annotations from the University of California at Santa Cruz (Karolchik et al. 2004
|
We found a similar number of numts in both genomes: 452 numts in human and 469 numts in chimpanzee (table 1). The total number of numts in the 2 genomes was found to be similar to previous estimates in the literature. Unsurprisingly, because of the short time that has passed since the divergence of the 2 hominoids, 391 numts (87% in human and 84% in chimpanzee) were classified as orthologous, that is, were inserted into the nuclear genome before the divergence between the 2 lineages (table S1 in Supplementary Material online).
|
We identified 46 previously undescribed postspeciation numts in the chimpanzee. These ranged in size between 37 and 3,076 bp. In addition, we identified 34 numts in human. Our study, thus, increases the number of known human-specific numts (Ricchetti et al. 2004
|
From among the postspeciation numts, only 2 cases of tandem duplication were found (both in the human genome). In the first case, an internal segment of 30 bp within a numt located on chromosome 10 was duplicated once. The second case, in chromosome 12, includes 18 tandem duplications of a 47-bp sequence (fig. 3).
|
The number of events in which numts were deleted from the genome is fairly similar between the 2 species. There are 12 deletion events in human, of which 11 are total deletions and 1 is a partial one. In chimpanzee, there are 11 deletion events, of which 7 are total deletions and 4 are partial. As far as the total deletions are concerned, one can distinguish between 2 separate groups: most of the numts seem to have been deleted from the genome as part of a much larger segment. However, in a few cases, the numt deletion included only a limited flanking region.
The number of nonorthologous numts is not large enough to be able to detect differences in numt evolutionary dynamics (insertion, deletion, or tandem duplication) between the 2 lineages. Still, we are now able to reconstruct the numts constitution in the common ancestor of the 2 hominoids. The number of numts in the common ancestor of human and chimpanzee is estimated at 409. This number includes 391 numts that are still found in the 2 genomes, and a total of 18 numts that were lost from 1 of the 2 genomes. Given the very low rate of numt deletion, the possibility that a numt has been lost in both genomes seems negligible.
We suggest that in comparison to single genome analyses, our methodology resulted not only in a more accurate estimate of the number of numts but also in a more precise identification of their boundaries. First, this protocol distinguishes between orthologous and nonorthologous numts. Second, by using genome alignment, we identified orthologous numts that escaped detection by the usual Blasting of mitochondrial sequences against the nuclear genome. In 145 out of 391 cases, numts were identified in only one of the genomes when the Blast analysis was used. However, in the majority of cases, alignment of those numts to the corresponding fragment in the second genome revealed a cryptic or quasi-cryptic ortholog. In 15 cases, the existence of orthologous numts in chimpanzee was inferred on the basis of a small stretch of Ns similar in size to the human numt in the homologous position. Finally, our protocol enables a more precise identification of the genomic coordinates of numts. The comparative method allows concatenation of fragments that may otherwise be identified as independent numts.
| Supplementary Material |
|---|
|
|
|---|
Supplementary data and tables are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank Shay Covo and Tal Dagan for their help. This work was supported in part by a grant (DBI-0543342) from the National Science Foundation.
| Footnotes |
|---|
1 Present address: National Evolutionary Synthesis Center, Durham, North Carolina, USA
William Martin, Associate Editor
| References |
|---|
|
|
|---|
Bensasson D, Feldman MW, Petrov DA. (2003) Rates of DNA duplication and mitochondrial DNA insertion in the human genome. J Mol Evol 57:343–354.[CrossRef][ISI][Medline]
Bensasson D, Zhang D, Hartl DL, Hewitt GM. (2001) Mitochondrial pseudogenes: evolution's misplaced witnesses. Trends Ecol Evol 16:314–321.[CrossRef][Medline]
Bensasson D, Zhang DX, Hewitt GM. (2000) Frequent assimilation of mitochondrial DNA by grasshopper nuclear genomes. Mol Biol Evol 17:406–415.
Gellissen G and Michaelis G. (1987) Gene transfer. Mitochondria to nucleus. Ann N Y Acad Sci 503:391–401.[ISI][Medline]
Goldin E, Stahl S, Cooney AM, Kaneski CR, Gupta S, Brady RO, Ellis JR, Schiffmann R. (2004) Transfer of a mitochondrial DNA fragment to MCOLN1 causes an inherited case of mucolipidosis IV. Hum Mutat 24:460–465.[CrossRef][ISI][Medline]
Goodman M, Porter CA, Czelusniak J, Page SL, Schneider H, Shoshani J, Gunnell G, Groves CP. (1998) Toward a phylogenetic classification of primates based on DNA evidence complemented by fossil evidence. Mol Phylogenet Evol 9:585–598.[CrossRef][ISI][Medline]
Hazkani-Covo E, Sorek R, Graur D. (2003) Evolutionary dynamics of large numts in the human genome: rarity of independent insertions and abundance of post-insertion duplications. J Mol Evol 56:169–174.[CrossRef][ISI][Medline]
Herrnstadt C, Clevenger W, Ghosh SS, Anderson C, Fahy E, Miller S, Howell N, Davis RE. (1999) A novel mitochondrial DNA-like sequence in the human nuclear genome. Genomics 60:67–77.[CrossRef][ISI][Medline]
Karolchik D, Hinrichs AS, Furey TS, Roskin KM, Sugnet CW, Haussler D, Kent WJ. (2004) The UCSC Table Browser data retrieval tool. Nucleic Acids Res 32:493–496.[CrossRef]
Lander ES, Linton LM, Birren B, et al. (256 co-authors). (2001) Initial sequencing and analysis of the human genome. Nature 409:860–921.[CrossRef][Medline]
Lopez JV, Yuhki N, Masuda R, Modi W, O'Brien SJ. (1994) Numt, a recent transfer and tandem amplification of mitochondrial DNA to the nuclear genome of the domestic cat. J Mol Evol 39:174–190.[ISI][Medline]
Mikkelsen TS, Hillier LW, Eichler EE, et al. (67 co-authors). (2005) Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437:69–87.[CrossRef][Medline]
Mourier T, Hansen AJ, Willerslev E, Arctander P. (2001) The human genome project reveals a continuous transfer of large mitochondrial fragments to the nucleus. Mol Biol Evol 18:1833–1837.
Perna NT and Kocher TD. (1996) Mitochondrial DNA: molecular fossils in the nucleus. Curr Biol 6:128–129.[CrossRef][ISI][Medline]
Ricchetti M, Fairhead C, Dujon B. (1999) Mitochondrial DNA repairs double-strand breaks in yeast chromosomes. Nature 402:96–100.[CrossRef][Medline]
Ricchetti M, Tekaia F, Dujon B. (2004) Continued colonization of the human genome by mitochondrial DNA. PLoS Biol 2:E273.[CrossRef][Medline]
Richly E and Leister D. (2004) NUMTs in sequenced eukaryotic genomes. Mol Biol Evol 21:1081–1084.
Tourmen Y, Baris O, Dessen P, Jacques C, Malthiery Y, Reynier P. (2002) Structure and chromosomal distribution of human mitochondrial pseudogenes. Genomics 80:71–77.[CrossRef][ISI][Medline]
Turner C, Killoran C, Thomas NS, Rosenberg M, Chuzhanova NA, Johnston J, Kemel Y, Cooper DN, Biesecker LG. (2003) Human genetic disease caused by de novo mitochondrial-nuclear DNA transfer. Hum Genet 112:303–309.[ISI][Medline]
Willett-Brozick JE, Savul SA, Richey LE, Baysal BE. (2001) Germ line insertion of mtDNA at the breakpoint junction of a reciprocal constitutional translocation. Hum Genet 109:216–223.[CrossRef][ISI][Medline]
Woischnik M and Moraes CT. (2002) Pattern of organization of human mitochondrial pseudogenes in the nuclear genome. Genome Res 12:885–893.
Zischler H. (2000) Nuclear integrations of mitochondrial DNA in primates: inference of associated mutational events. Electrophoresis 21:531–536.[CrossRef][ISI][Medline]
Zischler H, Geisert H, Castresana J. (1998) A hominoid-specific nuclear insertion of the mitochondrial D-loop: implications for reconstructing ancestral mitochondrial sequences. Mol Biol Evol 15:463–469.[Abstract]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Fan, Y. Chen, and M. Long Recurrent Tandem Gene Duplication Gave Rise to Functionally Divergent Genes in Drosophila Mol. Biol. Evol., July 1, 2008; 25(7): 1451 - 1458. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Deusch, G. Landan, M. Roettger, N. Gruenheit, K. V. Kowallik, J. F. Allen, W. Martin, and T. Dagan Genes of Cyanobacterial Origin in Plant Nuclear Genomes Point to a Heterocyst-Forming Plastid Ancestor Mol. Biol. Evol., April 1, 2008; 25(4): 748 - 761. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Antunes, J. Pontius, M. J. Ramos, S. J. O'Brien, and W. E. Johnson Mitochondrial Introgressions into the Nuclear Genome of the Domestic Cat J. Hered., July 28, 2007; (2007) esm062v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Behura Analysis of Nuclear Copies of Mitochondrial Sequences in Honeybee (Apis mellifera) Genome Mol. Biol. Evol., July 1, 2007; 24(7): 1492 - 1505. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




