Skip Navigation


MBE Advance Access originally published online on June 2, 2006
Molecular Biology and Evolution 2006 23(8):1493-1503; doi:10.1093/molbev/msl027
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
23/8/1493    most recent
msl027v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (8)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kullberg, M.
Right arrow Articles by Janke, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kullberg, M.
Right arrow Articles by Janke, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Article

Housekeeping Genes for Phylogenetic Analysis of Eutherian Relationships

Morgan Kullberg*, Maria A. Nilsson*, Ulfur Arnason*, Eric H. Harley{dagger} and Axel Janke*

* Division of Evolutionary Molecular Systematics, Department of Cell and Organism Biology, University of Lund, Lund, Sweden; and {dagger} Division of Chemical Pathology, Department of Clinical Laboratory Sciences, University of Cape Town, South Africa

E-mail: ulfur.arnason{at}cob.lu.se.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 
The molecular relationship of placental mammals has attracted great interest in recent years. However, 2 crucial and conflicting hypotheses remain, one with respect to the position of the root of the eutherian tree and the other the relationship between the orders Rodentia, Lagomorpha (rabbits, hares), and Primates. Although most mitochondrial (mt) analyses have suggested that rodents have a basal position in the eutherian tree, some nuclear data in combination with mt-rRNA genes have placed the root on the so-called African clade or on a branch that includes this clade and the Xenarthra (e.g., anteater and armadillo). In order to generate a new and independent set of molecular data for phylogenetic analysis, we have established cDNA sequences from different tissues of various mammalian species. With this in mind, we have identified and sequenced 8 housekeeping genes with moderately fast rate of evolution from 22 placental mammals, representing 11 orders. In order to determine the root of the eutherian tree, the same genes were also sequenced for 3 marsupial species, which were used as outgroup. Inconsistent with the analyses of nuclear + mt-rRNA gene data, the current data set did not favor a basal position of the African clade or Xenarthra in the eutherian tree. Similarly, by joining rodents and lagomorphs on the same basal branch (Glires hypothesis), the data set is also inconsistent with the tree commonly favored in mtDNA analyses. The analyses of the currently established sequences have helped examination of problematic parts in the eutherian tree at the same time as they caution against suggestions that have claimed that basal eutherian relationships have been conclusively settled.

Key Words: cDNA • mammals • rooting • phylogeny • outgroup


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 
The problems in resolving the placental mammal tree were outlined by Novacek (1992)Go in a synthesis of classical and molecular data. At this time, the amount of molecular data suitable for phylogenetic analyses was limited, and many relationships at the ordinal level were still unresolved due to lacking or contradictory information. These molecular studies were commonly based on limited sequence data such as those from single genes or fragments thereof (de Jong et al. 1981Go; Miyamoto and Goodman 1986Go; Irwin et al. 1991Go; Springer et al. 1997Go). During the 1990s, the 10–13 kb of protein-coding regions from mitochondrial (mt) genomes became a popular tool for phylogenetic analysis (mitogenomics). The continuously increasing amount of molecular data during the 1990s provided improved resolution of eutherian relationships and led to the acknowledgment of some unexpected relationships such as the sister group relationship between carnivores (e.g., cat and dog) and perissodactyls (e.g., horse and rhinoceros) (Xu et al. 1996Go). Only a decade after Novacek's (1992)Go synthesis, some 60 complete mt genomes representing all orders of placental mammals had been sequenced and used for phylogenetic reconstruction (Arnason et al. 2002Go). These mitogenomic analyses also provided insight into the times of mammalian inter- and intraordinal divergences (e.g., Janke et al. 1994Go; Arnason et al. 1996Go; Nilsson et al. 2004Go), which otherwise in many instances had to depend on an incomplete fossil record.

Murphy et al. (2001)Go presented the results of phylogenetic analysis based on a combination of nuclear and mt-rRNA sequences. The study encompassed 44 species, and the amount of data representing each species was in the range of 8–12 kb in most instances. The analysis was performed on all 3 codon positions (cdp's) and included gaps and also chimeric and missing data as the result of the limited amount of sequences representing some taxa. Malia et al. (2003)Go and Misawa and Nei (2003)Go discussed the problems associated with analysis of data sets of this kind. The unrooted nuclear + mt-rRNA results were nevertheless largely consistent with earlier mitogenomic trees, and in the few instances where the trees differed, there appeared to be a greater consistency between morphology and the nuclear + mt-rRNA tree than between morphology and the mitogenomic tree. At the present time, the tree presented by Murphy et al. (2001)Go is probably the most popular hypothesis with respect to placental mammal evolution, and in the following we refer to this tree as the "Murphy tree." Similarly, we refer to the indel free and nonchimeric tree of Arnason et al. (2002)Go and its variants as the "mitogenomic tree."

Many questions in mammalian phylogeny involve the position and relationships of rodents. Figure 1 shows a simplified tree of the 3 major hypotheses of eutherian relationships. Mitogenomic analysis in general do not support monophyletic grouping of rodents, and some mitogenomic analyses have strongly advocated rodent paraphyly (D'Erchia et al. 1996Go; Reyes et al. 1998Go). As discussed by Arnason et al. (2002)Go, this question is at least partly related to the position of Erinaceidae (hedgehogs) in the eutherian tree. Rodent monophyly is unequivocally supported by morphological data (Simpson 1945Go; Novacek 1992Go) and also by the analysis by Murphy et al. (2001)Go and Huchon et al. (2002)Go of nuclear + mt-rRNA data. Removal of Erinaceidae from their basal eutherian position (Lin et al. 2002Go) and extended rodent sampling has provided stronger support to monophyletic Rodentia in the mitogenomic eutherian tree (Reyes et al. 2004Go).


Figure 1
View larger version (10K):
[in this window]
[in a new window]
 
FIG. 1.— Simplified relationships among placental mammals showing (a) the morphological hypothesis (Novacek 1992Go), (b) the mitogenomic tree (Arnason et al. 2002Go), and (c) the Murphy tree (Murphy et al. 2001Go). ART: Artiodactyla; PRI: Primates; ROD: Rodentia; LAG: Lagomorpha; PRO: Proboscidea (elephants); XEN: Xenarthra (anteater and armadillo); MAR: Marsupialia.

 
Another difference between the Murphy tree and the mitogenomic tree is the relationship between rodents and lagomorphs (rabbits, hares). Consistent with the Glires hypothesis (Gregory 1910Go; Liu and Miyamoto 1999Go), rodents and lagomorphs join on a common branch in the Murphy tree (fig. 1). Other analyses of nuclear genes have not identified monophyletic Glires (Li et al. 1990Go; Misawa and Janke 2003Go), but reanalysis of the data set of Misawa and Janke (2003)Go has supported Glires monophyly when using rate heterogeneity models (Douzery and Huchon 2004Go). Thus, this issue, which is crucial to the interpretation of basal eutherian divergences, remains controversial.

The Murphy tree joins Glires with Primates, Scandentia (tree shrews), and Dermoptera (flying lemurs) in the superordinal clade "Euarchontaglires" (Murphy et al. 2001Go). This relationship is not favored by mitogenomic analysis, whereas morphological data have suggested a grouping of Primates, Scandentia, Dermoptera, and Chiroptera (bats), in the superorder Archonta (Gregory 1910Go; Novacek and Wyss 1986Go).

As discussed by Arnason et al. (2002)Go, the essential difference between the mitogenomic tree and the Murphy tree is in the location of the root. Analysis of nuclear + mt-rRNA data identify a basal position of the so-called African clade, a grouping of elephants, dugongs, hyraxes, the aardvark, golden moles, elephant shrews, and tenrecs. Some of these lineages were morphologically joined by Le Gros Clark and Sonntag (1926)Go. In contrast, mitogenomic analyses preferably place the root on the Erinaceidae. When the Erinaceidae are disregarded, the rodents constitute a basal group in the mitogenomic tree (Arnason et al. 2002Go), a position that is consistent with some nuclear data (e.g., Li et al. 1990Go; Rosenberg and Kumar 2001Go; Jorgensen et al. 2005Go) but is inconsistent with the Murphy tree. This difference with respect to the phylogenetic position of basal lineages is crucial for the interpretation of eutherian relationships as it inter alia affects the positions of primates and rodents.

The study of Murphy et al. (2001)Go provided an important alternative to the results coming from mitogenomic analyses of eutherian relationships as for the first time a significant amount of nuclear protein-coding gene data was compiled for all eutherian orders. About 30% of the eutherian data did not have a counterpart in the 2 marsupial data sets that were used to root the eutherian tree. Whether this discrepancy may have affected the rooting of the eutherian tree is not known. The nature of some of the nuclear data may have influenced the outcome of the analyses, however. For example, the molecular distances between some of these genes are strikingly high. Thus, the amino acid distances between humans and mice for the von Willebrand factor (vWF), the interphotoreceptor retinoid-binding protein (IRPB), and the breast and ovarian cancer susceptibility protein 1 (BRCA1) sequence is 20%–45%. Distances of this degree are problematic as the background noise of the data may interfere with the resolving power of the sequences (Page and Holmes 1998Go; Nei and Kumar 2000Go). The odd evolutionary pattern of the BRCA1 gene with all cdp's evolving at the same rate (e.g., Adkins et al. 2001Go; Delsuc et al. 2002Go) is an additional complication that challenges the use of this gene in phylogenetic analysis.

It is evident that more comprehensive and better-suited nuclear data sets than those currently available are needed for establishing the tree of placental mammals. However, the intron–exon structure of protein-coding nuclear genes makes it difficult to acquire these sequences from genomic DNA. Here we have circumvented this problem by establishing via cDNA procedures the sequences of housekeeping genes from a number of different mammalian orders. The decision to use housekeeping genes (for definition see Watson et al. 1965Go; Warrington et al. 2000Go; Hsiao et al. 2001Go) was based on their presence in most tissues as this facilitates the recovery of cDNA sequences from tissues at different developmental stages and from cell cultures. In order to meet the need for versatility, we have directed our efforts toward genes that are also present and expressed outside the placental mammals (preferably in all vertebrates) and which have evolutionary rates that allow analysis of both basal vertebrate divergences and ordinal and subordinal divergences within different classes. For this reason, we have primarily selected genes that show amino acid distances of 2%–10% between humans and mice. The use of conserved sequences simplifies identification of these genes in mammals and other vertebrates and their alignment, but most importantly, it reduces problems in phylogenetic reconstruction due to multiple substitutions.

The cDNA is produced by standard methods, in order to make our approach universally applicable. The new data are then used to reconstruct and investigate the major branches and the basal divergences of the placental mammal tree.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 
Search for Housekeeping Genes
Housekeeping genes were identified by searching the human, mouse, rat, cow, and dog genomes in the National Center for Biotechnology Information (NCBI) database and by investigating housekeeping genes compiled by Warrington et al. (2000)Go. Sequences were aligned using Se-Al v2.0a11 (Rambaut 1996Go). After removing indel differences, absolute distances were calculated using PAUP* (Swofford 1998Go) for selecting slow evolving genes. From the alignments of these genes, oligonucleotide primer pairs (17–21 mers) were designed manually at conserved sites.

RNA Isolation and cDNA Synthesis
Whenever possible, tissue samples were collected within minutes postmortem. Most samples were frozen in liquid nitrogen and then stored at –80 °C. Primary fibroblast cell cultures were either established by the authors or purchased from American Type Culture Collection (http://www.atcc.org) or European Collection of Cell Cultures (http://www.ecacc.org.uk). The cells were cultured under standard conditions at 37 °C in 5% CO2 in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum, 2 mM L-glutamine, 0.1 mg/ml sodium pyruvate, 100 U/ml penicillin, and 0.1 mg/ml streptomycin (Freshney 2000Go).

The homogenization procedure was found to be crucial for an efficient RNA isolation. When tissue samples were homogenized, best results were obtained using a small motor-driven ultra-turrax (IKA Werke model T8 S5N-5G). RNA purification was performed using the guanidium thiocyanate method of Chomczynski and Sacchi (1987)Go. The approach produced high-quality RNA from all types of tissues with virtually no contaminations of RNAses, DNA, or protein. The preparations were inspected for their quality by judging the degradation of the 18S and 28S rRNA bands on a denaturizing, ethidium bromide–stained agarose gel (Sambrook and Russell 2001Go). Table 1 summarizes the species and respective tissue samples that were used in this study.


View this table:
[in this window]
[in a new window]
 
Table 1 Common and Scientific Species Names and Their Tissue Types Used for RNA Isolation

 
The RNA was reverse transcribed to cDNA using 2.5 µM oligo-dT primer (20-mers) and 22 U BcaBEST RT-polymerase (TaKaRa) in a total volume of 20 µl according to the manufacturer's recommendations. The reaction mix was incubated on a programmable thermocycler at 65 °C for 1 min and at 30 °C for 5 min. The temperature was then gradually raised to 65 °C over a period of 20 min. The sample was then incubated at 65 °C for 20 min, at 98 °C for 5 min, and finally kept at 5 °C until processed further. Specific cDNA sequences (table 2) were polymerase chain reaction (PCR) amplified in 25 µl volumes using 1 µl of the cDNA reaction with conserved primer pairs (table 3) and the Ex-Taq polymerase (TaKaRa) according to the manufacturer's protocol. More details on the RNA isolation from different tissue, the RNA stability, cDNA synthesis, and a comparison of different methods will be published elsewhere.


View this table:
[in this window]
[in a new window]
 
Table 2 Names, Accession Numbers, Amino Acid Distances, and Lengths of Housekeeping Genes Used for Phylogenetic Analysis

 

View this table:
[in this window]
[in a new window]
 
Table 3 Primers Used for Amplifying and Sequencing Housekeeping Genes

 
The PCR product was purified by 2 subsequent ethanol precipitations, and both strands were sequenced using the BIG-DYE v. 3 cycle sequencing kit on an ABI Prism 3100 Genetic Analyzer. If the PCR product was longer than 1000 nt, an internal primer was designed for that gene in order to sequence the remaining region. The accession numbers of the new cDNA genes are: DQ402944DQ402968 (malate dehydrogenase 2, NAD [mt]), DQ402969DQ402993 (succinate dehydrogenase subunit A), DQ402994DQ403018 (succinate dehydrogenase subunit B [SDHB]), DQ403019DQ403043 (ribosomal protein L18), DQ403044DQ403068 (glyceraldehyde-3-P-dehydrogenase), DQ403069DQ403093 (isocitrate dehydrogenase 1), DQ403094DQ403118 (ATP synthase, beta polypeptide), and DQ403119DQ403143 (citrate synthase).

Phylogenetic Reconstruction
Phylogenetic relationships were analyzed using the Tree-Puzzle (Schmidt et al. 2002Go), PHYLIP (Felsenstein 1993Go), MOLPHY (Adachi and Hasegawa 1996Go), MrBayes (Huelsenbeck and Ronquist 2001Go), TREEFINDER (Jobb et al. 2004Go), PHYML (Guindon and Gascuel 2003Go), or PAUP* (Swofford 1998Go) program packages. The best-fitted model for nucleotide sequence evolution and parameters were determined with Modeltest (Posada and Crandall 1998Go). The WAG2000 model of amino acid sequence evolution (Whelan and Goldman 2001Go) and the general time reversible (GTR) and 3-state GTR model of nucleotide evolution (Lanave et al. 1984Go; Gibson et al. 2005Go) were used for distance and likelihood analyses. All analyses were performed in parallel with or without assuming a gamma model of rate heterogeneity (Gu et al. 1995Go) with 4 classes of variable sites and 1 class of invariable site (4{Gamma} + I). Bayesian analysis was carried out under standard conditions with the above-mentioned models, running 3 cold and 1 heated chain and using 1,000,000 generations, a sample and print frequency of 100 and a burnin of 100,000 generations. Maximum likelihood (ML) bootstrap support was evaluated from 1000 bootstrap replicates of the amino acid data set generated by SEQBOOT and with PHYML under the WAG2000 + 4{Gamma} + I model using the tree in figure 2 as the starting tree. The resulting topologies were summarized by CONSENS.


Figure 2
View larger version (10K):
[in this window]
[in a new window]
 
FIG. 2.— ML tree of mammals based on amino acid analysis applying WAG2000 + 4{Gamma} + I.

 
Dating of Divergence Times
Estimates of divergence times were based on the tree shown in figure 2 from amino acid sequences and using the R8S version 1.70 program package (Sanderson 2002Go) and a nonparametric rate smoothing method and optimization by the Powell model as implemented in R8S. For calculating standard deviations (SDs), 100 bootstrap replicates of the sequence data were created using SEQBOOT. The branch lengths for each replicate were estimated using Tree-Puzzle under WAG2000 + 4{Gamma} + I model, and the tree is given in figure 2. For each of those tree replicates, the divergence times were estimated by the R8S program, and standard errors (SEs) were calculated from the different results.

The program MULTIDIVTIME as implemented in the T3 program package (http://abacus.gene.ucl.ac.uk/) and the WAG2000 + 8{Gamma} model was used in addition to the R8S program to estimate divergence times. Eight categories of gamma-distributed rate heterogeneity among sites (8{Gamma}) were used in order to compensate for the lack of the program to account for invariable sites. Other important parameters that were used to build the model file (modelinf) are an absolute maximum age of the placental mammal origin 200 million years before present (MYBP) and 100,000 generations for the Markov chain Monte Carlo chain.

For calibration of the evolutionary rates and estimating the divergence times, we used the split between cetaceans and ruminants {approx}60 MYBP, AC-60 (Arnason et al. 1996Go), and the split between equids and rhinoceroses set at {approx}50 MYBP, ER-50 (Arnason et al. 1998Go). The reference AC-60—the divergence between artiodactyl (A) ruminants and cetaceans (C)—is (1) upheld by the age, {approx}53 Myr, of the oldest archaeocete fossils, (2) the divergence between Hippopotamidae and Cetacea (molecularly dated to {approx}55 MYBP), and (3) by its consistency with the age, 34–35 Myr, of the oldest fossils diagnostic for the divergence between odontocetes and mysticetes (Arnason, Gullberg, Gretarsdottir, et al. 2000Go; Arnason et al. 2004Go and references in these papers). The lower bound of this reference is more difficult to define, but based on paleontological evidence, its lower limits can be placed at {approx}65 MYA (Gatesy and O'Leary 2001Go), a date that we maintain in the current study. On these grounds, the split between cetaceans and ruminants has been set to 60–65 MYA. The age of ER-50 (Arnason et al. 1998Go) is substantiated by the age, {approx}48 Myr, of the oldest rhinocerotid fossil, Hyrachyus. It may be argued, however, that the lower bound of ER-50 should be placed earlier than 50 MYA in line with the early Eocene age of Hyracotherium. The status of Hyracotherium as a phylogenetic wastebasket (Hooker 1989Go; Prothero and Schoch 1989Go) makes the argumentation problematic, however. In the current estimates, we have tentatively placed the lower bound of ER at 58 MYBP, the same as used by Garland et al. (1993)Go. The metatherian/eutherian split has 125 MYA as upper limit, in agreement with the appearance of oldest metatherians (Sinodelphys) and eutherians (Eomaia) in the fossil record (Ji et al. 2002Go; Luo et al. 2003Go). As the lower limit, we have placed this split at 170 MYBP. This age is without paleontological substantiation but is among the oldest molecular-based estimates (Kumar and Hedges 1998Go).


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 
cDNA Sequences from Selected Housekeeping Genes
From the 535 housekeeping genes listed by Warrington et al. (2000)Go, about 50 genes were selected for further investigation. The evolutionary distance of these genes between humans and mice was 3%–10% at the amino acid level. For about 20 of these genes, conserved primers could be designed that spanned at least 500 nt of the protein-coding region and that would PCR amplify the corresponding gene from most placental mammals. In order to optimize the specificity of the primers, the number of ambiguities in the primer sequence was kept to a minimum.

For 8 of the selected genes, cDNA sequences could be produced from every placental species that was included in the study. The respective genes, their names, and properties are listed in table 2. The table also shows the accession number of the human homologue, the amino acid distance between humans and mice, and the length and amino acid distances of the protein-coding regions that were used in the alignment for the phylogenetic analysis. The actual distance values for the sequenced cDNA region were lower than initially selected for because the conserved primer pairs usually covered the central and more conserved regions of the genes. The corresponding reverse transcription (RT)–PCR and sequencing primer pairs and their properties are listed in table 3.

In order to examine the consistency between the currently established sequences and sequences from whole genome projects, a number of cDNA sequences were produced for such an examination. The database sequence and the cDNA sequences were identical or differed at less than 3 nucleotide sites per sequence in all cases. This indicates that the primer pairs and RT–PCR conditions applied have produced homologous sequences in these model organisms and conceivably also in the other mammalian species studied.

Tree Reconstruction
All mammalian sequences aligned readily. Only a single indel difference was identified in the SDHB amino acid sequence of the Xenarthra as compared with the remaining mammals. The position of this indel, which was removed from the alignment, was unambiguous. The chicken genome may not be completely sequenced. Therefore, some chicken sequences were obtained from expressed sequence tags (EST) data (U.D. Chick EST Database). The EST sequences rarely covered the complete protein-coding region, and therefore, 5% of the sequences are missing from the chicken in the alignment. Exclusion of the chicken increased the amount of sites that were indel free, but the phylogenetic results remained unaffected. The alignment is available from treebase (http://www.treebase.org/treebase/index.html), accession number SN2729.

Concatenation of the amino acid sequences resulted in a data set with a total length of 1942 amino acid (aa) sites for mammals. The directly calculated distance between humans and mice was 3.6%. Tree-Puzzle estimated the distance between humans and mice to 3.2% under the WAG2000 model, whereas gamma rate heterogeneity (4{Gamma} + I) yielded an estimate of 3.3%. The current focus on genes that have low amino acid distance values limits the number of multiple substitutions that may have occurred in the sequences. Based on a Poisson model (d = –ln [1 – p]), the expected amino acid distance (d) can be calculated from p, the observed amino acid distance (Nei and Kumar 2000Go). With a distance of 3.6% between humans and mice and a sequence length of 1942 aa, the calculation suggests that only about one extra substitution may have remained undetected as the result of multiple substitutions. If so, the effect of unaccounted substitutions is negligible, and below the SE for p. Correspondingly, low values were observed for nonsynonymous nucleotide sites. This simplified estimate illustrates the low frequency of multiple substitutions in conserved sequences.

The tree in figure 2 shows the ML tree using 1942 aa sites under the WAG2000 + 4{Gamma} + I model. Rodentia and Lagomorpha constitute sistergroups forming the cohort Glires that is placed in a basal position among placental mammals. The next crownward group is Scandentia, followed by the primates. The analysis identified a sister group relationship between the elephant, representing the African clade and Xenarthra (anteater and armadillo). This group was sister to Cetferungulata (Arnason et al. 1999Go) plus Chiroptera (bats). Within the Cetferungulata, the Carnivora (carnivores) group with the Perissodactyla (horse and rhinos) and the Artiodactyla with the Cetacea (whales). Artiodactyla itself is paraphyletic because the hippopotamus cluster with the Cetacea, whereas cow and pig are sister groups.

Glires was not placed in a basal eutherian position in all analyses. Instead, the rodents branched off first when amino acid or all cdp's were analyses under rate homogeneity, and the muroids were basal when first and second cdp GTR were used, both of which would make the Glires paraphyletic. The likelihood difference of these trees relative to the one shown in figure 2 was less than 0.2 times the SE. Thus, there was no significant difference between these groupings. Most data sets and analytical methods reconstructed the Glires as shown in figure 2, however. Also, when using only the chicken or only the marsupials as outgroup, most phylogenetic analyses placed the root of the eutherian tree on the Glires or members thereof.

The only single analysis that did not identify the root at or within the Glires was the ML analysis using all 3 cdp's and a GTR + 4{Gamma} + I model of nucleotide substitution. Under this scenario, the elephant and Xenarthra were basal in the eutherian tree, and Glires and primates became sistergroups (fig. 3b). However, this analysis included third cdp's, which show large distance values relative to the outgroup. At these sites, the distance value between marsupials and eutherians is about 40%. This indicates a high level of randomization due to multiple substitutions. Third cdp's also markedly deviate in nucleotide composition among all orders. Inclusion of these sites is therefore of questionable value, although this is common practice in some analyses of eutherian evolution (Murphy et al. 2001Go). When differences at third cpd's are limited to transversions (R = purines, Y = pyrimidines), ML analysis assuming a GTR + 4{Gamma} + I model of evolution placed the eutherian root again in Glires. Use of a 3-state GTR + 4{Gamma} + I model as implemented in TREEFINDER for the analysis of all 3 cdp's also placed Glires basal among eutherian mammals. Table 4 summarizes an ML analysis of different tree topologies and different data sets.


Figure 3
View larger version (13K):
[in this window]
[in a new window]
 
FIG. 3.— Alternative simplified topologies that were evaluated in an ML analysis (table 4). AVE: Aves; MAR: Marsupialia; CET: Cetferungulata; CHI: Chiroptera; PRI: Primates; SCA: Scandentia; ROD: Rodentia; LAG: Lagomorpha; PRO: Proboscidea (elephants); XEN: Xenarthra (anteater and armadillo).

 

View this table:
[in this window]
[in a new window]
 
Table 4 ML Analysis of Alternative Trees

 
Based on the analysis of amino acid sequences, the difference in the log likelihood ({Delta}log L) between the topology in figure 3b and those in figures 2 and 3a is about 1 SE. In analysis of first and second cdp's, assuming a GTR + 4{Gamma} + I model, the trees in figure 3b and c are a half SE worse than the best one. This difference is not significant but indicates that placing the root at the elephant and Xenarthra may not be optimal and may not hold when more data are included. Alternative trees with the root either on the elephant or the xenarthrans show the same tendency. The recently proposed Murphy tree and the mitogenomic tree (fig. 3d and e) were not favored in any ML analyses. Although they cannot be statistically rejected applying Shimodaira–Hasegawa test, they are more than 1 SE worse and in many analyses more than 2 SE worse than the best tree.

The position of Scandentia (represented by the tree shrew) remained essentially unsettled. Some analyses using first and second cdp's placed Scandentia and Primates as sister groups, but most analyses did not identify this relationship. However, the difference in the log likelihood for the position of the tree shrew as in figure 2 or as sistergroup to the primates is about –1, that is, only 1/10 of the SE of the log L estimate. The position of the Scandentia relative to the primates is therefore best regarded as unresolved by the current data set.

The Bayesian probability values for internal branches (a–w in fig. 2) are given in table 5. MrBayes provided some support for the root of the eutherian tree being on Glires using amino acid data or on the rodents based on nucleotide data. However, the probability for lagomorphs grouping with other eutherians as in the nucleotide analysis is limited (P = 0.44), and the Glires hypothesis cannot be rejected. The low MrBayes support for Glires (P = 0.61) based on amino acid sequence analysis may reflect an influence from the placement of the root on rodents as observed in some other analyses. In contrast to the topology shown in figure 2, MrBayes identified some affinities between Scandentia and Primates and also between Chiroptera and Artiodactyla using nucleotide data. There is a surprisingly strong support for the pig grouping with the cow. This association has not been favored in previous phylogenetic studies (Murphy et al. 2001Go; Arnason et al. 2002Go). The {Delta}log L for placing the pig basal to the (cow, (hippopotamus, whale)) group is, depending on analysis of amino acid or nucleotide sequences, 0.7 to 3 times SE lower than the position of the pig as in the best ML tree.


View this table:
[in this window]
[in a new window]
 
Table 5 Divergence Times and Bayesian and ML Bootstrap Support for Different Relationships

 
Divergence Times
The estimated divergence times of major lineages are summarized in table 5. These times are generally slightly younger than those estimated previously using mitogenomic data, but the dates do not differ significantly in most instances. The R8S and MULTIDIVTME package and their different underlying methods provide similar divergence times for the eutherian and marsupial divergences. Most results overlap well within less than the 1 SE/SD differences.


    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 
RT–PCR of Housekeeping Genes
In the current study, cDNA sequences from 8 genes were produced from 25 species representing 11 eutherian and 2 marsupial orders and a large variety of tissues. Although the isolation and handling of RNA is not unproblematic (Farrell 1998Go; Sambrook and Russell 2001Go), the study has demonstrated the utility of RNA/cDNA-based approaches in phylogenetic studies.

The phylogenetic analyses were based on sequence data of the same 8 genes of all species, thereby eliminating the potential influence of missing data. This contrasts to several phylogenetic studies that have been based on concatenated nuclear genes. Such studies often lacked sequences, especially for the outgroup, and included genes that are difficult to align (Murphy et al. 2001Go). The possibility to choose from hundreds of different housekeeping genes provides many opportunities for selecting genes that are suitable for phylogenetic analysis. Such a selection can be made as here on the basis of distance values and the facility with which the sequences can be unambiguously aligned. Various other parameters for selecting suitable genes such as nucleotide composition or evolutionary rate differences can be evaluated using whole genomic data from model organisms (e.g., human, cow, dog, mouse, rat) prior to establishing the corresponding genes from other species.

The individual genes that were selected for this study have distance values of 3%–10% between rodents and primates. As mentioned above, these distances appear to be within a range that minimizes the number of multiple substitutions, whereas at the same time provide enough sequence variation to allow phylogenetic analysis at the ordinal and subordinal levels. The approach differs from studies that have made use of genes with up to 45% distance between the purportedly orthologous amino acid sequences among eutherians and up to a distance of 60% between the marsupials and the eutherian ingroup. Superficially, genes as vWF, IRPB, and BRCA that have been used in previous studies of mammalian relationships may appear to provide extended phylogenetic information. This may be illusory, however, as distances at this level will by necessity carry a heavy load of noise that may interfere with the correct phylogenetic signal.

The sequence consistency within each individual gene chosen for the current study greatly facilitated the alignment of ambiguous sites and the exclusion of indel differences. Sites of this kind have a yet not well-examined influence on the phylogenetic reconstruction. PAUP* uses the largest pairwise distance in the data set as a default value in distance analysis, if a sequence pair is missing (Swofford 1998Go), and this may skew the analysis. Most other programs do not specify how partial and missing sequences or gaps are treated. Some ignore the whole indel column for all species in the alignment, whereas other ignore a gapped site in pairwise comparisons. Either way, ambiguous alignment sites can introduce an erroneous signal into the data.

Several nuclear genes that have been used for mammalian phylogenetic analysis are specific for mammals (Murphy et al. 2001Go). This restricts the use of these genes to studies of this particular group. In contrast, housekeeping genes, like mitogenomic data, can be used for phylogenetic analysis of a wider range of vertebrates. Another advantage of using cDNA sequences rather than the PCR products of putative exons is that inclusion of pseudogenic sequences in the phylogenetic analyses is essentially avoided.

Phylogeny
The current analysis of 11 out of the 18 traditional placental orders allows examination of the major basal branching pattern of the eutherian tree because the sampling includes rodents, primates, and artiodactyls. The crucial question is whether rodents have a basal position in this collection of taxa or whether they join the primates on a common branch. The phylogenetic findings based on the new nuclear data set show general consistency with previously reported relationships among placental mammals. The relationships within Cetferungulata and the Chiroptera/Cetferungulata grouping corroborate earlier mitogenomic analyses (Pumo et al. 1998Go; Arnason et al. 2002Go) and results based on a combination of nuclear and mt sequences (Murphy et al. 2001Go). The grouping of rodents and lagomorphs in Glires is also supported by this new data set. This confirms earlier findings using nuclear genes (Murphy et al. 2001Go; Huchon et al. 2002Go; Douzery and Huchon 2004Go) but is in conflict with most mitogenomic analyses, which do not suggest a sistergroup relationship between these taxa.

The placement of the root of the eutherian tree on Glires as favored by the new set of nuclear genes is somewhat unexpected. It is inconsistent with the Murphy tree and other similar studies of combined data (Madsen et al. 2001Go; Murphy et al. 2001Go), which placed the root on the African clade and/or Xenarthra. More recent reports on selected genes from the same data set also placed the root on the African clade plus Xenarthra (Delsuc et al. 2002Go, 2004Go). In contrast to these results, a basal position of rodents has been described in other studies using nuclear genes (Li et al. 1990Go; Jorgensen et al. 2005Go). In the current study, a basal position of the African clade (as represented by the elephant) plus Xenarthra was only favored after including all 3 cdp's in an ML analysis that applied the GTR + 4G + I model of sequence evolution. All other analyses, whether performed on amino acid sequences or first and second cdp's (see table 4 for details), placed the root on Glires. It is possible that the inclusion of third cdp's interferes with the analysis as a result of high distance values and nonhomogenous composition. The long branches of the elephant and Xenarthra in figure 2 reflect the fast evolutionary rates of these taxa. Long-branch attraction may constitute a problem in phylogenetic analysis (Felsenstein 1978Go) by dragging the affected taxa to a basal position. If long-branch attraction has affected the current analyses, it has not done so to the extent of placing the African clade/Xenarthra in a basal position in the eutherian tree.

Analyses of protein-coding sequences from whole genomes (>200,000 sites) have identified a basal position of rodents relative to human and the pig, using the fugu as an outgroup (Jorgensen et al. 2005Go). Although the taxon sampling of the study is limited, the relationship between rodents, primates, and artiodactyls is consistent with that of the current study.

The relationship among the cetferungulate orders (Carnivora, Perissodactyla, Artiodactyla, and Cetacea) is consistent with the mitogenomic and the Murphy trees, corroborating the phylogenetic utility of the new set of data. In comparison, analyses of single nuclear genes have failed to resolve these relationships (Miyamoto and Goodman 1986Go; Springer et al. 1997Go; DeBry and Sagel 2001Go; Madsen et al. 2001Go). The strongly supported grouping of the pig and the cow by the new data set is surprising, however, as this relationship has not been identified in other molecular analyses, which rather place the pig basal to a (cow, (hippopotamus, whale)) grouping. It remains to be seen if this relationship will be upheld with the inclusion of additional taxa or sequences.

Dating
The estimated times of most divergences in this study corroborate earlier estimates based on mitogenomic (Janke et al. 1994Go; Arnason et al. 1996Go; Arnason et al. 1998Go) or nuclear data (Li et al. 1990Go; Kumar and Hedges 1998Go). The algorithms behind the R8S and MULTIDIVTIME programs yielded concurring results, indicating that both programs compensate similarly for differences in evolutionary rates when calculating divergence times. Among the eutherians there are, nevertheless, a few notable differences relative to estimates based on nuclear data. The divergence time between rat and mouse is consistent with that of most mitogenomic studies that have used nonrodent calibration points (Arnason et al. 1996Go; Arnason, Gullberg, Gretarsdottir, et al. 2000Go). The estimate is, however, in conflict with recent nuclear studies of rodent relationships (Douzery and Huchon 2004Go). Similarly, the divergence between Old and New World monkeys at {approx}50 MYBP conforms to mitogenomic estimates (Arnason et al. 1998Go; Arnason, Gullberg, Gretarsdottir, et al. 2000Go; Arnason, Gullberg, Schweizer Burguete, and Janke 2000Go) and primate paleontology (Godinot and Mahboubi 1992Go) but challenges estimates that have used this split, set at 30–35 MYBP, as a calibration point to place the divergence between human and chimpanzee at 5 MYBP (e.g., Sarich 1970Go; Goodman 1996Go). As should be evident to the reader, a placement of the Old/New World primate calibration point at 50 MYBP rather than at 30–35 MYBP will automatically place the human/chimpanzee split proportionally earlier, consistent with the paleontological rebuttal (Senut et al. 2001Go) of a human/chimpanzee divergence 5 MYBP. Whether the differences in estimates between this study and earlier studies are related to limited taxon sampling within the respective groups, or on a limited amount of sequence data, remains to be shown. Nevertheless, the general congruency between divergence date estimates between this and other studies provides additional confidence in the data used here for phylogenetic analysis.

Summary
By applying mRNA/cDNA procedures, nuclear-encoded housekeeping genes can be recovered from a large variety of mammalian species and used for phylogenetic analysis. The approach enables the preselection of genes useful for phylogenetic reconstruction. Preselection of this kind can be carried out independently of a phylogeny, for example, on the basis of distance values and base composition. The current data set was established by concentrating on relatively slowly evolving genes, where only a limited number of multiple substitutions can be expected. The facility with which homologous sites could be unambiguously identified and aligned constituted another advantage connected to the relatively slow evolutionary rates of these genes. This new mammalian data set suggests that neither the Murphy tree nor the mitogenomic tree is entirely correct. Most elements of each tree are supported but a few are not. A major result from the study is that rodents and lagomorphs and not the African clade or Xenarthra are basal in the eutherian tree. This disrupts some groups that have been defined on the basis of the Murphy tree. The definition and naming of these unconventional clades may need reconsideration after analysis of larger and more comprehensive data sets, preferably of housekeeping genes, which due to their generally slow evolutionary rates also allow the examination of phylogenetic relationships of greater depths than those of mammalian orders.


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 
The Swedish Science Council, the Erik Philip-Sörensen Foundation, the Jörgen Lindström Foundation, and the Nilsson-Ehle Foundation supported the study. The authors would like to express their gratitude to Lars Hegestad, Jan Nilsson, and Eberhart Fuchs for providing tissue samples.


    Footnotes
 
William Martin, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Acknowledgements
 References
 

    Adachi J, Hasegawa M. 1996. MOLPHY version 2.3: programs for molecular phylogenetics based on maximum likelihood. Comput Sci Monogr 28:1–150.

    Adkins RM, Gelke EL, Rowe D, Honeycutt RL. 2001. Molecular phylogeny and divergence time estimates for major rodent groups: evidence from multiple genes. Mol Biol Evol 18:777–91.[Abstract/Free Full Text]

    Arnason U, Adegoke JA, Bodin K, Born EW, Esa YB, Gullberg A, Nilsson M, Short RV, Xu X, Janke A. 2002. Mammalian mitogenomic relationships and the root of the eutherian tree. Proc Natl Acad Sci USA 99:8151–6.[Abstract/Free Full Text]

    Arnason U, Gullberg A, Gretarsdottir S, Ursing B, Janke A. 2000. The mitochondrial genome of the sperm whale and a new molecular reference for estimating eutherian divergence dates. J Mol Evol 50:569–78.[ISI][Medline]

    Arnason U, Gullberg A, Janke A. 1998. Molecular timing of primate divergences as estimated by two nonprimate calibration points. J Mol Evol 47:718–27.[CrossRef][ISI][Medline]

    Arnason U, Gullberg A, Janke A. 1999. The mitochondrial DNA molecule of the aardvark, Orycteropus afer, and the position of the Tubulidentata in the eutherian tree. Proc Biol Sci 266:339–45.

    Arnason U, Gullberg A, Janke A. 2004. Mitogenomic analyses provide new insights into cetacean origin and evolution. Gene 333:27–34.[CrossRef][ISI][Medline]

    Arnason U, Gullberg A, Janke A, Xu X. 1996. Pattern and timing of evolutionary divergences among hominoids based on analyses of complete mtDNAs. J Mol Evol 43:650–61.[CrossRef][ISI][Medline]

    Arnason U, Gullberg A, Schweizer Burguete A, Janke A. 2000. Molecular estimates of primate divergences and new hypotheses for primate dispersal and the origin of modern humans. Hereditas 133:217–28.[CrossRef][ISI][Medline]

    Chomczynski P, Sacchi N. 1987. Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem 162:156–9.[ISI][Medline]

    DeBry RW, Sagel RM. 2001. Phylogeny of Rodentia (Mammalia) inferred from nuclear-encoded gene IRBP. Mol Phylogenet Evol 19:290–301.[CrossRef][ISI][Medline]

    de Jong WW, Zweers A, Goodman M. 1981. Relationship of aardvark to elephants, hyraxes and sea cows from {alpha}-crystallin sequences. Nature 292:538–40.[CrossRef][Medline]

    Delsuc F, Scally M, Madsen O, Stanhope MJ, de Jong WW, Catzeflis FM, Springer MS, Douzery EJ. 2002. Molecular phylogeny of living xenarthrans and the impact of character and taxon sampling on the placental tree rooting. Mol Biol Evol 19:1656–71.[Abstract/Free Full Text]

    Delsuc F, Vizcaíno SF, Douzery EJ. 2004. Influence of Tertiary paleoenvironmental changes on the diversification of South American mammals: a relaxed molecular clock study within xenarthrans. BMC Evol Biol 4:11.[CrossRef][Medline]

    D'Erchia AM, Gissi C, Pesole G, Saccone C, Arnason U. 1996. The guinea pig is not a rodent. Nature 381:597–600.[CrossRef][Medline]

    Douzery EJ, Huchon D. 2004. Rabbits, if anything, are likely Glires. Mol Phylogenet Evol 33:922–35.[CrossRef][ISI][Medline]

    Farrell RE. 1998. RNA methodologies: a laboratory guide for isolation and characterization. San Diego, CA: Academic Press.

    Felsenstein J. 1978. Cases in which parsimony or compatibility methods will be positively misleading. Syst Zool 27:401–10.[CrossRef]

    Felsenstein J. 1993. Phylogenetic inference programs (PHYLIP). Seattle, WA: University of Washington.

    Freshney RI. 2000. Culture of animal cells: a manual of basic technique. 4th ed. New York: Wiley-Liss.

    Garland TJ, Dickerman AW, Janis CM, Jones JA. 1993. Phylogenetic analysis of covariance by computer simulation. Syst Biol 42:265–92.[CrossRef]

    Gatesy J, O'Leary MA. 2001. Deciphering whale origins with molecules and fossils. Trends Ecol Evol 16:562–70.[CrossRef]

    Gibson A, Gowri-Shankar V, Higgs PG, Rattray M. 2005. A comprehensive analysis of mammalian mitochondrial genome base composition and improved phylogenetic methods. Mol Biol Evol 22:251–64.[Abstract/Free Full Text]

    Godinot M, Mahboubi M. 1992. Earliest known simian primate found in Algeria. Nature 357:324–6.[CrossRef]

    Goodman M. 1996. Epilogue: a personal account of the origins of a new paradigm. Mol Phylogenet Evol 5:269–85.[CrossRef][ISI][Medline]

    Gregory WK. 1910. The orders of mammals. Bull Am Mus Nat Hist 27:1–524.

    Gu X, Fu YX, Li WH. 1995. Maximum likelihood estimation of the heterogeneity of substitution rate among nucleotide sites. Mol Biol Evol 12:546–57.[Abstract]

    Guindon S, Gascuel O. 2003. A simple, fast and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52:696–704.[CrossRef][ISI][Medline]

    Hooker JJ. 1989. Character polarities in early perissodactyls and their significance for Hyracotherium and infraordinal relationships. In: Prothero DR, Schoch RM, editors. The evolution of perissodactyls. New York: Clarendon Press. p 79–101.

    Hsiao LL, Dangond F, Yoshida T, et al. (23 co-authors). 2001. A compendium of gene expression in normal human tissues. Physiol Genomics 7:97–104.[Abstract/Free Full Text]

    Huchon D, Madsen O, Sibbald MJ, Ament K, Stanhope MJ, Catzeflis F, de Jong WW, Douzery EJ. 2002. Rodent phylogeny and a timescale for the evolution of Glires: evidence from an extensive taxon sampling using three nuclear genes. Mol Biol Evol 19:1053–65.[Abstract/Free Full Text]

    Huelsenbeck JP, Ronquist F. 2001. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17:754–55.[Abstract/Free Full Text]

    Irwin DM, Kocher TD, Wilson AC. 1991. Evolution of the cytochrome b gene of mammals. J Mol Evol 32:128–44.[CrossRef][ISI][Medline]

    Janke A, Feldmaier-Fuchs G, Thomas WK, von Haeseler A, Pääbo S. 1994. The marsupial mitochondrial genome and the evolution of placental mammals. Genetics 137:243–56.[Abstract]

    Ji Q, Luo ZX, Yuan CX, Wible JR, Zhang JP, Georgi JA. 2002. The earliest known eutherian mammal. Nature 416:816–22.[CrossRef]

    Jobb G, von Haeseler A, Strimmer K. 2004. TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics. BMC Evol Biol 4:18.[CrossRef][Medline]

    Jorgensen FG, Hobolth A, Hornshoj H, Bendixen C, Fredholm M, Schierup MH. 2005. Comparative analysis of protein coding sequences from human, mouse and the domesticated pig. BMC Biol 3:2.[CrossRef][Medline]

    Kumar S, Hedges SB. 1998. A molecular timescale for vertebrate evolution. Nature 392:917–20.[CrossRef]

    Lanave C, Preparata G, Saccone C, Serio G. 1984. A new method for calculating evolutionary substitution rates. J Mol Evol 20:86–93.[CrossRef][ISI][Medline]

    Le Gros Clark WE, Sonntag CF. 1926. A monograph of Orycteropus afer. Proc Zool Soc 30:445–85.

    Li WH, Gouy M, Sharp PM, O'hUigin C, Yang YW. 1990. Molecular phylogeny of Rodentia, Lagomorpha, Primates, Artiodactyla, and Carnivora and molecular clocks. Proc Natl Acad Sci USA 87:6703–7.[Abstract/Free Full Text]

    Lin Y.-H, McLenachan PA, Gore AR, Phillips MJ, Ota R, Hendy MD, Penny D. 2002. Four new mitochondrial genomes and increased stability of evolutionary trees of mammals from improved taxon sampling. Mol Biol Evol 19:2060–70.[Abstract/Free Full Text]

    Liu FG, Miyamoto MM. 1999. Phylogenetic assessment of molecular and morphological data for eutherian mammals. Syst Biol 48:54–64.[CrossRef][ISI][Medline]

    Luo ZX, Ji Q, Wible JR, Yuan CX. 2003. An Early Cretaceous tribosphenic mammal and metatherian evolution. Science 302:1934–40.[Abstract/Free Full Text]

    Madsen O, Scally M, Douady CJ, Kao DJ, DeBry RW, Adkins R, Amrine HM, Stanhope MJ, de Jong WW, Springer MS. 2001. Parallel adaptive radiations in two major clades of placental mammals. Nature 409:610–4.[CrossRef][Medline]

    Malia MJ, Lipscomb DL, Allard MW. 2003. The misleading effects of composite taxa in supermatrices. Mol Phylogenet Evol 27:522–7.[CrossRef][ISI][Medline]

    Misawa K, Janke A. 2003. Revisiting the Glires concept—phylogenetic analysis of nuclear sequences. Mol Phylogenet Evol 28:320–7.[CrossRef][ISI][Medline]

    Misawa K, Nei M. 2003. Reanalysis of Murphy et al.'s data gives various mammalian phylogenies and suggests overcredibility of Bayesian trees. J Mol Evol 57:S290–6.

    Miyamoto MM, Goodman M. 1986. Biomolecular systematics of eutherian mammals: phylogenetic patterns and classification. Syst Zool 35:230–40.[CrossRef]

    Murphy WJ, Eizirik E, O'Brien SJ, et al. (11 co-authors). 2001. Resolution of the early placental mammal radiation using Bayesian phylogenetics. Science 294:2348–51.[Abstract/Free Full Text]

    Nei M, Kumar S. 2000. Molecular evolution and phylogenetics. New York: Oxford University Press.

    Nilsson MA, Arnason U, Spencer PB, Janke A. 2004. Marsupial relationships and a timeline for marsupial radiation in South Gondwana. Gene 340:189–96.[CrossRef][ISI][Medline]

    Novacek MJ. 1992. Mammalian phylogeny: shaking the tree. Nature 356:121–5.[CrossRef]

    Novacek MJ, Wyss AR. 1986. Higher-level relationships of the recent eutherian orders: morphological evidence. Cladistics 2:257–87.

    Page RDM, Holmes EC. 1998. Molecular evolution—a phylogenetic approach. Oxford: Blackwell Science Ltd.

    Posada D, Crandall KA. 1998. Modeltest: testing the model of DNA substitution. Bioinformatics 14:817–8.[Abstract/Free Full Text]

    Prothero DR, Schoch RM. 1989. Origin and evolution of the Perissodactyla: summary and synthesis. In: Prothero DR, Schoch RM, editors. The evolution of perissodactyls. New York: Clarendon Press. p 504–29.

    Pumo DE, Finamore PS, Franek WR, Phillips CJ, Tarzami S, Balzarano D. 1998. Complete mitochondrial genome of a neotropical fruit bat, Artibeus jamaicensis, and a new hypothesis of the relationship of bats to other eutherian mammals. J Mol Evol 47:709–17.[CrossRef][ISI][Medline]

    Rambaut A. 1996. Se-Al v2.0a11: sequence alignment editor. Available from: http://evolve.zoo.ox.ac.uk/.

    Reyes A, Gissi C, Catzeflis F, Nevo E, Pesole G, Saccone C. 2004. Congruent mammalian trees from mitochondrial and nuclear genes using Bayesian methods. Mol Biol Evol 21:397–403.[Abstract/Free Full Text]

    Reyes A, Pesole G, Saccone C. 1998. Complete mitochondrial DNA sequence of the fat dormouse, Glis glis: further evidence of rodent paraphyly. Mol Biol Evol 15:499–505.[Abstract]

    Rosenberg MS, Kumar S. 2001. Incomplete taxon sampling is not a problem for phylogenetic inference. Proc Natl Acad Sci USA 98:10751–6.[Abstract/Free Full Text]

    Sambrook J, Russell DW. 2001. Molecular cloning, a laboratory manual. New York: Cold Spring Harbor Press.

    Sanderson MJ. 2002. Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach. Mol Biol Evol 19:101–9.[Abstract/Free Full Text]

    Sarich VM. 1970. Primate systematic with special reference to Old World monkeys. In: Napier JR, Napier PH, editors. Old World monkeys: evolution, systematic and behavior. New York: Academic Press. p 175–266.

    Schmidt HA, Strimmer K, Vingron M, von Haeseler A. 2002. TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18:502–4.[Abstract/Free Full Text]

    Senut B, Pickford M, Gommery D, Mein P, Cheboi K, Coppens Y. 2001. First hominid from the Miocene (Lukeino Formation, Kenya). C R Acad Sci Paris Sci Terre Planètes 332:137–44.

    Simpson GG. 1945. The principles of classification and a classification of mammals. Bull Am Mus Nat Hist 85:1–272.

    Springer MS, Cleven GC, Madsen O, de Jong WW, Waddell VG, Amrine HM, Stanhope MJ. 1997. Endemic African mammals shake the phylogenetic tree. Nature 388:61–4.[CrossRef]

    Swofford DL. 1998. Phylogenetic analysis using parsimony (*and other methods). Version 4. Sunderland, MA: Sinauer Associates.

    Warrington JA, Nair A, Mahadevappa M, Tsyganskaya M. 2000. Comparison of human adult and