MBE Advance Access originally published online on October 5, 2007
Molecular Biology and Evolution 2007 24(12):2594-2597; doi:10.1093/molbev/msm218
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Letter |
Analysis of Rare Amino Acid Replacements Supports the Coelomata Clade
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland
E-mail: koonin{at}ncbi.nlm.nih.gov.
| Abstract |
|---|
|
|
|---|
The recent analysis of a novel class of rare genomic changes, RGC_CAMs (after conserved amino acids—multiple substitutions), supported the Coelomata clade of animals as opposed to the Ecdysozoa clade (Rogozin et al. 2007
Key Words: phylogenetic analysis cladistics rare genomic changes coelomata ecdysozoa
As the set of sequenced genomes from diverse taxa rapidly grows, phylogenetic analysis is entering a new era when the reconstruction of the evolutionary history of organisms on the basis of full-scale comparison of their genomes becomes the strategy of choice. In addition to more traditional, genome-wide analysis of alignments, rare genomic changes (RGCs) that are likely to comprise derived shared characters of individual clades are increasingly used in genome-wide phylogenetic studies (Rokas and Holland 2000
; Nei and Kumar 2001
; Rokas et al. 2003
).
We have recently proposed a new type of RGCs designated RGC_CAMs (after conserved amino acids—multiple substitutions), which are inferred using a genome-scale analysis of protein and underlying nucleotide sequence alignments (Rogozin et al. 2007
). The RGC_CAM approach utilizes amino acid residues that are conserved in the major lineages within an analyzed taxonomic division (e.g., eukaryotes), with the exception of a few species comprising a putative clade. In addition, to reduce the effect of homoplasy, only those amino acid replacements that require 2 or 3 nucleotide substitutions are employed for phylogenetic inference. The RGC_CAM analysis has been combined with a procedure for rigorous statistical testing of competing phylogenetic hypotheses and shown to be robust to branch-length differences and taxon sampling. When applied to animal phylogeny, the RGC_CAM approach significantly supports the coelomate clade that unites chordates with arthropods as opposed to the ecdysozoan (molting animals) clade that encompasses arthropods and nematodes (Rogozin et al. 2007
). This conclusion is compatible with some previous genome-wide phylogenetic analyses (Mushegian et al. 1998
; Blair et al. 2002
; Stuart and Berry 2004
; Wolf et al. 2004
; Philip et al. 2005
) but not others (Copley et al. 2004
; Dopazo and Dopazo 2005
; Philippe et al. 2005
) and runs against the view of animal evolution that is currently prevailing in the evolutionary developmental biology (evo-devo) community (Aguinaldo et al. 1997
; Adoutte et al. 2000
; Telford and Copley 2005
).
Irimia et al. (2007)
have further explored the RGC_CAM approach, after adding proteins from 2 recently sequenced animal genomes, the cnidarian (sea anemone) Nematostella vectensis and the nematode Brugia malayi, to the original data set of Rogozin et al. (2007)
. The analysis of the resulting alignments has suggested that the apparent support for the coelomate clade resulted from the rapid rate of evolution in the nematodes (Irimia et al. 2007
). There are 2 types of errors that have the potential to distort the results obtained with the RGC_CAM approach, namely, reversals and parallel changes (fig. 1). Irimia et al. (2007)
emphasize the effect of reversals but, effectively, ignore parallel changes; furthermore, they do not report any rigorous statistical analysis of the results.
|
Here we report a reanalysis of animal evolution with the RGC_CAM method, with special attention to the sources of potential artifacts, using a further amended data set. The adopted animal phylogeny is shown in figure 1, and the results of the RGC_CAM analysis of the set of 15 species are shown in the table 1 (top row). Only one RGC_CAM supported the coelomate clade, and 2 RGC_CAMs supported the ecdysozoan clade (table 1). Thus, considering the lengths of the respective branches, the coelomate clade still had a weak statistical support (table 1; see Methods for the details of the statistical test) under the assumption of the basal position of N. vectensis (the branch separating N. vectensis from the rest of the Bilateria is only 3 RGC_CAMs long [fig. 1], with no reversals). We further explored the support for different topologies provided by RGC_CAMs by performing taxon sampling of the outgroup species. All combinations of 10–15 species, that is, including from 1 to 6 outgroup species (63 combinations altogether), were analyzed. Of the 63 combinations, in 29 combinations of species, the raw number of RGC_CAMs compatible with the coelomate topology was greater than the number of RGC_CAMs compatible with the ecdysozoa topology, whereas the reverse was true of 32 combinations, with the remaining 2 combinations showing the same number of RGC_CAMs for both topologies (table 1). Considering the respective branch lengths, for 57 (91%) combinations of species, there was statistical support for the coelomate clade (table 1), whereas with the rest of the combinations (9%), none of the topologies received statistical support. Thus, the results of this extensive RGC_CAM analysis indicate support for the Coelomata topology but no significant support for the Ecdysozoa. As indicated by the results in table 1, the test loses most of its power when N. vectensis is included in the outgroup species set due to the very short branch connecting this species to the rest of animals. The problem could be caused by compressed cladogenesis at the base of the animal clade (Rokas et al. 2005
|
There are 2 types of evolutionary events that have the potential to produce artifacts in the RGC_CAM analysis, namely, parallel changes and reversals (fig. 1) (Irimia et al. 2007
|
In summary, the results of RGC_CAM analysis reported here reinforce the support for the Coelomata clade observed with this approach in the original study (Rogozin et al. 2007
| Methods |
|---|
|
|
|---|
Each of the 694 protein alignments constructed from selected eukaryotic orthologous groups (KOGs) (Koonin et al. 2004
The statistical test of phylogenetic hypotheses is based on a null model under which, in a comparison of 2 alternative hypotheses, for example, ([X–Y],Z) versus ([X–Z],Y), the number of RGC_CAMs that are shared by 2 lineages due to chance (NXY and NXZ) is proportional to the length of the branch the position of which differs between the 2 hypotheses, that is, Y and Z, respectively. The significance of the difference between normalized numbers of RGC_CAMs was estimated using Fishers exact test (Rogozin et al. 2007
).
| Acknowledgements |
|---|
|
|
|---|
We thank Scott Roy for providing his manuscript prior to publication and Miklos Csuros for helpful discussions. The sequence data for Monosiga brevicolis were produced by the US Department of Energy Joint Genome Institute http://www.jgi.doe.gov/. The B. malayi sequencing effort (http://www.tigr.org) is part of the International Brugia Genome Sequencing Project and is supported by an award from the National Institute of Allergy and Infectious Diseases, National Institutes of Health. This work was supported in part by the Intramural Research Program of the National Library of Medicine at National Institutes of Health/Department of Health and Human Services.
| Footnotes |
|---|
Lauren McIntyre, Associate Editor
| References |
|---|
|
|
|---|
Adoutte A, Balavoine G, Lartillot N, Lespinet O, Prud'homme B, de Rosa R. The new animal phylogeny: reliability and implications. Proc Natl Acad Sci USA (2000) 97:4453–4456.
Aguinaldo AM, Turbeville JM, Linford LS, Rivera MC, Garey JR, Raff RA, Lake JA. Evidence for a clade of nematodes, arthropods and other moulting animals. Nature (1997) 387:489–493.[CrossRef][Medline]
Blair JE, Ikeo K, Gojobori T, Hedges SB. The evolutionary position of nematodes. BMC Evol Biol (2002) 2:7.[CrossRef][Medline]
Copley RR, Aloy P, Russell RB, Telford MJ. Systematic searches for molecular synapomorphies in model metazoan genomes give some support for Ecdysozoa after accounting for the idiosyncrasies of Caenorhabditis elegans. Evol Dev (2004) 6:164–169.[CrossRef][ISI][Medline]
Dopazo H, Dopazo J. Genome-scale evidence of the nematode-arthropod clade. Genome Biol (2005) 6:R41.[CrossRef][Medline]
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res (2004) 32:1792–1797.
Irimia M, Maeso I, Penny D, Garcia-Fernandez J, Roy SW. Rare coding sequence changes are consistent with Ecdysozoa, not Coelomata. Mol Biol Evol (2007) 24:1604–1607.
Koonin EV, Fedorova ND, Jackson JD, et al, (18 co-authors). A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol (2004) 5:R7.[CrossRef][Medline]
Mushegian AR, Garey JR, Martin J, Liu LX. Large-scale taxonomic profiling of eukaryotic model organisms: a comparison of orthologous proteins encoded by the human, fly, nematode, and yeast genomes. Genome Res (1998) 8:590–598.
Nei M, Kumar S. Molecular evolution and phylogenetics (2001) Oxford: Oxford University.
Philip GK, Creevey CJ, McInerney JO. The Opisthokonta and the Ecdysozoa may not be clades: stronger support for the grouping of plant and animal than for animal and fungi and stronger support for the Coelomata than Ecdysozoa. Mol Biol Evol (2005) 22:1175–1184.
Philippe H, Lartillot N, Brinkmann H. Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa and Protostomia. Mol Biol Evol (2005) 22:1246–1253.
Rogozin IB, Wolf YI, Carmel L, Koonin EV. Ecdysozoan clade rejected by genome-wide analysis of rare amino acid replacements. Mol Biol Evol (2007) 24:1080–1090.
Rokas A, Carroll SB. Bushes in the tree of life. PLoS Biol (2006) 4:e352.[CrossRef][Medline]
Rokas A, Holland PW. Rare genomic changes as a tool for phylogenetics. Trends Ecol Evol (2000) 15:454–459.[CrossRef][Medline]
Rokas A, King N, Finnerty J, Carroll SB. Conflicting phylogenetic signals at the base of the metazoan tree. Evol Dev (2003) 5:346–359.[CrossRef][ISI][Medline]
Rokas A, Kruger D, Carroll SB. Animal evolution and the molecular signature of radiations compressed in time. Science (2005) 310:1933–1938.
Stuart GW, Berry MW. An SVD-based comparison of nine whole eukaryotic genomes supports a coelomate rather than ecdysozoan lineage. BMC Bioinformatics (2004) 5:204.[CrossRef][Medline]
Tatusov RL, Koonin EV, Lipman DJ. A genomic perspective on protein families. Science (1997) 278:631–637.
Telford MJ, Copley RR. Animal phylogeny: fatal attraction. Curr Biol (2005) 15:R296–R299.[CrossRef][ISI][Medline]
Wolf YI, Rogozin IB, Koonin EV. Coelomata and not Ecdysozoa: evidence from genome-wide phylogenetic analysis. Genome Res (2004) 14:29–36.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. W. Roy and M. Irimia Rare Genomic Characters Do Not Support Coelomata: Intron Loss/Gain Mol. Biol. Evol., April 1, 2008; 25(4): 620 - 623. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


