MBE Advance Access originally published online on September 8, 2005
Molecular Biology and Evolution 2006 23(1):1-3; doi:10.1093/molbev/msj006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Letter |
The "Inverse Relationship Between Evolutionary Rate and Age of Mammalian Genes" Is an Artifact of Increased Genetic Distance with Rate of Evolution and Time of Divergence
Department of Biology and Biochemistry, University of Houston
E-mail: dgraur{at}uh.edu.
| Abstract |
|---|
|
|
|---|
It has recently been claimed that older genes tend to evolve more slowly than newer ones (Alba and Castresana 2005). By simulation of genes of equal age, we show that the inverse correlation between age and rate is an artifact caused by our inability to detect homology when evolutionary distances are large. Since evolutionary distance increases with time of divergence and rate of evolution, homologs of fast-evolving genes are frequently undetected in distantly related taxa and are, hence, misclassified as "new." This misclassification causes the mean genetic distance of new genes to be overestimated and the mean genetic distance of "old" genes to be underestimated.
Key Words: nonsynonymous substitutions novel genes divergence times
| Introduction |
|---|
|
|
|---|
Alba and Castresana (2005)
In the study of Alba and Castresana (2005)
, the rate of evolution was calculated from human-mouse "orthologous" gene pairs. Orthology was defined operationally, rather than evolutionarily, through reciprocal BlastP (Altschul et al. 1997
) hits. For each pair of human-mouse genes, the number of nonsynonymous substitutions per nonsynonymous site (KA) was calculated. Alba and Castresana (2005)
determined the age of each human-mouse gene pair by the phylogenetic distribution of their homologs among the genomes of six model organisms: Takifugu rubripes, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Schizosaccharomyces pombe. If homologous genes were present in all these six genomes, the human-mouse gene pair was assigned to the OLD group. If homologous genes were found in C. elegans, D. melanogaster, and T. rubripes but absent from S. cerevisiae, S. pombe, and A. thaliana, then the human-mouse pair was classified in the METAZOANS group. If homologous genes were present in T. rubripes but absent from the other five genomes, then the pair was classified in the DEUTEROSTOMES group. If homologous genes were absent from all six genomes, then the pair was classified in the TETRAPODS group. A BlastP hit with an expectation value of less than 104 was deemed to be indicative of "presence." A negative correlation was found between the rate of substitution and age. The inferred KA values were 0.06, 0.08, 0.14, and 0.23 for OLD, METAZOANS, DEUTEROSTOMES, and TETRAPODS, respectively.
In this note, we show by simulation that the inverse relationship between evolutionary rate and gene age is an artifact caused by our inability to detect similarity when genetic distances are large.
| Methods |
|---|
|
|
|---|
The DNA Assembly with Gaps (DAWG) simulation program (Cartwight 2005
|
Presence or absence of a homolog in C, D, or E was determined by Blast 2 sequences. If the E value was less than 104, a homolog was assumed to be present; otherwise, we inferred absence. Similar results were obtained with cutoffs that varied from 102 to 1020. All cutoffs are, of course, arbitrary.
In a manner analogous to OLD, METAZOANS, DEUTEROSTOMES, and TETRAPODS of Alba and Castresana (2005)
, each simulated A-B pair was assigned to one of four age groups: SENIORS, ADULTS, TEENAGERS, and TODDLERS.
| Results and Discussion |
|---|
|
|
|---|
The distribution of the genetic distances and, by implication, the evolutionary rates are shown in figure 2. The SENIORS category consists of 753 sequence pairs. The corresponding values for ADULTS, TEENAGERS, and TODDLERS were 803, 1,766, and 1,592, respectively. The mean genetic distance for SENIORS was 0.08, and the corresponding values for ADULTS, TEENAGERS, and TODDLERS were 0.11, 0.17, and 0.22, respectively.
|
Alba and Castresana (2005)
Our simulation clearly demonstrates the inherent inability to detect homology of the fastest evolving genes (fig. 3). One can clearly see that detectability decreases rapidly with evolutionary distance. In other words, as far as the fastest evolving genes are concerned, the vast majority of them are undetectable even when the cutoffs are extremely permissive.
|
One explanation given by Alba and Castresana (2005)
Because all our simulated genes have the same evolutionary age and because our results are similar to those obtained by Alba and Castresana (2005)
, we conclude that the inverse relationship between evolutionary rate and gene age is an artifact caused by our inability to detect similarity when genetic distances are large. Since genetic distance increases with time of divergence and rate of evolution, it is difficult to identify homologs of fast-evolving genes in distantly related taxa. Thus, fast-evolving genes may be misclassified as new. The only conclusion that can be drawn from the study of Alba and Castresana (2005)
is that slowly evolving genes evolve slowly.
| Acknowledgements |
|---|
|
|
|---|
We wish to thank Reed A. Cartwight for his help with DAWG and Jose Castresana for providing us access to the raw data.
| Footnotes |
|---|
1 These authors contributed equally to this work and are to be considered co-first authors.
| References |
|---|
|
|
|---|
Alba, M. M., and J. Castresana. 2005. Inverse relationship between evolutionary rate and age of mammalian genes. Mol. Biol. Evol. 22:598606.
Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped Blast and PSI-Blast: a new generation of protein database search programs. Nucleic Acids Res. 25:33893402.
Cartwight, R. A. 2005. DAWG: DNA Assembly with Gaps. (http://scit.us/projects/dawg/).
Kimura, M. 1980. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16:111120.[CrossRef][ISI][Medline]
Tatiana, A. T., and T. L. Madden. 1999. Blast 2 Sequencesa new tool for comparing protein and nucleotide sequences. FEMS Microbiol. Lett. 174:247250.[CrossRef][ISI][Medline]
Thompson, J. D., D. G. Higgins, and T. J. Gibson. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:46734680.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. A. Servin, C. W. Herbold, R. G. Skophammer, and J. A. Lake Evidence Excluding the Root of the Tree of Life from the Actinobacteria Mol. Biol. Evol., January 1, 2008; 25(1): 1 - 4. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. G. Skophammer, J. A. Servin, C. W. Herbold, and J. A. Lake Evidence for a Gram-positive, Eubacterial Root of the Tree of Life Mol. Biol. Evol., August 1, 2007; 24(8): 1761 - 1768. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-L. Lee, R. K. Jansen, T. W. Chumley, and K.-J. Kim Gene Relocations within Chloroplast Genomes of Jasminum and Menodora (Oleaceae) Are Due to Multiple, Overlapping Inversions Mol. Biol. Evol., May 1, 2007; 24(5): 1161 - 1180. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Lake, C. W. Herbold, M. C. Rivera, J. A. Servin, and R. G. Skophammer Rooting the Tree of Life Using Nonubiquitous Genes Mol. Biol. Evol., January 1, 2007; 24(1): 130 - 136. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Julenius and A. G. Pedersen Protein Evolution Is Faster Outside the Cell Mol. Biol. Evol., November 1, 2006; 23(11): 2039 - 2048. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



