Molecular Biology and Evolution, Vol 15, 1600-1611, Copyright © 1998 by Society for Molecular Biology and Evolution
Z Yang, R Nielsen and M Hasegawa
Models of amino acid substitution were developed and compared using maximum
likelihood. Two kinds of models are considered. "Empirical" models do not
explicitly consider factors that shape protein evolution, but attempt to
summarize the substitution pattern from large quantities of real data.
"Mechanistic" models are formulated at the codon level and separate
mutational biases at the nucleotide level from selective constraints at the
amino acid level. They account for features of sequence evolution, such as
transition-transversion bias and base or codon frequency biases, and make
use of physicochemical distances between amino acids to specify
nonsynonymous substitution rates. A general approach is presented that
transforms a Markov model of codon substitution into a model of amino acid
replacement. Protein sequences from the entire mitochondrial genomes of 20
mammalian species were analyzed using different models. The mechanistic
models were found to fit the data better than empirical models derived from
large databases. Both the mutational distance between amino acids
(determined by the genetic code and mutational biases such as the
transition-transversion bias) and the physicochemical distance are found to
have strong effects on amino acid substitution rates. A significant
proportion of amino acid substitutions appeared to have involved more than
one codon position, indicating that nucleotide substitutions at neighboring
sites may be correlated. Rates of amino acid substitution were found to be
highly variable among sites.
ORIGINAL ARTICLE
Models of amino acid substitution and applications to mitochondrial protein evolution
Department of Biology (Galton Laboratory), University College London, England. z.yang@ucl.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. Fletcher and Z. Yang INDELible: A Flexible Simulator of Biological Sequence Evolution Mol. Biol. Evol., August 1, 2009; 26(8): 1879 - 1888. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-K. Seo and H. Kishino Statistical Comparison of Nucleotide, Amino Acid, and Codon Substitution Models for Evolutionary Analysis of Protein-Coding Sequences Syst Biol, June 29, 2009; (2009) syp015v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Chen, R. DeSalle, M. Schiffman, R. Herrero, and R. D. Burk Evolutionary Dynamics of Variant Genomes of Human Papillomavirus Types 18, 45, and 97 J. Virol., February 1, 2009; 83(3): 1443 - 1455. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Delport, K. Scheffler, and C. Seoighe Models of coding sequence evolution Brief Bioinform, January 1, 2009; 10(1): 97 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. C. Almeida and R. DeSalle Orthology, Function and Evolution of Accessory Gland Proteins in the Drosophila repleta Group Genetics, January 1, 2009; 181(1): 235 - 245. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P Huelsenbeck, P. Joyce, C. Lakner, and F. Ronquist Bayesian analysis of amino acid substitution models Phil Trans R Soc B, December 27, 2008; 363(1512): 3941 - 3953. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Q. Le, N. Lartillot, and O. Gascuel Phylogenetic mixture models for proteins Phil Trans R Soc B, December 27, 2008; 363(1512): 3965 - 3976. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang Empirical evaluation of a prior for Bayesian phylogenetic inference Phil Trans R Soc B, December 27, 2008; 363(1512): 4031 - 4039. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kryazhimskiy, G. A Bazykin, J. Plotkin, and J. Dushoff Directionality in the evolution of influenza A haemagglutinin Proc R Soc B, November 7, 2008; 275(1650): 2455 - 2464. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, N. Lartillot, and H. Philippe Bayesian Comparisons of Codon Substitution Models Genetics, November 1, 2008; 180(3): 1579 - 1591. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-A. Christin, N. Salamin, A. M. Muasya, E. H. Roalson, F. Russier, and G. Besnard Evolutionary Switch and Genetic Convergence on rbcL following the Evolution of C4 Photosynthesis Mol. Biol. Evol., November 1, 2008; 25(11): 2361 - 2368. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. Karn, N. L. Clark, E. D. Nguyen, and W. J. Swanson Adaptive Evolution in Rodent Seminal Vesicle Secretion Proteins Mol. Biol. Evol., November 1, 2008; 25(11): 2301 - 2310. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, A. Diehl, F. Wu, J. Vrebalov, J. Giovannoni, A. Siepel, and S. D. Tanksley Sequencing and Comparative Analysis of a Conserved Syntenic Segment in the Solanaceae Genetics, September 1, 2008; 180(1): 391 - 408. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Rokas and S. B. Carroll Frequent and Widespread Parallel Evolution of Protein Sequences Mol. Biol. Evol., September 1, 2008; 25(9): 1943 - 1953. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. C. Almeida and R. DeSalle Evidence of Adaptive Evolution of Accessory Gland Proteins in Closely Related Species of the Drosophila repleta Group Mol. Biol. Evol., September 1, 2008; 25(9): 2043 - 2053. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Q. Le and O. Gascuel An Improved General Amino Acid Replacement Matrix Mol. Biol. Evol., July 1, 2008; 25(7): 1307 - 1320. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-K. Seo and H. Kishino Synonymous Substitutions Substantially Improve Evolutionary Inference from Highly Diverged Proteins Syst Biol, June 1, 2008; 57(3): 367 - 377. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang and R. Nielsen Mutation-Selection Models of Codon Substitution and Their Use to Estimate Selective Strengths on Codon Usage Mol. Biol. Evol., March 1, 2008; 25(3): 568 - 579. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. S. Vinh and A. von Haeseler Computational Molecular Evolution--Ziheng Yang. 2006. Oxford University Press, Oxford. 376 pp. ISBN 978-0-19-856699-1 (ISBN-10 0-19-856699-9) {pound}60 $115 (hardback). ISBN 978-0-19-856702-8 (ISBN-10 0-19-856702-2) {pound}27.50 $52.50 (paperback). Syst Biol, December 1, 2007; 56(6): 1024 - 1026. [Full Text] [PDF] |
||||
![]() |
C. T. Saunders and P. Green Insights from Modeling Protein Evolution with Context-Dependent Mutation and Asymmetric Amino Acid Selection Mol. Biol. Evol., December 1, 2007; 24(12): 2632 - 2647. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Susko and A. J. Roger On Reduced Amino Acid Alphabets for Phylogenetic Inference Mol. Biol. Evol., September 1, 2007; 24(9): 2139 - 2150. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Choi, A. Hobolth, D. M. Robinson, H. Kishino, and J. L. Thorne Quantifying the Impact of Protein Tertiary Structure on Molecular Evolution Mol. Biol. Evol., August 1, 2007; 24(8): 1769 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kosiol, I. Holmes, and N. Goldman An Empirical Codon Model for Protein Sequence Evolution Mol. Biol. Evol., July 1, 2007; 24(7): 1464 - 1479. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gojobori, H. Tang, J. M. Akey, and C.-I Wu Adaptive evolution in humans revealed by the negative correlation between the polymorphism and fixation phases of evolution PNAS, March 6, 2007; 104(10): 3907 - 3912. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Doron-Faigenboim and T. Pupko A Combined Empirical and Mechanistic Codon Model Mol. Biol. Evol., February 1, 2007; 24(2): 388 - 397. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bofkin and N. Goldman Variation in Evolutionary Processes at Different Codon Positions Mol. Biol. Evol., February 1, 2007; 24(2): 513 - 521. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Abascal, D. Posada, and R. Zardoya MtArt: A New Model of Amino Acid Replacement for Arthropoda Mol. Biol. Evol., January 1, 2007; 24(1): 1 - 5. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. L. Cantarel, H. G. Morrison, and W. Pearson Exploring the Relationship between Sequence Similarity and Accurate Phylogenetic Trees Mol. Biol. Evol., November 1, 2006; 23(11): 2090 - 2100. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Nishihara, M. Hasegawa, and N. Okada Pegasoferae, an unexpected mammalian clade revealed by tracking ancient retroposon insertions PNAS, June 27, 2006; 103(26): 9929 - 9934. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tang and C.-I Wu A New Method for Estimating Nonsynonymous Substitutions and Its Applications to Detecting Positive Selection Mol. Biol. Evol., February 1, 2006; 23(2): 372 - 379. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Y. Tseng and J. Liang Estimation of Amino Acid Residue Substitution Rates at Local Spatial Regions and Application in Protein Function Inference: A Bayesian Monte Carlo Approach Mol. Biol. Evol., February 1, 2006; 23(2): 421 - 436. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang and B. Rannala Bayesian Estimation of Species Divergence Times Under a Molecular Clock Using Multiple Fossil Calibrations with Soft Bounds Mol. Biol. Evol., January 1, 2006; 23(1): 212 - 226. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ren, H. Tanaka, and Z. Yang An Empirical Examination of the Utility of Codon-Substitution Models in Phylogeny Reconstruction Syst Biol, October 1, 2005; 54(5): 808 - 818. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Y. Yampolsky and A. Stoltzfus The Exchangeability of Amino Acids in Proteins Genetics, August 1, 2005; 170(4): 1459 - 1472. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Chen, M. Terai, L. Fu, R. Herrero, R. DeSalle, and R. D. Burk Diversifying Selection in Human Papillomavirus Type 16 Lineages Based on Complete Genome Analyses J. Virol., June 1, 2005; 79(11): 7014 - 7023. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Sasaki, M. Nikaido, H. Hamilton, M. Goto, H. Kato, N. Kanda, L. A. Pastene, Y. Cao, R. E. Fordyce, M. Hasegawa, et al. Mitochondrial Phylogenetics and Evolution of Mysticete Whales Syst Biol, February 1, 2005; 54(1): 77 - 90. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kosiol and N. Goldman Different Versions of the Dayhoff Rate Matrix Mol. Biol. Evol., February 1, 2005; 22(2): 193 - 199. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Richards, Y. Liu, B. R. Bettencourt, P. Hradecky, S. Letovsky, R. Nielsen, K. Thornton, M. J. Hubisz, R. Chen, R. P. Meisel, et al. Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution Genome Res., January 1, 2005; 15(1): 1 - 18. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan and N. Goldman Estimating the Frequency of Events That Cause Multiple-Nucleotide Changes Genetics, August 1, 2004; 167(4): 2027 - 2043. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Emes, M. C. Riley, C. M. Laukaitis, L. Goodstadt, R. C. Karn, and C. P. Ponting Comparative Evolutionary Genomics of Androgen-Binding Protein Genes Genome Res., August 1, 2004; 14(8): 1516 - 1529. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Tang, G. J. Wyckoff, J. Lu, and C.-I Wu A Universal Evolutionary Index for Amino Acid Changes Mol. Biol. Evol., August 1, 2004; 21(8): 1548 - 1556. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Emes, S. A. Beatson, C. P. Ponting, and L. Goodstadt Evolution and Comparative Genomics of Odorant- and Pheromone-Associated Genes in Rodents Genome Res., April 1, 2004; 14(4): 591 - 602. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel and D. Haussler Phylogenetic Estimation of Context-Dependent Substitution Rates by Maximum Likelihood Mol. Biol. Evol., March 1, 2004; 21(3): 468 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Rodriguez-Trelles, R. Tarrio, and F. J. Ayala Convergent neofunctionalization by positive Darwinian selection after ancient recurrent duplications of the xanthine dehydrogenase gene PNAS, November 11, 2003; 100(23): 13413 - 13417. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Robinson, D. T. Jones, H. Kishino, N. Goldman, and J. L. Thorne Protein Evolution with Dependence Among Codons Due to Tertiary Structure Mol. Biol. Evol., October 1, 2003; 20(10): 1692 - 1704. [Abstract] [Full Text] |
||||
![]() |
B. S. W. Chang Ancestral Gene Reconstruction and Synthesis of Ancient Rhodopsins in the Laboratory Integr. Comp. Biol., August 1, 2003; 43(4): 500 - 507. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Goldman and S. Whelan A Novel Use of Equilibrium Frequencies in Models of Sequence Evolution Mol. Biol. Evol., November 1, 2002; 19(11): 1821 - 1831. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Schadt and K. Lange Codon and Rate Variation Models in Molecular Phylogeny Mol. Biol. Evol., September 1, 2002; 19(9): 1534 - 1549. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. E. Schadt, J. S. Sinsheimer, and K. Lange Applications of Codon and Rate Variation Models in Molecular Phylogeny Mol. Biol. Evol., September 1, 2002; 19(9): 1550 - 1562. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. J. Wyckoff, J. Li, and C.-I Wu Molecular Evolution of Functional Genes on the Mammalian Y Chromosome Mol. Biol. Evol., September 1, 2002; 19(9): 1633 - 1636. [Full Text] [PDF] |
||||
![]() |
F. Rodriguez-Trelles, R. Tarrio, and F. J. Ayala A methodological bias toward overestimation of molecular evolutionary time scales PNAS, June 11, 2002; 99(12): 8112 - 8115. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-w. Kao and S.-C. Lee Phosphoglucose Isomerases of Hagfish, Zebrafish, Gray Mullet, Toad, and Snake, with Reference to the Evolution of the Genes in Vertebrates Mol. Biol. Evol., April 1, 2002; 19(4): 367 - 374. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Fornasari, G. Parisi, and J. Echave Site-Specific Amino Acid Replacement Matrices from Structurally Constrained Protein Evolution Simulations Mol. Biol. Evol., March 1, 2002; 19(3): 352 - 356. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Xia and Z. Xie Protein Structure, Neighbor Effect, and a New Index of Amino Acid Dissimilarities Mol. Biol. Evol., January 1, 2002; 19(1): 58 - 67. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Rodriguez-Trelles, R. Tarrio, and F. J. Ayala Erratic overdispersion of three molecular clocks: GPDH, SOD, and XDH PNAS, September 5, 2001; (2001) 201392198. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. R. Morton Selection at the Amino Acid Level Can Influence Synonymous Codon Usage: Implications for the Study of Codon Adaptation in Plastid Genes Genetics, September 1, 2001; 159(1): 347 - 358. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan and N. Goldman A General Empirical Model of Protein Evolution Derived from Multiple Protein Families Using a Maximum-Likelihood Approach Mol. Biol. Evol., May 1, 2001; 18(5): 691 - 699. [Abstract] [Full Text] |
||||
![]() |
D. Posada and K. A. Crandall Simple (Wrong) Models for Complex Trees: A Case from Retroviridae Mol. Biol. Evol., February 1, 2001; 18(2): 271 - 275. [Full Text] |
||||
![]() |
A. D. Yoder and Z. Yang Estimation of Primate Speciation Dates Using Local Molecular Clocks Mol. Biol. Evol., July 1, 2000; 17(7): 1081 - 1090. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang and R. Nielsen Estimating Synonymous and Nonsynonymous Substitution Rates Under Realistic Evolutionary Models Mol. Biol. Evol., January 1, 2000; 17(1): 32 - 43. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yokoyama and F. B. Radlwimmer The Molecular Genetics of Red and Green Color Vision in Mammals Genetics, October 1, 1999; 153(2): 919 - 932. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Rodriguez-Trelles, R. Tarrio, and F. J. Ayala Erratic overdispersion of three molecular clocks: GPDH, SOD, and XDH PNAS, September 25, 2001; 98(20): 11405 - 11410. [Abstract] [Full Text] [PDF] |
||||









