Molecular Biology and Evolution, Vol 13, 93-104, Copyright © 1996 by Society for Molecular Biology and Evolution
J Felsenstein and GA Churchill
The method of Hidden Markov Models is used to allow for unequal and unknown
evolutionary rates at different sites in molecular sequences. Rates of
evolution at different sites are assumed to be drawn from a set of possible
rates, with a finite number of possibilities. The overall likelihood of
phylogeny is calculated as a sum of terms, each term being the probability
of the data given a particular assignment of rates to sites, times the
prior probability of that particular combination of rates. The
probabilities of different rate combinations are specified by a stationary
Markov chain that assigns rate categories to sites. While there will be a
very large number of possible ways of assigning rates to sites, a simple
recursive algorithm allows the contributions to the likelihood from all
possible combinations of rates to be summed, in a time proportional to the
number of different rates at a single site. Thus with three rates, the
effort involved is no greater than three times that for a single rate. This
"Hidden Markov Model" method allows for rates to differ between sites and
for correlations between the rates of neighboring sites. By summing over
all possibilities it does not require us to know the rates at individual
sites. However, it does not allow for correlation of rates at nonadjacent
sites, nor does it allow for a continuous distribution of rates over sites.
It is shown how to use the Newton-Raphson method to estimate branch lengths
of a phylogeny and to infer from a phylogeny what assignment of rates to
sites has the largest posterior probability. An example is given using
beta-hemoglobin DNA sequences in eight mammal species; the regions of high
and low evolutionary rates are inferred and also the average length of
patches of similar rates.
ORIGINAL ARTICLE
A Hidden Markov Model approach to variation among sites in rate of evolution
Department of Genetics, University of Washington, Seattle 98195, USA.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. R. Hall, K. R. Mitchell, O. Jackson-Weaver, A. S. Kooser, B. R. Cron, L. J. Crossey, and C. D. Takacs-Vesbach Molecular Characterization of the Diversity and Distribution of a Thermal Spring Microbial Community by Using rRNA and Metabolic Genes Appl. Envir. Microbiol., August 1, 2008; 74(15): 4910 - 4922. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Barquist and I. Holmes xREI: a phylo-grammar visualization webserver Nucleic Acids Res., July 1, 2008; 36(suppl_2): W65 - W69. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. W. Mount The Maximum Likelihood Approach for Phylogenetic Prediction CSH Protocols, April 1, 2008; 2008(5): pdb.top34 - pdb.top34. [Abstract] [Full Text] |
||||
![]() |
H. M. Kang, N. A. Zaitlen, C. M. Wade, A. Kirby, D. Heckerman, M. J. Daly, and E. Eskin Efficient Control of Population Structure in Model Organism Association Mapping Genetics, March 1, 2008; 178(3): 1709 - 1723. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. S. Vernikos and J. Parkhill Resolving the structural features of genomic islands: A machine learning approach Genome Res., February 1, 2008; 18(2): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. Tourasse and A.-B. Kolsto SuperCAT: a supertree database for combined and integrative multilocus sequence typing analysis of the Bacillus cereus group of bacteria (including B. cereus, B. anthracis and B. thuringiensis) Nucleic Acids Res., January 11, 2008; 36(suppl_1): D461 - D468. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Molina and E. van Nimwegen Universal patterns of purifying selection at noncoding positions in bacteria Genome Res., January 1, 2008; 18(1): 148 - 160. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Miller, K. Rosenbloom, R. C. Hardison, M. Hou, J. Taylor, B. Raney, R. Burhans, D. C. King, R. Baertsch, D. Blankenberg, et al. 28-Way vertebrate alignment and conservation track in the UCSC Genome Browser Genome Res., December 1, 2007; 17(12): 1797 - 1808. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Rasmussen and M. Kellis Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes Genome Res., December 1, 2007; 17(12): 1932 - 1942. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Yeang, J. F. J. Darot, H. F. Noller, and D. Haussler Detecting the Coevolution of Biosequences An Example of RNA Interaction Prediction Mol. Biol. Evol., September 1, 2007; 24(9): 2119 - 2131. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Serrani, R. Sanjuan, O. Ruiz-Rivero, M. Fos, and J. L. Garcia-Martinez Gibberellin Regulation of Fruit Set and Growth in Tomato Plant Physiology, September 1, 2007; 145(1): 246 - 257. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. S.W. Wong and R. Nielsen Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models Bioinformatics, August 15, 2007; 23(16): 2031 - 2037. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Mayrose, A. Doron-Faigenboim, E. Bacharach, and T. Pupko Towards realistic codon models: among site variability and dependency of synonymous and non-synonymous rates Bioinformatics, July 1, 2007; 23(13): i319 - i327. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Bowles, J.-H. Lee, R. Alberio, R. E. I. Lloyd, D. Stekel, K. H. S. Campbell, and J. C. St. John Contrasting Effects of in Vitro Fertilization and Nuclear Transfer on the Expression of mtDNA Replication Factors Genetics, July 1, 2007; 176(3): 1511 - 1526. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Boni, D. Posada, and M. W. Feldman An Exact Nonparametric Method for Inferring Mosaic Structure in Sequence Triplets Genetics, June 1, 2007; 176(2): 1035 - 1047. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. D. Nguyen, M. Yoshihama, and N. Kenmochi The Evolution of Spliceosomal Introns in Alveolates Mol. Biol. Evol., May 1, 2007; 24(5): 1093 - 1096. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. G. Beiko and R. L. Charlebois A simulation test bed for hypotheses of genome evolution Bioinformatics, April 1, 2007; 23(7): 825 - 831. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Liu, Y. Wu, L. Li, Y. Ma, and Z. Shao Thalassospira xiamenensis sp. nov. and Thalassospira profundimaris sp. nov. Int J Syst Evol Microbiol, February 1, 2007; 57(2): 316 - 320. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Qiu, Y. Huang, L. Sun, X. Zhang, Z. Liu, and W. Song Leifsonia ginsengi sp. nov., isolated from ginseng root Int J Syst Evol Microbiol, February 1, 2007; 57(2): 405 - 408. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Doron-Faigenboim and T. Pupko A Combined Empirical and Mechanistic Codon Model Mol. Biol. Evol., February 1, 2007; 24(2): 388 - 397. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Baker and K. Rogers Phylogenetic Analysis of Fungal Centromere H3 Proteins Genetics, November 1, 2006; 174(3): 1481 - 1492. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Duncan, R. I. Aminov, K. P. Scott, P. Louis, T. B. Stanton, and H. J. Flint Proposal of Roseburia faecis sp. nov., Roseburia hominis sp. nov. and Roseburia inulinivorans sp. nov., based on isolates from human faeces. Int J Syst Evol Microbiol, October 1, 2006; 56(Pt 10): 2437 - 2441. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Kosakovsky Pond, D. Posada, M. B. Gravenor, C. H. Woelk, and S. D. W. Frost Automated Phylogenetic Detection of Recombination Using a Genetic Algorithm Mol. Biol. Evol., October 1, 2006; 23(10): 1891 - 1901. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Shi, H. Gu, and C. Field Test a Clade in Phylogenetic Trees Mol. Biol. Evol., October 1, 2006; 23(10): 1976 - 1983. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Shapiro, A. Rambaut, O. G. Pybus, and E. C. Holmes A Phylogenetic Method for Detecting Positive Epistasis in Gene Sequences and Its Application to RNA Virus Evolution Mol. Biol. Evol., September 1, 2006; 23(9): 1724 - 1730. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, H. Philippe, and N. Lartillot Assessing Site-Interdependent Phylogenetic Models of Sequence Evolution Mol. Biol. Evol., September 1, 2006; 23(9): 1762 - 1775. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. J. Etherington, S. M. Ring, M. A. Charleston, J. Dicks, V. J. Rayward-Smith, and I. N. Roberts Tracing the origin and co-phylogeny of the caliciviruses. J. Gen. Virol., May 1, 2006; 87(Pt 5): 1229 - 1235. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. W. Hahn Accurate Inference and Estimation in Population Genomics Mol. Biol. Evol., May 1, 2006; 23(5): 911 - 918. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Kuhner LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters Bioinformatics, March 15, 2006; 22(6): 768 - 770. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Y. Tseng and J. Liang Estimation of Amino Acid Residue Substitution Rates at Local Spatial Regions and Application in Protein Function Inference: A Bayesian Monte Carlo Approach Mol. Biol. Evol., February 1, 2006; 23(2): 421 - 436. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Stern and T. Pupko An Evolutionary Space-Time Model with Varying Among-Site Dependencies Mol. Biol. Evol., February 1, 2006; 23(2): 392 - 400. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Pond and S. V. Muse Site-to-Site Variation of Synonymous Substitution Rates Mol. Biol. Evol., December 1, 2005; 22(12): 2375 - 2385. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Csuros and I. Miklos Statistical Alignment of Retropseudogenes and Their Functional Paralogs Mol. Biol. Evol., December 1, 2005; 22(12): 2457 - 2471. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li, S. Zhong, and W. H. Wong Reliable prediction of transcription factor binding sites by phylogenetic verification PNAS, November 22, 2005; 102(47): 16945 - 16950. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Chang and T. M. Aune Histone hyperacetylated domains across the Ifng gene region in natural killer cells and T cells PNAS, November 22, 2005; 102(47): 17095 - 17100. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhao, R. E. Davis, and I.-M. Lee Phylogenetic positions of 'Candidatus Phytoplasma asteris' and Spiroplasma kunkelii as inferred from multiple sets of concatenated core housekeeping proteins Int J Syst Evol Microbiol, September 1, 2005; 55(5): 2131 - 2141. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. King, J. Taylor, L. Elnitski, F. Chiaromonte, W. Miller, and R. C. Hardison Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences Genome Res., August 1, 2005; 15(8): 1051 - 1060. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel, G. Bejerano, J. S. Pedersen, A. S. Hinrichs, M. Hou, K. Rosenbloom, H. Clawson, J. Spieth, L. W. Hillier, S. Richards, et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes Genome Res., August 1, 2005; 15(8): 1034 - 1050. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Li and W. H. Wong Sampling motifs on phylogenetic trees PNAS, July 5, 2005; 102(27): 9481 - 9486. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. McAuliffe, M. I. Jordan, and L. Pachter Subtree power analysis and species selection for comparative genomics PNAS, May 31, 2005; 102(22): 7900 - 7905. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Liu and Z. Shao Alcanivorax dieselolei sp. nov., a novel alkane-degrading bacterium isolated from sea water and deep-sea sediment Int J Syst Evol Microbiol, May 1, 2005; 55(3): 1181 - 1186. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Hammer, S. Strehl, and S. Hagemann Homologs of Drosophila P Transposons Were Mobile in Zebrafish but Have Been Domesticated in a Common Ancestor of Chicken and Human Mol. Biol. Evol., April 1, 2005; 22(4): 833 - 844. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Blouin, D. Butt, and A. J. Roger Impact of Taxon Sampling on the Estimation of Rates of Evolution at Sites Mol. Biol. Evol., March 1, 2005; 22(3): 784 - 791. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Dufraigne, B. Fertil, S. Lespinats, A. Giron, and P. Deschavanne Detection and characterization of horizontal transfers in prokaryotes using genomic signature Nucleic Acids Res., January 13, 2005; 33(1): e6 - e6. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Janssen, R. S. Phillips, C. M. R. Turner, and M. P. Barrett Plasmodium interspersed repeats: the major multigene superfamily of malaria parasites Nucleic Acids Res., October 26, 2004; 32(19): 5712 - 5720. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Pedersen, R. Forsberg, I. M. Meyer, and J. Hein An Evolutionary Model for Protein-Coding Regions with Conserved RNA Structure Mol. Biol. Evol., October 1, 2004; 21(10): 1913 - 1922. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Pedersen, I. M. Meyer, R. Forsberg, P. Simmonds, and J. Hein A comparative method for finding and folding RNA secondary structures within protein-coding regions Nucleic Acids Res., September 24, 2004; 32(16): 4925 - 4936. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Caumette, R. Guyoneaud, J. F. Imhoff, J. Suling, and V. Gorlenko Thiocapsa marina sp. nov., a novel, okenone-containing, purple sulfur bacterium isolated from brackish coastal and marine environments Int J Syst Evol Microbiol, July 1, 2004; 54(4): 1031 - 1036. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel and D. Haussler Phylogenetic Estimation of Context-Dependent Substitution Rates by Maximum Likelihood Mol. Biol. Evol., March 1, 2004; 21(3): 468 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. H. Ardell, C. A. Lozupone, and L. F. Landweber Polymorphism, Recombination and Alternative Unscrambling in the DNA Polymerase {alpha} Gene of the Ciliate Stylonychia lemnae (Alveolata; class Spirotrichea) Genetics, December 1, 2003; 165(4): 1761 - 1777. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Mannerova, R. Pantucek, J. Doskar, P. Svec, C. Snauwaert, M. Vancanneyt, J. Swings, and I. Sedlacek Macrococcus brunensis sp. nov., Macrococcus hajekii sp. nov. and Macrococcus lamae sp. nov., from the skin of llamas Int J Syst Evol Microbiol, September 1, 2003; 53(5): 1647 - 1654. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Hughes and H. Piontkivska Phylogeny of Trypanosomatidae and Bodonidae (Kinetoplastida) Based on 18S rRNA: Evidence for Paraphyly of Trypanosoma and Six Other Genera Mol. Biol. Evol., April 1, 2003; 20(4): 644 - 652. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Husmeier and G. McGuire Detecting Recombination in 4-Taxa DNA Sequence Alignments with Bayesian Hidden Markov Models and Markov Chain Monte Carlo Mol. Biol. Evol., March 1, 2003; 20(3): 315 - 337. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Smith, L. Vigilant, and P. A. Morin The effects of sequence length and oligonucleotide mismatches on 5' exonuclease assay efficiency Nucleic Acids Res., October 15, 2002; 30(20): e111 - e111. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Susko, Y. Inagaki, C. Field, M. E. Holder, and A. J. Roger Testing for Differences in Rates-Across-Sites Distributions in Phylogenetic Subtrees Mol. Biol. Evol., September 1, 2002; 19(9): 1514 - 1523. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Schadt and K. Lange Codon and Rate Variation Models in Molecular Phylogeny Mol. Biol. Evol., September 1, 2002; 19(9): 1534 - 1549. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. E. Schadt, J. S. Sinsheimer, and K. Lange Applications of Codon and Rate Variation Models in Molecular Phylogeny Mol. Biol. Evol., September 1, 2002; 19(9): 1550 - 1562. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Lemmon and M. C. Milinkovitch The metapopulation genetic algorithm: An efficient solution for the problem of large phylogeny estimation PNAS, August 6, 2002; 99(16): 10516 - 10521. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Simon, E. A. Stone, and A. Sidow Inference of functional regions in proteins by quantification of evolutionary constraints PNAS, March 5, 2002; 99(5): 2912 - 2917. [Abstract] [Full Text] [PDF] |
||||
![]() |
|










