Molecular Biology and Evolution 18:763-776 (2001)
© 2001 Society for Molecular Biology and Evolution
ARTICLE |
A Dependent-Rates Model and an MCMC-Based Methodology for the Maximum-Likelihood Analysis of Sequences with Overlapping Reading Frames
Department of Theoretical Statistics, Institute of Mathematics, University of Aarhus, Denmark
We present a model and methodology for the maximum-likelihood analysis of pairwise alignments of DNA sequences in which two genes are encoded in overlapping reading frames. In the model for the substitution process, the instantaneous rates of substitution are allowed to depend on the nucleotides occupying the sites in a neighborhood of the site subject to substitution at the instant of the substitution. By defining the neighborhood of a site to extend over all sites in the codons in both reading frames to which a site belongs, constraints imposed by the genetic code in both reading frames can be taken into account. Due to the dependency of the instantaneous rates of substitution on the states at neighboring sites, the transition probability between sequences does not factorize and therefore cannot be obtained directly. We present a Markov chain Monte Carlo procedure for obtaining the ratio of two transition probabilities between two sequences under the model considered, and we describe how maximum-likelihood parameter estimation and likelihood ratio tests can be performed using the procedure. We describe how the expected numbers of different types of substitutions in the shared history of two sequences can be calculated, and we use the described model and methodology in an analysis of a pairwise alignment of two hepatitis B sequences in which two genes are encoded in overlapping frames. Finally, we present an extended model, together with a simpler approximate estimation procedure, and use this to test the adequacy of the former model.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Miklos, A. Novak, R. Satija, R. Lyngso, and J. Hein Stochastic models of sequence evolution including insertion--deletion events Statistical Methods in Medical Research, October 1, 2009; 18(5): 453 - 485. [Abstract] [PDF] |
||||
![]() |
M. Anisimova and C. Kosiol Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models Mol. Biol. Evol., February 1, 2009; 26(2): 255 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Choi, B. D Redelings, and J. L Thorne Basing population genetic inferences and models of molecular evolution upon desired stationary distributions of DNA or protein sequences Phil Trans R Soc B, December 27, 2008; 363(1512): 3931 - 3939. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wadhawan, B. Dickins, and A. Nekrutenko Wheels within Wheels: Clues to the Evolution of the Gnas and Gnal Loci Mol. Biol. Evol., December 1, 2008; 25(12): 2745 - 2757. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, H. Philippe, and N. Lartillot Uniformization for sampling realizations of Markov processes: applications to Bayesian implementations of codon substitution models Bioinformatics, January 1, 2008; 24(1): 56 - 62. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley, S. de Groot, T. Mailund, and J. Hein Annotation of selection strengths in viral genomes Bioinformatics, November 15, 2007; 23(22): 2978 - 2986. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, H. Philippe, and N. Lartillot Exploring Fast Computational Strategies for Probabilistic Phylogenetic Analysis Syst Biol, October 1, 2007; 56(5): 711 - 726. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Choi, A. Hobolth, D. M. Robinson, H. Kishino, and J. L. Thorne Quantifying the Impact of Protein Tertiary Structure on Molecular Evolution Mol. Biol. Evol., August 1, 2007; 24(8): 1769 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. de Groot, T. Mailund, and J. Hein Comparative annotation of viral genomes with non-conserved gene structure Bioinformatics, May 1, 2007; 23(9): 1080 - 1089. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. C. Holmes, D. J. Lipman, D. Zamarin, and J. W. Yewdell Comment on "Large-Scale Sequence Analysis of Avian Influenza Isolates" Science, September 15, 2006; 313(5793): 1573b - 1573b. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, H. Philippe, and N. Lartillot Assessing Site-Interdependent Phylogenetic Models of Sequence Evolution Mol. Biol. Evol., September 1, 2006; 23(9): 1762 - 1775. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Yu and J. L. Thorne Dependence among Sites in RNA Evolution Mol. Biol. Evol., August 1, 2006; 23(8): 1525 - 1537. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. McCauley and J. Hein Using hidden Markov models and observed evolution to annotate viral genomes Bioinformatics, June 1, 2006; 22(11): 1308 - 1316. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Mateiu and B. Rannala Inferring Complex DNA Substitution Processes on Phylogenies Using Uniformization and Data Augmentation Syst Biol, April 1, 2006; 55(2): 259 - 269. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Gesell and A. von Haeseler In silico sequence evolution with site-specific interactions along phylogenetic trees Bioinformatics, March 15, 2006; 22(6): 716 - 722. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-K. Seo, H. Kishino, and J. L. Thorne Incorporating gene-specific variation when inferring and evaluating optimal evolutionary tree topologies from multilocus sequence data PNAS, March 22, 2005; 102(12): 4436 - 4441. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Pedersen, R. Forsberg, I. M. Meyer, and J. Hein An Evolutionary Model for Protein-Coding Regions with Conserved RNA Structure Mol. Biol. Evol., October 1, 2004; 21(10): 1913 - 1922. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Huttley Modeling the Impact of DNA Methylation on the Evolution of BRCA1 in Mammals Mol. Biol. Evol., September 1, 2004; 21(9): 1760 - 1768. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan and N. Goldman Estimating the Frequency of Events That Cause Multiple-Nucleotide Changes Genetics, August 1, 2004; 167(4): 2027 - 2043. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel and D. Haussler Phylogenetic Estimation of Context-Dependent Substitution Rates by Maximum Likelihood Mol. Biol. Evol., March 1, 2004; 21(3): 468 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Miklos, G. A. Lunter, and I. Holmes A "Long Indel" Model For Evolutionary Sequence Alignment Mol. Biol. Evol., March 1, 2004; 21(3): 529 - 540. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Robinson, D. T. Jones, H. Kishino, N. Goldman, and J. L. Thorne Protein Evolution with Dependence Among Codons Due to Tertiary Structure Mol. Biol. Evol., October 1, 2003; 20(10): 1692 - 1704. [Abstract] [Full Text] |
||||
![]() |
R. Forsberg and F. B. Christiansen A Codon-Based Model of Host-Specific Selection in Parasites, with an Application to the Influenza A Virus Mol. Biol. Evol., August 1, 2003; 20(8): 1252 - 1259. [Abstract] [Full Text] [PDF] |
||||







