Skip Navigation


MBE Advance Access originally published online on December 5, 2003
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow An erratum has been published
Right arrow All Versions of this Article:
21/3/419    most recent
msh029v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (17)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Smith, A. D.
Right arrow Articles by Tillier, E. R. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Smith, A. D.
Right arrow Articles by Tillier, E. R. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Mol. Biol. Evol. 21(3):419-427. 2004
DOI: 10.1093/molbev/msh029
© 2004 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038

Empirical Models for Substitution in Ribosomal RNA

Andrew D. Smith, Thomas W. H. Lui and Elisabeth R. M. Tillier

Department of Medical Biophysics, University of Toronto, and Ontario Cancer Institute, University Health Network, Toronto, Ontario, Canada

E-mail: e.tillier{at}utoronto.ca.

Empirical models of substitution are often used in protein sequence analysis because the large alphabet of amino acids requires that many parameters be estimated in all but the simplest parametric models. When information about structure is used in the analysis of substitutions in structured RNA, a similar situation occurs. The number of parameters necessary to adequately describe the substitution process increases in order to model the substitution of paired bases.

We have developed a method to obtain substitution rate matrices empirically from RNA alignments that include structural information in the form of base pairs. Our data consisted of alignments from the European Ribosomal RNA Database of Bacterial and Eukaryotic Small Subunit and Large Subunit Ribosomal RNA ( Wuyts et al. 2001. Nucleic Acids Res. 29:175–177; Wuyts et al. 2002. Nucleic Acids Res. 30:183–185). Using secondary structural information, we converted each sequence in the alignments into a sequence over a 20-symbol code: one symbol for each of the four individual bases, and one symbol for each of the 16 ordered pairs. Substitutions in the coded sequences are defined in the natural way, as observed changes between two sequences at any particular site. For given ranges (windows) of sequence divergence, we obtained substitution frequency matrices for the coded sequences. Using a technique originally developed for modeling amino acid substitutions ( Veerassamy, Smith, and Tillier. 2003. J. Comput. Biol. 10:997–1010), we were able to estimate the actual evolutionary distance for each window. The actual evolutionary distances were used to derive instantaneous rate matrices, and from these we selected a universal rate matrix.

The universal rate matrices were incorporated into the Phylip Software package ( Felsenstein 2002. http://evolution.genetics.washington.edu/phylip.html), and we analyzed the ribosomal RNA alignments using both distance and maximum likelihood methods. The empirical substitution models performed well on simulated data, and produced reasonable evolutionary trees for 16S ribosomal RNA sequences from sequenced Bacterial genomes.

Empirical models have the advantage of being easily implemented, and the fact that the code consists of 20 symbols makes the models easily incorporated into existing programs for protein sequence analysis. In addition, the models are useful for simulating the evolution of RNA sequence and structure simultaneously.

Key Words: rRNA evolution • empirical substitution model • RNA structure • simulation


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
M. Wolf, B. Ruderisch, T. Dandekar, J. Schultz, and T. Muller
ProfDistS: (profile-) distance based phylogeny on sequence--structure alignments
Bioinformatics, October 15, 2008; 24(20): 2401 - 2402.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Yu and J. L. Thorne
Dependence among Sites in RNA Evolution
Mol. Biol. Evol., August 1, 2006; 23(8): 1525 - 1537.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T. Gesell and A. von Haeseler
In silico sequence evolution with site-specific interactions along phylogenetic trees
Bioinformatics, March 15, 2006; 22(6): 716 - 722.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. J Telford, M. J Wise, and V. Gowri-Shankar
Consideration of RNA Secondary Structure Significantly Improves Likelihood-Based Estimates of Phylogeny: Examples from the Bilateria
Mol. Biol. Evol., April 1, 2005; 22(4): 1129 - 1136.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.