Skip Navigation


MBE Advance Access originally published online on November 28, 2008
Molecular Biology and Evolution 2009 26(2):473-480; doi:10.1093/molbev/msn275
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
26/2/473    most recent
msn275v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Cartwright, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cartwright, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Problems and Solutions for Estimating Indel Rates and Length Distributions

Reed A. Cartwright

Department of Genetics, Bioinformatics Research Center, North Carolina State University, Raleigh, NC

E-mail: racartwr{at}ncsu.edu.

Accepted for publication November 18, 2008.

Insertions and deletions (indels) are fundamental but understudied components of molecular evolution. Here we present an expectation–maximization algorithm built on a pair hidden Markov model that is able to properly handle indels in neutrally evolving DNA sequences. From a data set of orthologous introns, we estimate relative rates and length distributions of indels among primates and rodents. This technique has the advantage of potentially handling large genomic data sets. We find that a zeta power-law model of indel lengths provides a much better fit than the traditional geometric model and that indel processes are conserved between our taxa. The estimated relative rates are about 12–16 indels per 100 substitutions, and the estimated power-law magnitudes are about 1.6–1.7. More significantly, we find that using the traditional geometric/affine model of indel lengths introduces artifacts into evolutionary analysis, casting doubt on studies of the evolution and diversity of indel formation using traditional models and invalidating measures of species divergence that include indel lengths.

Key Words: indel • power law • conservation • estimation • comparative genomics


Asger Hobolth, Associate Editor


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
W. Fletcher and Z. Yang
INDELible: A Flexible Simulator of Biological Sequence Evolution
Mol. Biol. Evol., August 1, 2009; 26(8): 1879 - 1888.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.