Skip Navigation


MBE Advance Access originally published online on August 25, 2009
Molecular Biology and Evolution 2009 26(12):2731-2744; doi:10.1093/molbev/msp188
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
26/12/2731    most recent
msp188v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Gibbons, J. G.
Right arrow Articles by Rokas, A.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gibbons, J. G.
Right arrow Articles by Rokas, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Benchmarking Next-Generation Transcriptome Sequencing for Functional and Evolutionary Genomics

John G. Gibbons*, Eric M. Janson*, Chris Todd Hittinger{dagger},{ddagger}, Mark Johnston{dagger},{ddagger}, Patrick Abbot* and Antonis Rokas*

* Department of Biological Sciences, Vanderbilt University, Nashville, TN
{dagger} Department of Biochemistry and Molecular Genetics, University of Colorado Denver Health Sciences Center
{ddagger} Department of Genetics, Center for Genome Sciences, Washington University in St Louis School of Medicine

E-mail: antonis.rokas{at}vanderbilt.edu.

Accepted for publication August 19, 2009.

Next-generation sequencing has opened the door to genomic analysis of nonmodel organisms. Technologies generating long-sequence reads (200–400 bp) are increasingly used in evolutionary studies of nonmodel organisms, but the short-sequence reads (30–50 bp) that can be produced at lower cost are thought to be of limited utility for de novo sequencing applications. Here, we tested this assumption by short-read sequencing the transcriptomes of the tropical disease vectors Aedes aegypti and Anopheles gambiae, for which complete genome sequences are available. Comparison of our results to the reference genomes allowed us to accurately evaluate the quantity, quality, and functional and evolutionary information content of our "test" data. We produced more than 0.7 billion nucleotides of sequenced data per species that assembled into more than 21,000 test contigs larger than 100 bp per species and covered ~27% of the Aedes reference transcriptome. Remarkably, the substitution error rate in the test contigs was ~0.25% per site, with very few indels or assembly errors. Test contigs of both species were enriched for genes involved in energy production and protein synthesis and underrepresented in genes involved in transcription and differentiation. Ortholog prediction using the test contigs was accurate across hundreds of millions of years of evolution. Our results demonstrate the considerable utility of short-read transcriptome sequencing for genomic studies of nonmodel organisms and suggest an approach for assessing the information content of next-generation data for evolutionary studies.

Key Words: next-generation DNA sequencing • Solexa • transcriptome • orthology prediction • error rate • functional annotation


Scott Edwards, Associate Editor


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
C. T. Hittinger, M. Johnston, J. T. Tossberg, and A. Rokas
Leveraging skewed transcript abundance by RNA-Seq to increase the genomic depth of the tree of life
PNAS, January 26, 2010; 107(4): 1476 - 1481.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.