MBE Advance Access published online on February 22, 2008
Molecular Biology and Evolution, doi:10.1093/molbev/msn034
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
research article |
Integrating Markov Clustering and Molecular Phylogenetics to Reconstruct the Cyanobacterial Species Tree from Conserved Protein Families
Institute of Low Temperature Science, Hokkaido University, N19W8, Sapporo 060-0819 Japan
Departments of Biology and Chemistry, Campus Box 1229, Washington University St. Louis, MO 63130
School of Natural Sciences, University of California, Merced, Merced, CA 95344
* Corresponding author: jason.raymond{at}ucmerced.edu
Received for publication September 4, 2007. Revision received December 20, 2007. Accepted for publication December 26, 2007.
Attempts to classify living organisms by their physical characteristics are as old as biology itself. The advent of protein and DNA sequencing – most notably the use of 16S rRNA – defined a new level of classification that now forms our basic understanding of the history of life on Earth. High-throughput sequencing currently provides DNA sequences at an unprecedented rate, providing a wealth of information, but also posing considerable analytical challenges. Here we present comparative genomics-based methods useful for automating evolutionary analysis between any number of species. As a practical example, we applied our method to the well-studied cyanobacterial lineage. The 24 cyanobacterial genomes compared here occupy a wide variety of environmental niches and play major roles in global carbon and nitrogen cycles. By integrating phylogenetic data inferred for upwards of 1000 protein-coding genes common to all or most cyanobacteria, we have reconstructed an evolutionary history of the phylum, establishing a framework for resolving key issues regarding the evolution of their metabolic and phenotypic diversity. Greater resolution on individual branches can be attained by telescoping inward to the larger set of conserved proteins between fewer taxa. The construction of all individual protein phylogenies allows for quantitative tree scoring, providing insight into the evolutionary history of each protein family as well as probing the limits of phylogenetic resolution. The tools incorporated here are fast, computationally tractable, and easily extendable to other phyla and provide a scaleable framework for contrasting and integrating the information present in thousands of protein coding genes within related genomes
Key Words: genomics cyanobacteria evolution Markov clustering phylogenomics
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
E. A. Welsh, M. Liberton, J. Stockel, T. Loh, T. Elvitigala, C. Wang, A. Wollam, R. S. Fulton, S. W. Clifton, J. M. Jacobs, et al. The genome of Cyanothece 51142, a unicellular diazotrophic cyanobacterium important in the marine nitrogen cycle PNAS, September 30, 2008; 105(39): 15094 - 15099. [Abstract] [Full Text] [PDF] |
||||
