Skip Navigation

This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (18)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Stuart, G. W.
Right arrow Articles by Leader, J. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stuart, G. W.
Right arrow Articles by Leader, J. J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Molecular Biology and Evolution 19:554-562 (2002)
© 2002 Society for Molecular Biology and Evolution

A Comprehensive Vertebrate Phylogeny Using Vector Representations of Protein Sequences from Whole Genomes

Gary W. Stuart, Karen Moffett and Jeffery J. Leader

*Department of Life Sciences, Indiana State University;
{dagger}Department of Mathematics, Rose-Hulman Institute of Technology

We recently developed a method for producing comprehensive gene and species phylogenies from unaligned whole genome data using singular value decomposition (SVD) to analyze character string frequencies. This work provides an integrated gene and species phylogeny for 64 vertebrate mitochondrial genomes composed of 832 total proteins. In addition, to provide a theoretical basis for the method, we present a graphical interpretation of both the original frequency matrix and the SVD-derived matrix. These large matrices describe high-dimensional Euclidean spaces within which biomolecular sequences can be uniquely represented as vectors. In particular, the SVD-derived vector space describes each protein relative to a restricted set of newly defined, independent axes, each of which represents a novel form of conserved motif, termed a correlated peptide motif. A quantitative comparison of the relative orientations of protein vectors in this space provides accurate and straightforward estimates of sequence similarity, which can in turn be used to produce comprehensive gene trees. Alternatively, the vector representations of genes from individual species can be summed, allowing species trees to be produced.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
M. J. Taylor and B. A. Peculis
Evolutionary conservation supports ancient origin for Nudt16, a nuclear-localized, RNA-binding, RNA-decapping enzyme
Nucleic Acids Res., October 1, 2008; 36(18): 6021 - 6034.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. Martin, N. N. Diaz, J. Ontrup, and T. W. Nattkemper
Hyperbolic SOM-based clustering of DNA fragment features for taxonomic visualization and classification
Bioinformatics, July 15, 2008; 24(14): 1568 - 1574.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
X. Wu, Z. Cai, X.-F. Wan, T. Hoang, R. Goebel, and G. Lin
Nucleotide composition string selection in HIV-1 subtyping using whole genomes
Bioinformatics, July 15, 2007; 23(14): 1744 - 1752.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. Hohl and M. A. Ragan
Is Multiple-Sequence Alignment Required for Accurate Inference of Phylogeny?
Syst Biol, April 1, 2007; 56(2): 206 - 221.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. H. Chu, C. P. Li, and J. Qi
Ribosomal RNA as molecular barcodes: a simple correlation analysis without sequence alignment
Bioinformatics, July 15, 2006; 22(14): 1690 - 1701.
[Abstract] [Full Text] [PDF]


Home page
Proc R Soc BHome page
S. V Edwards, W Bryan Jennings, and A. M Shedlock
Phylogenetics of modern birds in the era of genomics
Proc R Soc B, May 22, 2005; 272(1567): 979 - 992.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. H. Chu, J. Qi, Z.-G. Yu, and V. Anh
Origin and Phylogeny of Chloroplasts Revealed by a Simple Correlation Analysis of Complete Genomes
Mol. Biol. Evol., January 1, 2004; 21(1): 200 - 206.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.