MBE Advance Access originally published online on July 13, 2007
Molecular Biology and Evolution 2007 24(9):2029-2039; doi:10.1093/molbev/msm139
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Treeness Triangles: Visualizing the Loss of Phylogenetic Signal
* Allan Wilson Center for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand
E-mail: D.Penny{at}massey.ac.nz.
Accepted for publication June 22, 2007.
It is well known that molecular data "saturates" with increasing sequence divergence (thereby losing phylogenetic information) and that in addition the accumulation of misleading information due to chance similarities or to systematic bias may accompany saturation as well. Exploratory data analysis methods that can quantify the extent of signal loss or convergence for a given data set are scarce. Such methods are needed because genomics delivers very long sequence alignments spanning substantial phylogenetic depth, where site saturation may be compounded by systematic biases or other alternative signals. Here we introduce the Treeness Triangle (TT) graph, in which signals detectable by Hadamard (spectral) analysis are summed into 3 categories—those supporting 1) external and 2) internal branches in the optimal tree, in addition to 3) the residuals (potential internal branches not present in the optimal tree). These 3 values are plotted in a standard ternary coordinate system. The approach is illustrated with simulated and real data sets, the latter from complete chloroplast genomes, where potential problems of paralogy or lateral gene acquisition can be excluded. The TT uncovers the divergence-dependent loss of phylogenetic signal as subsets of chloroplast genomes are investigated that span increasingly deeper evolutionary timescales. The rate of signal loss (or signal retention) varies with the gene and/or the method of analysis.
Key Words: plastid genomes spectral analysis model misspecification exploratory data analysis ternary plot Hadamard conjugation
1 These authors contributed equally to this work.
Jianzhi Zhang, Associate Editor
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
O. Deusch, G. Landan, M. Roettger, N. Gruenheit, K. V. Kowallik, J. F. Allen, W. Martin, and T. Dagan Genes of Cyanobacterial Origin in Plant Nuclear Genomes Point to a Heterocyst-Forming Plastid Ancestor Mol. Biol. Evol., April 1, 2008; 25(4): 748 - 761. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Penny, W. T. White, M. D. Hendy, and M. J. Phillips A Bias in ML Estimates of Branch Lengths in the Presence of Multiple Signals Mol. Biol. Evol., February 1, 2008; 25(2): 239 - 242. [Abstract] [Full Text] [PDF] |
||||
