MBE Advance Access originally published online on August 23, 2007
Molecular Biology and Evolution 2007 24(11):2400-2411; doi:10.1093/molbev/msm178
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
The Problem of Rooting Rapid Radiations
The Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand
E-mail: l.shavit{at}massey.ac.nz.
| Abstract |
|---|
|
|
|---|
There are many examples of groups (such as birds, bees, mammals, multicellular animals, and flowering plants) that have undergone a rapid radiation. In such cases, where there is a combination of short internal and long external branches, correctly estimating and rooting phylogenetic trees is known to be a difficult problem. In this simulation study, we tested the performances of different phylogenetic methods at estimating a tree that models a rapid radiation. We found that maximum likelihood, corrected and uncorrected neighbor-joining, and corrected and uncorrected parsimony, all suffer from biases toward specific tree topologies. In addition, we found that using a single-taxon outgroup to root a tree frequently disrupts an otherwise correct ingroup phylogeny. Moreover, for uncorrected parsimony, we found cases where several individual trees (in which the outgroup was placed incorrectly) were selected more frequently than the correct tree. Even for parameter settings where the correct tree was selected most frequently when using extremely long sequences, for sequences of up to 60,000 nucleotides the incorrectly rooted trees were each selected more frequently than the correct tree. For all the cases tested here, tree estimation using a two taxon outgroup was more accurate than when using a single-taxon outgroup. However, the ingroup was most accurately recovered when no outgroup was used.
Key Words: maximum parsimony maximum likelihood misleading zones neighbor-joining outgroup rooting topological bias
| Introduction |
|---|
|
|
|---|
The problem of tree reconstruction and rooting is known to be challenging, especially in cases of rapid radiations where there is a combination of short and long branches. In particular, long-branch attraction (Felsenstein 1978
Simulation studies have proven to be useful in evaluating the strengths and weaknesses of phylogenetic methods in tree reconstruction. Previous simulation studies on bifurcating trees show that when internal branches are small relative to external branches, even a small misspecification of the substitution model may mislead phylogenetic inference (Poe and Swofford 1999
; Ho and Jermiin 2004
). Holland et al. (2003)
conducted a simulation study of the performance of the unweighted pair group method with arithmetic mean (UPGMA), neighbor-joining (NJ), maximum parsimony (MP), and maximum likelihood (ML) methods, for a 5-taxon tree with a symmetric 4-taxon ingroup under a molecular clock. That study compared the accuracy of different phylogenetic methods for various sequence lengths, and explored the effectiveness of correcting NJ for multiple substitutions. Holland et al. (2003)
also tested the effectiveness of using an outgroup to root a tree and demonstrated some of the problems in reconstructing and rooting trees. They discovered a misleading zone where the tree estimate is consistent (i.e., the probability of estimating the correct tree tends to 1 as the sequence length tends to infinity), but for a wide range of sequence lengths, 4 incorrect trees were each chosen up to twice as frequently as the correct tree. They also established that the inclusion of a distant outgroup, which should join into a short internal branch, frequently disrupted the ingroup tree. This effect of outgroup inclusion disrupting the ingroup was also found for both mammals and birds (Lin, Waddell, and Penny 2002
; Slack et al. 2003
). In their study, Holland et al. (2003)
used only 5 taxa; as the number of taxa increases and the models become more complex, additional problems are expected.
| Materials and Methods |
|---|
|
|
|---|
To extend the work of Holland et al. (2003)
|
For all simulations, unless otherwise stated, the following settings and procedures apply. Seq-Gen version 1.3.2 (Rambaut and Grassly 1997
When comparing NJ applied to corrected distances and MP, 2 parameters are being changed simultaneously—the tree building method and whether or not a correction for multiple substitutions is done (Steel, Hendy, and Penny 1993
; Penny et al. 1996
). However, it is possible to separate the effects of these 2 parameters to allow for a better comparison between the methods. Therefore, NJ was applied both with the Jukes–Cantor correction (Jukes and Cantor 1969
) by setting the DSET option in PAUP* to JC and with no correction by setting the DSET option to P. In some cases, MP was performed with Jukes–Cantor correction in addition to its usual implementation (no correction). Although correcting MP for multiple changes is possible, it is not implemented in publicly available software. Therefore, correction for MP was implemented using our own code with distance Hadamard (Hendy and Penny 1993
) applied to distances that were corrected by the Jukes–Cantor method (this code is available from l.shavit{at}massey.ac.nz). For more information about corrected MP (cMP) and the effect of the correction on parsimony's consistency, see Steel, Hendy, and Penny (1993)
and Penny et al. (1996)
.
Sequences were generated on the model trees depicted in figure 1. Branch lengths varied according to parameters x, y, z, and w (fig. 1), where x (ranging from 0.005 to 0.025 in steps of 0.010) is the expected number of substitutions per site on each of the 5 internal branches, y (ranging from 0.1 to 0.3 in steps of 0.1) is the expected number of substitutions per site on each of the 8 external branches, z (ranging from 0 to 0.3 in steps of 0.05) is the expected number of substitutions per site on the edge connecting the outgroup taxa to the ingroup in the middle of the innermost edge, and w (ranging from 0 to 0.3 in steps of 0.05) is the expected number of substitutions per site on each outgroup branch. If z + w
1.5x + y, then there is a point on the tree such that the distances from that point to each of the leaves are all equal (we then say that "a molecular clock is maintained," though this is not true for all parameter combinations used here). One thousand data sets were generated of lengths l = (200, 400, 800, and 1,600) for each parameter combination of the model tree. The reconstructed unweighted trees (without edge lengths) were compared with the model (generating) tree.
| Results |
|---|
|
|
|---|
Eight-Taxon Simulation
Accuracy of the Methods
We first considered the ability of the methods to reconstruct the ingroup tree alone. Sequences were generated on the 8-taxon tree T8 = (((1,2),(3,4)),((5,6),(7,8))) (see fig. 1). Figure 2 shows the accuracy of the different methods in reconstructing T8 for different regions of the parameter space. The results of this simulation show that all 4 methods are consistent for all regions of the parameter space. As expected, all methods are less accurate when the internal edges are short and the external branches are long.
|
An unexpected feature is that in this parameter space MP performed as well as, and usually better than, the other methods tested. This was surprising because the tree and parameters were chosen so that it would be difficult for MP to obtain the correct tree. However, it is known that some biases can favor the correct tree (Swofford et al. 2001
Topological Bias
Two trees have the same unlabeled topology if one tree can be converted into the other (ignoring branch lengths) by a permutation of the labels (taxon names). A 2-fold symmetry is a point on any vertex or edge on the tree where precisely 2 of the subtrees are topologically identical. An example of a 2-fold symmetry is a cherry, which is defined as a single pair of leaves adjacent to a common node (McKenzie and Steel 2000
). Note that a star tree with 3 or more taxa contains no cherries, as there are more than 2 taxa adjacent to the single internal node. We investigated the bias of phylogenetic methods toward estimating trees with a certain number of cherries. The 4 possible 8-taxon, unrooted, unlabeled, bifurcating tree topologies are shown in figure 3. Their frequencies were calculated using the formula given by Hendy, Little, and Penny (1984)
(see also Penny, Hendy, and Steel 1991
). Within the 4 possible unlabeled topologies of 8-taxon bifurcating trees, 1 topology comprises 4 cherries, 2 topologies have 3 cherries, and 1 topology has 2 cherries (fig. 3).
|
To test the hypothesis that parsimony methods are biased toward selecting the highly symmetric topology of T8, 10,395 alignments (the number of 8-taxon, unrooted, bifurcating trees) were generated on an 8-taxon star tree (by setting x = 0). The expected number y of substitutions per site on the 8 (external) branches was set to 0.2, and the length of the generated sequences was set to 1,000. Each of the 5 phylogenetic methods was applied to the set of alignments, and the number of trees of each of the 4 topologies (fig. 3) was recorded. The DCOLLAPSE and LCOLLAPSE options in PAUP* were both set to "Yes," thus allowing uNJ, cNJ, and ML to collapse branches with length smaller than 10–8. For MP, the 2 COLLAPSE options MINBRLEN and MAXBRLEN were tested.
It is important to note, that because the star tree is a multifurcating tree with no internal branches, the correct number of trees having any of the 4 bifurcating topologies is 0. However, seeing that all methods selected many bifurcating trees, we compared the distribution of these with the distribution of all different 8-taxon, leaf-labeled, unrooted, bifurcating trees. The results are shown in figure 4. All methods were found to be biased toward fully resolved trees. Even when they were allowed to collapse 0-length branches, none of the methods ever recovered the star tree. Moreover, the biases demonstrated were not equivalent for all methods.
|
Strikingly, for MP, 80% of the estimated trees had 4 cherries, although only 3% (315 out of 10,395) of the 8-taxon bifurcating trees have such a topology. Furthermore, MP did not select any trees with 2 cherries or any multifurcating trees. In this example, we did not detect any differences in the results using either of the 2 collapsing options (MINBRLEN, MAXBRLEN). A less extreme bias was found for cMP, where 10% of the estimated trees had 4 cherries and 26% had 2 cherries. Both uNJ and cNJ had similar biases with only 17% of the estimated trees having 2 cherries, substantially less than the 49% (5,040 out of 10,395) of bifurcating trees having this topology. Thirteen percent of the trees constructed by uNJ and cNJ had 4 cherries, still well in excess of the 3% in the uniform distribution of the bifurcating trees.
Compared with the distribution of bifurcating trees, all methods selected more trees with topology 3 and fewer trees with topology 2 (both topologies have 3 cherries). For ML, some of the trees with topology 2 were collapsed into multifurcating trees. ML also found fewer trees with topology 4 (2 cherries) than there are in the uniform distribution.
For MP, cMP, uNJ, and cNJ, a general bias toward forming cherries was found. Although ML demonstrated less bias toward forming cherries, it did exhibit bias against collapsing edges that are adjacent to cherries. In more than 40% of the cases, ML selected multifurcating trees; however, the star tree was never selected. MP, uNJ, and cNJ estimated only bifurcating trees, even though the collapse options in PAUP* version 4b10 (Swofford 2002
) were set to "Yes." The bias toward selecting bifurcating trees with cherries is particularly evident for MP, and this is almost certainly the explanation for why MP appears to perform so well in figure 2. When the sequence length was increased to 10,000, cMP, uNJ, and cNJ selected each topology with a similar frequency (to within 2%) to that found when the length of the generated sequences was set to 1,000. ML selected more trees with 2 cherries (topology 4) and fewer trees of topology 3 than were selected when the sequence length was set to 1,000 but selected the other topologies with similar frequencies (to within 1%) to those found with sequence length l = 1,000. MP selected only (i.e., 100%) trees with 4 cherries (topology 1). We also tested the effect of setting NBEST to "No," allowing the methods to select more than one tree for each data set (while weighting the trees for each data set, so that the total weight of each data set was 1). This did not have a significant effect on the results.
We have shown that, for the parameter space used, all the phylogenetic methods tested here were consistent in reconstructing the 8-taxon tree (T8). Nonetheless, we found that the phylogenetic methods tested, and particularly MP, are biased toward specific tree topologies.
Adding a Single-Taxon Outgroup
The next simulation tested the effect of adding a single-taxon outgroup to the 8-taxon tree. Sequences were generated on the 9-taxon tree T9 = ((((1,2),(3,4)),((5,6),(7,8))),9). The expected number of substitutions per site on the edge connecting the outgroup taxa to the ingroup, z, was set to 0.
Accuracy of the Methods
Given that the simulation study done by Holland et al. (2003)
found that the addition of an outgroup can disrupt a correct ingroup, we compared the outcomes of applying the methods to the 9-taxon alignment and to an alignment of the 8 ingroup taxa alone. The results were classified into 6 categories according to the scheme shown in figure 5a, based on whether or not the 9-taxon tree (constructed from the 9-taxon alignment) was correct and whether or not the 8-taxon tree (constructed from the 8-taxon ingroup alignment) was correct. The percentage of trials resulting in each category is reported in figure 5b. As in the 8-taxon simulation, and as expected, all methods were found to be less accurate when the internal edges are short and the external branches are long (see supplementary material S1, Supplementary Material online).
|
With the inclusion of an outgroup, the accuracy in reconstructing the correct ingroup tree was reduced compared with the 8-taxon case. As expected, the results show that the more distant the outgroup becomes, the more difficult it is to reconstruct the correct tree (fig. 5). For the tree and parameters used, ML was the most accurate of the methods tested. MP, which was very accurate in reconstructing the 8-taxon tree (see Topological Bias), was particularly affected by the inclusion of the outgroup. In fact, MP was the only method that became inconsistent (with parameters x = 0.005 or x = 0.015, y = 0.3, z = 0, and w = 0.4; see supplementary material S1, Supplementary Material online). When the molecular clock was maintained, uNJ performed better than cNJ, but when the molecular clock was violated, cNJ was more accurate (see table 1).
|
Most interesting are cases in which the 8-taxon ingroup tree was correct but adding the outgroup disrupted the ingroup (these are
13% of all cases). In most of those cases, the distorted ingroup results from the outgroup attaching to the ingroup at one of the long external branches, 2 branches away from the correct short internal branch. Examples, where the addition of an outgroup distorts an ingroup tree, were previously reported for birds (Slack et al. 2003
Misleading Zone
For MP, the simulations on the 9-taxon tree with the parameters x = 0.015, y = 0.2, z = 0, and w = 0.4 were extended to include sequence lengths of l = (200, 400, 800, ..., 204,800). Trees were classified into 4 categories: I) the single correct tree, II) the 4 trees in which the ingroup phylogeny is correct but the outgroup (taxon 9) is incorrectly joined to one of the internal branches, III) the 8 trees in which the ingroup phylogeny is correct but the outgroup (taxon 9) is incorrectly joined to one of the external branches, and IV) the remaining 135,122 trees. The results for sequence lengths l = 200–102,400 are shown in figure 6.
|
Within its consistency zone, the probability of MP selecting the correct tree goes to 1 as the sequence length increases. However, following Holland et al. (2003)
1:3 between the correct tree and each of 8 incorrect trees where the outgroup attaches to one of the external branches. Sequences of
60,000 nucleotides are required before the correct tree is chosen more frequently than any other tree. With sequence length of 102,400, the correct tree is still only recovered
15% of the time. With sequence length of 204,800 (not shown), the correct tree is recovered in
28% of the time. Extrapolating from this data, we expect that a sequence length of at least 400,000 characters would be needed for MP to have a 50% success rate in finding the correct tree. It is important to note that correcting for multiple substitutions significantly reduces the size of MP's misleading zone for this combination of parameters. In fact, for cMP (as for uNJ, cNJ, and ML), a sequence length as short as 400 is already enough for the correct tree to be chosen most frequently (data not shown). For short sequence lengths (l = 200), all methods often select an incorrect tree and some incorrect trees are each selected with greater frequency than that of the correct tree. But because the number of times each tree is selected is very small, it is difficult to check whether this is statistically significant. However, as in the 5-taxon study of Holland et al. (2003)
Breaking Symmetry
In order to evaluate the effect of breaking the symmetry of the ingroup tree, we changed the 9-taxon tree so that one external edge of the ingroup is longer than the others (a higher rate of evolution) and consequentially the symmetry of the ingroup is broken. The results (fig. 7) show that the longer this ingroup branch is, the more frequently the outgroup joins it, reducing the accuracy of all methods in reconstructing the 9-taxon tree. The long external edge seems to have little effect on the accuracy of ML and cNJ, whereas a strong negative effect on both MP and uNJ was observed. The longer the selected external edge, the further we are from maintaining a molecular clock and the more pronounced the advantage of the corrected methods (ML and cNJ) over the uncorrected methods (uNJ and MP) becomes.
|
We have demonstrated that with the use of a single-taxon outgroup and a rapid radiation, it is difficult to correctly infer the position of the root, even when the ingroup tree is correct. This is particularly noticeable when the substitution rate of one ingroup taxon is higher than the others. Of particular concern is the observation that introducing an outgroup can interfere with the accuracy of the ingroup tree.
Two-Taxon Outgroup
Accuracy of the Methods
This simulation was used to evaluate the effect of including a second outgroup taxon, on the accuracy of the different methods in reconstructing the tree. Sequences were generated on the 10-taxon tree T10 (fig. 1) with 2, 1, or 0 outgroup taxon removed to acquire the 8-, 9-, and 10-taxon data sets, respectively. The phylogenetic methods were applied to the same data sets, and their ability to reconstruct the correct 8-, 9-, and 10-taxon trees was compared.
In table 2, the number of times in which the tree was reconstructed correctly is reported for each of the methods and for the 4 different branch lengths used. In every single case, correct trees were reconstructed more frequently for the 10-taxon data compared with the 9-taxon data. However, the frequency with which trees were correctly estimated for the 8-taxon data is higher than for both the 9- and 10-taxon data sets. This is true for all 4 methods with each of the sequence lengths.
|
This increase in reliability, when going from 9 to 10 taxa, runs counter to our intuition that the greater the number of taxa (and so the greater the number of internal edges that need to be estimated), the more difficult it is to reconstruct the correct tree. A possible explanation is that the more balanced topology of the 10-taxon tree makes it easier for the methods to reconstruct it. The correct ingroup is reconstructed most frequently for the 8-taxon (ingroup alone) data sets (table 3) and more frequently for the 10-taxon data set than for the 9-taxon data sets. Thus, the inclusion of a single-taxon outgroup disrupts the correctly constructed ingroup more frequently than does the inclusion of the 2 related outgroup taxa.
|
Placement of a Second Outgroup Taxon
Biologists often face the problem of choosing good outgroup taxa for tree reconstruction. In this simulation, we tested how the placement of the second outgroup taxon affects the accuracy of the methods in reconstructing the ingroup, that is, the 8-taxon tree. The ability of the methods to reconstruct the ingroup for different values of z (the expected number of substitutions on the edge connecting the outgroups' common ancestor to the ingroup) and w (the expected number of substitutions on the edge of each outgroup taxon) was compared. In addition, the outcomes of these runs were compared with the corresponding results for 9 and 8 taxon (all phylogenetic methods used were applied to the same data sets). The results are shown in figure 8, where the accuracy of the methods in reconstructing the ingroup tree using the 8, 9, and 10 taxon (unconstrained) is presented. The results are categorized into 8 categories: "rrr," ingroup correct in all (8, 9, and 10 taxon); "rrw," ingroup wrong in the 10 taxon but correct in the 8 and 9 taxon; "rwr," ingroup correct in the 8 and 10 taxon but wrong in the 9 taxon; "rww," ingroup wrong in both the 9 and 10 taxon but correct in the 8 taxon; "wrr," ingroup correct in the 9 and 10 taxon but wrong in the 8 taxon; "wrw," ingroup wrong in the 8 and 10 taxon but correct in the 9 taxon; "wwr," ingroup correct in the 10 taxon but wrong in the 8 and 9 taxon; "www," ingroup wrong in all (8, 9, and 10 taxon).
|
As expected, ML constructed the correct ingroup (for 8, 9, and 10 taxon) more frequently than did the other methods. Although uNJ performed slightly better than cNJ, both constructed the correct trees with similar frequencies (the parameters used obey the molecular clock assumption). MP reconstructed the ingroup correctly for all in only about 55% of the cases; however, it had the lowest percentage of "www" (wrong in all). Moreover, when the common ancestor of the 2 outgroup taxa was close to the ingroup (z = 0.05), MP reconstructed the ingroup tree correctly for the 8- and 10-taxon data approximately 95% of the time. In addition, MP has the highest percentage of runs in which the ingroup was reconstructed correctly in the 8 and 10 taxon but was wrong in the 9-taxon data ("rwr") and the lowest percentage of runs in which the ingroup was wrong for the 8-taxon data but was right for the others ("wrr"). These results are as expected, taking into account the bias parsimony has toward forming cherries. Cases in which the methods construct the ingroup incorrectly from the 8- and 10-taxon data sets while reconstructing the correct ingroup from the 9-taxon data set are very rare (<2%).
Finally, we tested the accuracy of the phylogenetic methods in reconstructing the 10-taxon tree for different numbers (z) of substitutions per site on the edge connecting the common ancestor of the 2 outgroup taxa to the ingroup and the effect of constraining the 2 outgroup taxa to be together. The results are shown in figure 9. The closer the common ancestor of the 2 outgroup taxa was to the ingroup (the further the 2 outgroup taxa are from each other), the more accurate the methods were in reconstructing the 10-taxon generating tree. However, it appears advantageous for z to be larger than 0 (such that there is a split separating the outgroup taxa from the ingroup). This trend is very obvious for MP, where the accuracy dropped very rapidly as the common ancestor of the 2 outgroup taxa became further from the ingroup. This trend is also noticeable for uNJ and cNJ where a more moderate change in accuracy was observed. For ML, although only a very slight drop in accuracy was found, the general trend still applies. We also found that for z = 0 constraining the 2 outgroup taxa to come together had a positive effect on the accuracy of all the methods, both in reconstructing the ingroup tree and in placing the outgroup taxa in the correct position. When z > 0, for long sequences, constraining the 2 outgroup taxa to come together did not effect the accuracy with which the methods reconstructed the ingroup tree and placed the outgroup taxa (fig. 9a). However, for short sequences and small values of z, a slight improvement was recorded (fig. 9b).
|
| Discussion |
|---|
|
|
|---|
In this simulation study, we have identified problems that are likely to affect the ability of phylogenetic methods to reconstruct tree topologies corresponding to rapid radiations (where there is a combination of short internal and long external branches). Rapid radiations are often star-like, and it is therefore important to identify possible biases in reconstructing a star tree. We established that MP, cMP, uNJ, and cNJ are all biased toward forming cherries (see fig. 4). This effect is most pronounced for MP, for which trees having 4 cherries were chosen many more times than any other topology even though the generating tree had no cherries. ML seems to be biased in a different way; it appears to collapse edges that are not adjacent to cherries. All methods are biased toward a high number of internal edges as none of the methods was successful in recovering the star tree, even when collapsing was allowed. This effect is similar to the Bayesian "star paradox," where sequences that have evolved on a star tree can give branches with posterior probability close to 1. Steel and Matsen (2007)
Our findings indicate that rooting a star-like tree (many short internal branches connecting long external branches), by joining distant outgroup taxa to a short internal edge, often prevents the correct construction of the ingroup tree (see table 3 and fig. 5). The effect is particularly strong when an outgroup taxon and an ingroup taxon share a higher substitution rate (fig. 7). In many of the cases tested, the outgroup was placed 2 branches away from the correct position. For our data, an important finding is that when a tree rooted by an outgroup is in disagreement with the unrooted ingroup tree, the unrooted ingroup tree is most often correct. For the cases tested here, we found that the use of 2 outgroup taxa is better than the use of a single outgroup taxon, both for the accuracy with which a tree is rooted and for maintaining the correct ingroup tree (see tables 2 and 3). However, ingroup tree reconstruction is more accurate when the methods are applied to the ingroup alone (see table 3 and fig. 8). We also found that using 2 outgroup taxa that are distant from each other is better than using 2 closely related outgroup taxa; this is especially true for MP. For the trees and parameters tested here, and for short sequence lengths, constraining the 2 outgroup taxa to come together is generally advantageous, especially when they are not closely related (see fig. 9b). However, for longer sequences, constraining the outgroup taxa to come together does not have an effect on the accuracy of the methods (see fig. 9a). In general, our results confirm that it is "best practice" to infer phylogeny both with and without an outgroup and then compare the results.
Correcting MP for multiple changes was found to be beneficial in the cases where the molecular clock assumption is valid, particularly in cases where MP is misleading or inconsistent. A possible explanation for this is that with the given tree topology under the molecular clock, MP suffers from long-branch attraction. With our parameters, cMP does not suffer from long-branch attraction and therefore is doing better in estimating the correct tree. Nevertheless, under the set of parameters used here, when the molecular clock assumption is violated, MP does not suffer from the long-branch attraction and is indeed biased toward the correct tree. Consequently, under our conditions when the molecular clock assumption is violated, MP is more accurate than cMP in reconstructing the correct tree. This effect is likely to be a characteristic of the highly symmetric model tree.
In the cases where the molecular clock assumption is valid, uNJ was found to be more accurate than cNJ in reconstructing both the 9-taxon tree as a whole and the relationships among the ingroup taxa (see table 1). However, when this assumption is violated, by breaking the symmetry of the tree, cNJ and ML were found to be more accurate than uNJ and MP. This effect under the molecular clock may be due to amplification of sampling error and/or because the standard correction has a bias toward overcorrecting. These results are consistent with those found in other simulation studies (Sourdis and Krimbas 1987
; Saitou and Imanishi 1989
; Holland et al. 2003
), where corrections for multiple substitutions were found to be helpful only for recovering trees with unequal rates of change along branches. Nei and Kumar (2000)
offered guidelines for constructing phylogenetic trees and our results support their argument that uncorrected distances give the correct tree more often than corrected distances when the rate of nucleotide substitution is nearly the same for all evolutionary lineages and there is no strong transition/transversion bias.
Our results support the observation of Holland et al. (2003)
that methods can be consistent but misleading (even in the absence of model misspecification). We observed a misleading zone for MP where, although the frequency with which the correct tree is found tends to 1 as the sequence length l tends to infinity, for finite yet very long sequences, a number of incorrect trees are each chosen more frequently than the correct tree.
Holland et al. (2003)
considered the boundary of the consistency zone for MP, that is, the part of the parameter space where a slight change in the edge lengths makes parsimony either consistent or inconsistent. For 5-taxon trees with 2-state data, they calculated that each of the 4 incorrect trees where the outgroup is drawn to an external edge is selected by MP twice as frequently as the correct tree. In the 5-taxon case, there are only 6 splits for which the number of substitutions needed is not the same in the 5 competing trees. All 6 splits have the same expected frequency on the boundary of MP inconsistency. Two of those splits support the correct tree and each of these has to independently compete with 2 of the other splits (see Holland et al. 2003
). The calculation for 4-state data is more complex, but we suspect that the ratio between the correct tree and each of the frequently selected incorrect trees will be equivalent to that of the 2-state data. The calculation for the 9-taxon tree is more difficult, as there are many interdependent splits. Therefore, further mathematical work is required to calculate the ratio between the correct tree and each of the incorrect trees, where the outgroup is drawn to an external edge, and to evaluate the effect of the number of taxa on the frequency with which the correct tree (with the outgroup in its correct placement) is found.
Although this study specifically tested the effects on the reconstruction of an 8-taxon symmetric tree and a simple (biologically oversimplified) substitution model, the problems reported are expected to exist in larger trees and with more complex models (in which the Jukes–Cantor model is nested). Using a complex model of sequence evolution would not have ensured that any tree-estimation properties found were general. In our study, we used 4-state data, which is the natural biological language and are known to saturate slightly slower than 2-state data (Penny et al. 2001
). It would be interesting to test the methods further using 20-state amino acid data. Bayesian phylogenetic analysis was suggested to be as robust to relative branch-length differences as ML (Mar, Harlow, and Ragan 2005
); therefore, it would also be interesting to test Bayesian inference for the cases studied here.
| Supplementary Material |
|---|
|
|
|---|
Supplementary material S1 is available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank Klaus Schliep for R-code to generate graphs and statistical advice, and we thank Warwick Allen for computer support. We also thank the Marsden Fund and the Foundation for Research Science and Technology for funding. This study would not have been possible without the use of Helix parallel computing facility (http://helix.massey.ac.nz).
| Footnotes |
|---|
Arndt Von Haeseler, Associate Editor
| References |
|---|
|
|
|---|
Bergsten J. A review of long-branch attraction. Cladistics (2005) 21:163–193.[CrossRef][Web of Science]
Bruno WJ, Halpern AL. Topological bias and inconsistency of maximum likelihood using wrong models. Mol Biol Evol. (1999) 16:564–566.[Web of Science][Medline]
Felsenstein J. Cases in which parsimony or compatibility methods will be positively misleading. Syst Zool. (1978) 27:401–410.
Felsenstein J. Inferring phylogenies (2004) Sunderland, (MA): Sinauer Associates.
Goremykin VV, Holland B, Hirsch-Ernst KI, Hellwig FH. Analysis of Acorus calamus chloroplast genome and its phylogenetic implications. Mol Biol Evol. (2005) 22:1813–1822.
Harrison GL, McLenachan PA, Phillips MJ, Slack KE, Cooper A, Penny D. Four new avian mitochondrial genomes help get to basic evolutionary questions in the late Cretaceous. Mol Biol Evol. (2004) 21:974–983.
Hendy MD, Little CHC, Penny D. Comparing trees with pendant vertices labelled. SIAM J Appl Math. (1984) 44:1054–1065.[CrossRef]
Hendy MD, Penny D. A framework for the quantitative study of evolutionary trees. Syst Zool. (1989) 38:297–309.
Hendy MD, Penny D. Spectral-analysis of phylogenetic data. J Classif. (1993) 10:5–24.[Medline]
Ho SYW, Jermiin LS. Tracing the decay of the historical signal in biological sequence data. Syst Biol. (2004) 53:623–637.[CrossRef][Web of Science][Medline]
Holland BR, Penny D, Hendy MD. Outgroup misplacement and phylogenetic inaccuracy under a molecular clock—a simulation study. Syst Biol. (2003) 52:229–238.[CrossRef][Web of Science][Medline]
Jukes TH, Cantor CR. Evolution of protein molecules. In: Mammalian protein metabolism—Munro HN, ed. (1969) New York: Academic Press. 21–132.
Leebens-Mack J, Raubeson LA, Cui LY, Kuehl JV, Fourcade MH, Chumley TW, Boore JL, Jansen RK, dePamphilis CW. Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone. Mol Biol Evol. (2005) 22:1948–1963.
Lin YH, McLenachan PA, Gore AR, Phillips MJ, Ota R, Hendy MD, Penny D. Four new mitochondrial genomes and the increased stability of evolutionary trees of mammals from improved taxon sampling. Mol Biol Evol. (2002) 19:2060–2070.
Lin YH, Waddell PJ, Penny D. Pika and vole mitochondrial genomes increase support for both rodent monophyly and glires. Gene (2002) 294:119–129.[CrossRef][Web of Science][Medline]
Lockhart PJ, Cameron SA. Trees for bees. Trends Ecol Evol. (2001) 16:84–88.[CrossRef][Medline]
Lockhart PJ, Penny D. The place of Amborella within the radiation of angiosperms. Trends Plant Sci. (2005) 10:201–202.[CrossRef][Web of Science][Medline]
Mar JC, Harlow TJ, Ragan MA. Bayesian and maximum likelihood phylogenetic analyses of protein sequence data under relative branch-length differences and model violation. BMC Evol Biol. (2005) 5:8.[CrossRef][Medline]
McKenzie A, Steel M. Distributions of cherries for two models of trees. Math Biosci. (2000) 164:81–92.[CrossRef][Web of Science][Medline]
Nei M, Kumar S. Molecular evolution and phylogenetics (2000) New York: Oxford University Press.
Penny D, Hendy MD, Lockhart PJ, Steel MA. Corrected parsimony, minimum evolution, and Hadamard conjugations. Syst Biol. (1996) 45:596–606.
Penny D, Hendy MD, Steel MA. Testing the theory of descent. In: Phylogenetic analysis of DNA sequences—Miyamoto MM, Cracraft J, eds. (1991) New York: Oxford University Press. 155–183.
Penny D, McComish BJ, Charleston MA, Hendy MD. Mathematical elegance with biochemical realism: the covarion model of molecular evolution. J Mol Evol. (2001) 53:711–723.[CrossRef][Web of Science][Medline]
Philip GK, Creevey CJ, McInerney JO. The Opisthokonta and the Ecdysozoa may not be clades: stronger support for the grouping of plant and animal than for animal and fungi and stronger support for the Coelomata than Ecdysozoa. Mol Biol Evol. (2005) 22:1175–1184.
Philippe H, Lartillot N, Brinkmann H. Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia. Mol Biol Evol. (2005) 22:1246–1253.
Poe S, Swofford DL. Taxon sampling revisited. Nature (1999) 398:299–300.
Rambaut A, Grassly NC. Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic frees. Comput Appl Biosci. (1997) 13:235–238.
Saitou N, Imanishi T. Relative efficiencies of the Fitch-Margoliash, maximum-parsimony, maximum-likelihood, minimum-evolution, and neighbor-joining methods of phylogenetic tree construction in obtaining the correct tree. Mol Biol Evol. (1989) 6:514–525.[Web of Science]
Siddall ME. Success of parsimony in the four-taxon case: long-branch repulsion by likelihood in the Farris zone. Cladistics (1998) 14:209–220.[CrossRef][Web of Science]
Slack KE, Janke A, Penny D, Arnason U. Two new avian mitochondrial genomes (penguin and goose) and a summary of bird and reptile mitogenomic features. Gene (2003) 302:43–52.[CrossRef][Web of Science][Medline]
Soltis DE, Albert VA, Savolainen V. (11 co-authors). Genome-scale data, angiosperm relationships, and ending incongruence: a cautionary tale in phylogenetics. Trends Plant Sci. (2004) 9:477–483.[CrossRef][Web of Science][Medline]
Sourdis J, Krimbas C. Accuracy of phylogenetic trees estimated from DNA-sequence data. Mol Biol Evol. (1987) 4:159–166.[Abstract]
Steel M, Matsen FA. The Bayesian star paradox persists for long finite sequences. Mol Biol Evol. (2007) 24:1075–1079.
Steel MA, Hendy MD, Penny D. Parsimony can be consistent. Syst Biol. (1993) 42:581–587.
Stefanovic S, Rice DW, Palmer JD. Long branch attraction, taxon sampling, and the earliest angiosperms: amborella or monocots? BMC Evol Biol. (2004) 4:35.[CrossRef][Medline]
Sullivan J, Joyce P. Model selection in phylogenetics. Annu Rev Ecol Evol Syst. (2005) 36:445–466.[CrossRef]
Sullivan J, Swofford DL. Should we use model-based methods for phylogenetic inference when we know that assumptions about among-site rate variation and nucleotide substitution pattern are violated? Syst Biol. (2001) 50:723–729.[CrossRef][Web of Science][Medline]
Swofford DL. PAUP*. In: Phylogenetic analysis using parsimony (*and other methods). Version 4 (2002) Sunderland, (MA): Sinauer Associates.
Swofford DL, Waddell PJ, Huelsenbeck JP, Foster PG, Lewis PO, Rogers JS. Bias in phylogenetic estimation and its relevance to the choice between parsimony and likelihood methods. Syst Biol. (2001) 50:525–539.[Web of Science][Medline]
Yang ZH. How often do wrong models produce better phylogenies? Mol Biol Evol. (1997) 14:105–108.[Web of Science][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. Dagan and W. Martin Getting a better picture of microbial evolution en route to a network of genomes Phil Trans R Soc B, August 12, 2009; 364(1527): 2187 - 2196. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. W. Graham and W. J. D. Iles Different gymnosperm outgroups have (mostly) congruent signal regarding the root of flowering plant phylogeny Am. J. Botany, January 1, 2009; 96(1): 216 - 227. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Whitfield, S. A. Cameron, D. H. Huson, and M. A. Steel Filtered Z-Closure Supernetworks for Extracting and Visualizing Recurrent Signal from Incongruent Gene Trees Syst Biol, December 1, 2008; 57(6): 939 - 947. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||











