MBE Advance Access originally published online on August 17, 2005
Molecular Biology and Evolution 2005 22(12):2472-2479; doi:10.1093/molbev/msi237
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Evaluation of an Improved Branch-Site Likelihood Method for Detecting Positive Selection at the Molecular Level


* Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor;
Centre for Bioinformatics, University of Copenhagen, Denmark;
Department of Biology, University College London, London, United Kingdom
E-mail: z.yang{at}ucl.ac.uk.
| Abstract |
|---|
|
|
|---|
Detecting positive Darwinian selection at the DNA sequence level has been a subject of considerable interest. However, positive selection is difficult to detect because it often operates episodically on a few amino acid sites, and the signal may be masked by negative selection. Several methods have been developed to test positive selection that acts on given branches (branch methods) or on a subset of sites (site methods). Recently, Yang, Z., and R. Nielsen (2002. Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol. Biol. Evol. 19:908917) developed likelihood ratio tests (LRTs) based on branch-site models to detect positive selection that affects a small number of sites along prespecified lineages. However, computer simulations suggested that the tests were sensitive to the model assumptions and were unable to distinguish between relaxation of selective constraint and positive selection (Zhang, J. 2004. Frequent false detection of positive selection by the likelihood method with branch-site models. Mol. Biol. Evol. 21:13321339). Here, we describe a modified branch-site model and use it to construct two LRTs, called branch-site tests 1 and 2. We applied the new tests to reanalyze several real data sets and used computer simulation to examine the performance of the two tests by examining their false-positive rate, power, and robustness. We found that test 1 was unable to distinguish relaxed constraint from positive selection affecting the lineages of interest, while test 2 had acceptable false-positive rates and appeared robust against violations of model assumptions. As test 2 is a direct test of positive selection on the lineages of interest, it is referred to as the branch-site test of positive selection and is recommended for use in real data analysis. The test appeared conservative overall, but exhibited better power in detecting positive selection than the branch-based test. Bayes empirical Bayes identification of amino acid sites under positive selection along the foreground branches was found to be reliable, but lacked power.
Key Words: computer simulation branch-site model likelihood ratio test positive selection
| Introduction |
|---|
|
|
|---|
Positive Darwinian selection at the DNA sequence level is often tested by estimating the ratio (
) of the rate of nonsynonymous (dN) nucleotide substitutions to that of synonymous (dS) substitutions between homologous protein-coding gene sequences (Nei and Kumar 2000
value significantly higher than 1 is interpreted as evidence for positive selection,
< 1 suggests purifying selection (selective constraints), and
= 1 indicates neutral evolution. Detecting positive selection is generally difficult because positive selection often acts on a few sites and in a short period of evolutionary time, and the signal may be swamped by the ubiquitous negative selection. Recent years have seen the development of several methods for testing positive selection on an individual branch, or a set of branches, of a phylogenetic tree (i.e., branch methods) (Yu and Irwin 1996
More recently, Yang and Nielsen (2002)
introduced a branch-site method for testing positive selection on individual codons along specific lineages. In this test, branches of the tree are divided a priori into foreground and background lineages, and a likelihood ratio test (LRT) is constructed by comparing a model that allows positive selection on the foreground lineages with a model that does not allow such positive selection. However, computer simulations showed that this test was sensitive to the underlying model, and, when the model assumptions were violated, could lead to frequent rejection of the null hypotheses of no selection, producing false positives (Zhang 2004
). If some sites evolve under negative selection on the background lineages, but experience a relaxation of constraints on the foreground lineages, the test may be misled to incorrectly reject the null neutral model.
In this paper, we describe a simple modification to a branch-site model proposed by Yang and Nielsen (2002)
and use it to construct two new LRTs, referred to as test 1 and test 2. We use computer simulation to evaluate the performance of the tests, examining the false-positive rate (type I error), the power, as well as the robustness. The results suggest that the branch-site test of positive selection has low false-positive rate and also more power than the branch-based test of Yang (1998)
. We also apply the new tests to the four data sets analyzed by Yang and Nielsen (2002)
. Our reanalysis led to the same conclusions as in the previous analysis.
| Methods |
|---|
|
|
|---|
LRTs and Identification of Positively Selected Sites
We refer to the reviews of Yang and Bielawski (2000)
0 < 1 estimated. Site class 1 includes codons that are evolving neutrally throughout the tree with
1 = 1. Site classes 2a and 2b include codons that are conserved or neutral on the background branches, but become under positive selection on the foreground branches with
2 > 1, estimated from the data. The model involves four parameters in the
distribution: p0, p1,
0, and
2. This is a slight modification of the old branch-site model A of Yang and Nielsen (2002)
0 = 0 fixed. The old model is very unrealistic as it does not allow conserved sites with 0 <
< 1 and is thus replaced by the model of table 1. The old branch-site model B of Yang and Nielsen (2002)
0 and
1 estimated from the data. We found it advantageous to fix
1 = 1 so that sites under weak constraint (with
close to but less than 1) are lumped into this class of neutral sites rather than being falsely claimed to be under positive selection.
|
We use branch-site model A (table 1) to construct two LRTs. Model A is the alternative hypothesis in both tests. The null hypothesis for test 1 is the site model M1a (Yang et al. 2000
0 < 1 and
0 = 1 for all branches. Significance of the test can be caused either by relaxed selective constraint on the foreground branch (e.g., presence of sites with
< 1 on the background branches and
= 1 on the foreground branches) or by positive selection along the foreground branch. Because the alternative hypothesis has two more parameters (p2 and
2) than the null hypothesis, we use
to perform the test. However, we note that the regularity conditions for the
2 approximation are not satisfied, and the correct asymptotic distribution is unknown; that is, p2 = 0 is at the boundary of the parameter space under the alternative hypothesis, and when p2 = 0,
2 is not identifiable. In test 2, the null hypothesis is the branch-site model A (table 1) but with
2 = 1 fixed. This null model allows sites evolving under negative selection on the background lineages to be released from constraint and to evolve neutrally on the foreground lineages. The test is thus a direct test for positive selection on the foreground lineages, and we refer to it also as the branch-site test of positive selection. As the alternative model A has
2
1 constrained and the test is one-sided, the asymptotic null distribution is a 50:50 mixture of point mass 0 and
with the critical values to be 2.71 and 5.41, at the 5% and 1% levels, respectively (Chernoff 1954
to perform the test.
If the test of positive selection rejects the null hypothesis, the alternative model (model A of table 1) can be used to identify the sites under positive selection along the foreground lineages. The empirical Bayes approach can be used to calculate the posterior probabilities that each site belongs to the site class of positive selection on the foreground lineages. The procedure used by Yang and Nielsen (2002)
, known as naïve empirical Bayes (NEB), uses maximum likelihood estimates (MLEs) of parameters (such as proportions of site classes and the
ratios) but ignores their sampling errors. A Bayes empirical Bayes (BEB) procedure was described recently by Yang, Wong, and Nielsen (2005)
, which accommodates uncertainties in the MLEs properly.
Computer Simulation
Computer simulations were conducted as described in Zhang (2004)
. The same two unrooted trees were used (fig. 1). In both trees, the molecular clock (rate constancy) holds for the synonymous substitution rate. Five evolutionary schemes with regard to
values for sites were used (table 2). Scheme X represented normal gene evolution, with varying degrees of purifying selection at different sites. Scheme Y represented a partial relaxation of functional constraints, with some sites having higher
values than those in scheme X. Scheme Z represented a complete relaxation of functional constraints, with all sites having
= 1. Schemes X, Y, and Z were used in the simulation of Zhang (2004)
and assume no sites under positive selection. Two new schemes, U and V, contain some positively selected sites. Scheme U differed from X in that some negatively selected sites in X became positively selected. Scheme V differed from X in a more complex manner, with some sites having higher
and some having lower
.
|
|
In the simulation, a DNA sequence of 200 codons was randomly generated for an interior node of a model tree, with equal frequencies of the 4 nucleotides. Random point mutations were then generated assuming a transition/transversion rate ratio of
= 4. The expected number of accepted nonsynonymous substitutions and the expected number of accepted synonymous substitutions followed the evolutionary scheme used. Nonsense mutations were not allowed. One branch in the model tree was chosen as the foreground branch, and all other branches were background branches. The evolutionary scheme X was always used for the background branches, whereas X, Y, Z, U, or V was used for the foreground branches. After the generation of the extant sequences, the codeml program in the PAML package (Yang 1997In about 2%3% of the simulated replicates, the alternative hypothesis had lower log likelihood than the null hypothesis, apparently because the program failed to find the global maximum likelihood. Such cases were discarded, and new data sets were simulated until 200 replicates with useful results were generated for each condition examined. For a few simulation schemes, we also used another strategy, conducting the same analysis twice by using different initial values for numerical optimization. The results were virtually identical between the two strategies.
| Simulation Results |
|---|
|
|
|---|
False-Positive Rate When the Null Model is Correct
We first conducted simulations under the null model to examine the reliability of the
2 approximation to the null distribution. Tree I of figure 1 is used to generate data, but the unit of evolution is changed so that 0.1 synonymous substitutions per synonymous site in the tree becomes 0.3 nucleotide substitutions per codon.
We first evaluate test 1. We simulated 1,000 data sets under site model M1a, with two site classes in proportions p0 = 0.7 and p1 = 0.3 and
ratios
0 = 0.2 and
1 = 1. Other simulation conditions are as previously described. There are 200 codons in the sequence. The data are analyzed under the null model M1a (neutral) and under the alternative branch-site model A, with branch
used as the foreground branch. In 512 of the 1,000 replicate data sets, the test statistic was equal to 0. The estimated critical values at the 10%, 5%, and 1% levels from the simulated empirical distribution are 2.33, 3.37, and 6.10, respectively. The critical values from the
distribution are 4.60, 6.63, and 9.21. Use of
is thus conservative, as expected.
Next, we examine test 2, the branch-site test of positive selection. We generate 1,000 data sets under the null model of the test. The 200 sites are drawn at random from four site classes in proportions p0 = 0.6, p1 = 0.2, p2a = 0.15, and p2b = 0.05, with
ratios specified according to table 1 with
0 = 0.2,
1 = 1, and
2 = 1. Branch
is the foreground branch. Each data set is analyzed under branch-site model A, either with
2 = 1 fixed or with
2
1 estimated. In 521 of the 1,000 data sets, the test statistic was equal to 0. Asymptotically, 2
should follow a 50:50 mixture of a point mass at 0 and the
distribution (Self and Liang 1987
), so that the proportion of zeros should be about 50%. The estimated critical values at the 10%, 5%, and 1% levels from the empirical distribution are 1.75, 2.95, and 5.98, respectively. The critical values from the
distribution are 2.70, 3.84, and 5.99, while the critical values based on the 50:50 mixture of 0 and the
distribution are 1.64, 2.71, and 5.41. Thus, use of the
distribution makes the test too conservative, while use of the mixture makes the test slightly too liberal.
We suspect that the slight discrepancy between the simulated distribution and the mixture distribution is due to the small sample size because there are only 200 codons in the sequence. To confirm this, we conducted another set of simulations, increasing the sequence length to 3,000 codons. In 515 of the 1,000 data sets, the test statistic was equal to 0. Again, this is close to the expected proportion of 50%. The estimated critical values at the 10%, 5%, and 1% levels from the empirical distribution are 1.37, 2.57, and 4.62, respectively, which are close to but smaller than 1.64, 2.71, and 5.41, the expected values from the mixture distribution. Thus, for this sequence length, use of both the mixture distribution and the
distribution is conservative.
The simulations under the null models of the tests confirm that the asymptotic distribution for test 2 (the branch-site test of positive selection) is the 50:50 mixture of a point mass at 0 and the
distribution, and use of the
distribution makes the test conservative. Similarly, use of the
distribution makes test 1 conservative. In the rest of the paper, we use the
and
distributions for the two tests to guide against violations of model assumptions and small sample sizes.
False-Positive Rate of the LRTs Under Models of Relaxed Constraints
We examined the false-positive rate of the two LRTs when the data were simulated under models of relaxed selective constraint using the schemes of table 2. The null hypotheses of the tests were thus violated even though the data were simulated without positive selection. Under such conditions, the old branch-site tests of Yang and Nielsen (2002)
were found to have high false positives (Zhang 2004
). The results for the two new tests are shown in table 3. The first set of simulations was conducted using model tree I (fig. 1). The deepest branch in the tree,
, was the foreground branch, and all other branches were set as background branches. Evolutionary scheme X was used for both background and foreground branches; that is, there was no positive selection and no change in
among branches (table 2). When test 1 was used with the null distribution
, in 5 of the 200 replicates, the null hypothesis of no positive selection was rejected at the 5% level (table 3). At this level of significance, we should expect about 5% incorrect rejections (false positives) of the null hypothesis if the test is unbiased. Thus, test 1 is conservative. Test 2 is also conservative, as the null hypothesis was rejected in 3 of the 200 replicates. Similarly, when branches ß or
of tree I were designated the foreground branch or when tree II (fig. 1) was used with branches
, ß,
, or
used as the foreground branch, both tests are conservative, with false-positive rates below the nominal 5% (table 3).
|
When schemes X and Y were used for the background and foreground branches, respectively, test 1 rejected the null hypothesis in between
40% and 100% of the replicates. If we consider the test as a test of positive selection, these will be false-positive rates, which are unacceptably high. Alternatively, if we consider the test as a test of relaxed constraint on the foreground lineage, the positives are true positives, and the test had good power. The false-positive rate of test 2 (the branch-site test of positive selection) is at or below the nominal 5%. The results were very similar when scheme Z was applied to the foreground branch (table 3). Thus, this set of simulations showed that the type I error rate of test 2 is at or below the nominal level. However, test 1 should not be considered a reliable test of positive selection.
The above simulations examined the robustness of the LRTs to more complex variations of the
ratios among sites than assumed in the models. We also conducted two simulations to examine the robustness of the LRTs to more complex variations of the
ratio across lineages on the phylogeny. In particular, we examined whether the tests will be misled to produce high false positives when the foreground branch, which is not under positive selection, is surrounded on the tree by background branches that are under positive selection. We use branch ß on tree II as the foreground branch (with length 0.05). All other branches are background branches, evolving under scheme X, except the ancestral branch and the interior descendant branch of branch ß, which follow scheme V, with sites under positive selection (see fig. 2). Branch ß also follows scheme X. In other words, there are no sites under positive selection along branch ß, but there are some on two background branches. At the 5% significance level, positive selection was falsely detected for branch ß in 1, 5, and 6 cases by the branch test, branch-site test 1, and branch-site test 2, at the proportions 0.5%, 2.5%, and 3%, respectively. The second simulation was conducted in the same way except that all three background branches surrounding branch ß (one branch ancestral and two branches descendent to branch ß) evolve under scheme V, while branch ß again follows scheme X. The false-positive rate at the 5% significance level was 0%, 2.5%, and 1.5% for the branch test, branch-site test 1, and branch-site test 2, respectively. These results suggest that all three tests were robust and not misled by positive selection on branches close to the foreground branch.
|
Power of LRTs in Detecting Positive Selection
We examined how often the two LRTs detected positive selection if it indeed occurred on the foreground branches (table 4). We used scheme X for background branches and used either U or V for foreground branches, which include positively selected sites. For comparison, we also used the branch test of Yang (1998)
among sites and detects positive selection only if the
ratio for the whole sequence is significantly greater than 1 on the foreground lineages. In tree I, the two branch-site tests were significant in 7.5%17% of the cases (table 4). Although these rates of detection are low, we note that the branch test detected positive selection in only 0%2% of the cases, presumably because the average
ratio across all sites is <1 for the foreground branch under both schemes U and V (table 2). Thus, the new branch-site tests are considerably more powerful than the branch test. It is also interesting to note that test 2 (the branch-site test of positive selection) had about the same power as test 1. Similar results were obtained in data simulated using tree II (table 4).
|
One may expect it to be easier to detect positive selection that occurred in recent lineages than in ancient lineages because the method can infer (probabilistically) recent substitutions more reliably than ancient substitutions. This expectation was supported overall although the difference was small and sometimes in the wrong direction. For example, in tree II, the power for both tests was higher for the recent branch
than for the deep branches
or ß, but in tree I, the power was very similar for branches ß and
(table 4). The length of the foreground branch had greater influence on the power. For example, the power was higher for the long branch
than for the short branches ß or
in tree I, while in tree II, the power was much higher for the long branch
than for short branches
, ß, or
. To examine the effect of the foreground branch length, we increased the length of branch ß in tree II from 0.05 to 0.15 or 0.45. The power for both tests increased by threefold to sevenfold. We expect that the power will eventually decrease when the foreground branch becomes too long. Similarly, when several branches are designated foreground branches and experience positive selection on the same sites, the power of the tests is expected to improve. This was indeed the case: when we applied the same selection scheme to both branches ß and
in tree II, the power was much higher than if either ß or
alone was the foreground branch (table 4). Nevertheless, we stress that this analysis assumes that all foreground branches are under the same selection scheme, with the same sites under positive selection, an assumption that may not be realistic for many data sets.
Obviously, the power of the tests should improve when more sites are under positive selection on the foreground branches or when the positive selection is stronger, with higher
ratios. We conducted two additional simulations to demonstrate this effect. In the first set of simulations, we used branch ß in tree II as the foreground branch (with length 0.15), on which evolution follows scheme V, while all other branches are background branches, evolving under scheme X. However, we simulated 600 codons in the sequence, with 60 (instead of 20) codons in each domain of table 2. At the 5% significance level, positive selection was detected for branch ß in 0, 71, and 39 cases (with proportions 0, 0.355, and 0.195, respectively) for the branch test, branch-site test 1, and branch-site test 2, respectively. The corresponding proportions when there were 200 codons in the sequence are much lower, at 0.005, 0.135, and 0.110 (table 4). In the second simulation, we again used branch ß in tree II as the foreground branch (with length 0.15 and scheme V), while the background branches evolve under scheme X. There are 200 codons in the sequence. However, we increased the
ratios for domains 3 and 5 under scheme V (table 2) from 4 and 2 to 8 and 4, to model stronger positive selection on those domains. At the 5% significance level, positive selection was detected for branch ß in 4, 98, and 100 cases (with proportions 0.02, 0.490, and 0.500, respectively) for the branch test, branch-site test 1, and branch-site test 2, respectively. These proportions are much higher than 0.005, 0.135, and 0.110 (table 4) under the weaker positive selection of scheme V in table 2.
Performance of NEB and BEB in Identifying Positively Selected Sites
When the LRT of positive selection on the foreground branches is significant, the NEB (Yang and Nielsen 2002
) and BEB (Yang, Wong, and Nielsen 2005
) procedures can be used to compute the posterior probability that a codon belongs to the class of positive selection. A codon with high posterior probability is likely to be under positive selection on the foreground lineages. Here, we examine the performance of both the NEB and the BEB approaches. In our simulations, the entire sequence was divided into 10 domains, each with 20 codons. Sites within a domain have the same
. We tabulated the average frequency (f) at which a codon in a given domain is identified to be under positive selection on the foreground lineages by either NEB or BEB at either the 90% or 95% cutoffs. We refer to such frequencies as fN90, fN95, fB90, and fB95, corresponding to the method and cutoff. For domains that do not have any sites under positive selection, these frequencies will be the false-positive rates. Note that here we are evaluating the frequentist properties of the NEB and BEB methods: if the methods are good, they should satisfy fN90, fB90
0.1 and fN95, fB95
0.05 when there is no positive selection. However, we point out that NEB and BEB compute Bayesian posterior probabilities and do not explicitly control the type I error (or false positive) rate. For a discussion of the Bayesian and frequentist measures of performance in detecting positively selected sites, see Yang, Wong, and Nielsen (2005)
, who also demonstrated that the BEB procedure had excellent frequentist properties in site-based analysis.
Figure 2 shows the results obtained for tree I, with
being the foreground branch. Results obtained using other foreground branches or using tree II are similar and thus not presented. The NEB and BEB approaches were applied without first conducting the LRTs. In all cases, the posterior probability was estimated under branch-site model A (table 1). Note that only domains 3 and 5 in figure 2D and E are under positive selection, while all other domains contain no sites under positive selection. When scheme X was used for both background and foreground branches, the false-positive rate f for both NEB and BEB was extremely low (<0.001) for all domains and for both cutoffs (fig. 2A). When we changed the evolutionary scheme to Y for the foreground branch, fN90 and fN95 are higher than 0.2 for all domains (fig. 2B), and the false-positive rate of NEB was excessively high. In contrast, for the BEB method we observed fB90 < 0.1 and fB95 < 0.05 in most domains (fig. 2B). The averages over the whole sequence were fB90 = 0.04 and fB95 < 0.01, suggesting that BEB had the false-positive rate below the nominal 10% or 5% levels. When scheme Z was used for the foreground branch, NEB again had very high false-positive rates (fig. 2C). BEB performed much better, even though the false-positive rate for domains 9 and 10 was fB90
0.2 and fB95
0.1, respectively. Averaged over the whole sequence, the false-positive rates were fB90 = 0.09 and fB95 = 0.04, again below the nominal 10% or 5%. We next examined f when scheme U or V was used for the foreground branch. Under these schemes, positive selection occurs in domains 3 and 5 (table 2). We note that the power of detecting positively selected codons by either NEB or BEB is very low, with f < 1%. The false-positive rates for both NEB and BEB were also very low for domains other than 3 and 5.
| Real Data Analysis |
|---|
|
|
|---|
We apply the two branch-site tests to the four data sets analyzed by Yang and Nielsen (2002)
|
The first data set includes 19 primate lysozyme c gene sequences, originally analyzed by Messier and Stewart (1997)
Table 5 shows the results of the two LRTs discussed in this paper, in comparison with results obtained by Yang and Nielsen (2002)
for the two old branch-site tests. Both old tests are similar to branch-site test 1. The M1-MA test compares the site model M1 (neutral) and the branch-site model A, with both models fixing
0 = 0 for the conserved sites. The M3-MB test compares the site model M3 (discrete, with K = 2 classes) and the branch-site model B, with both models estimating
0 and
1 as free parameters. The old and new tests reached the same conclusions in these four data sets. There is no statistically significant support for positive selection in the primate lysozyme data set, marginally significant support in the primate BRCA1 data set, and overwhelming evidence for positive selection in the two angiosperm phytochrome data sets.
We also applied the BEB procedure to calculate the posterior probabilities that each site is under positive selection in the four data sets. In the primate lysozyme and primate BRCA1 data sets, no site had a posterior probability higher than 95%. In the phytochrome phy A-C/F data set, 27 sites were identified to be potentially under positive selection along the foreground lineage at the 95% cutoff. They are 55R, 102T, 105S, 117P, 130T, 147S, 171T, 216E, 227F, 252I, 304I, 305D, 321L, 440L, 517A, 552T, 560Y, 650T, 655S, 700A, 736K, 787N, 802V, 940T, 986M, 988Q, and 1087D (the amino acids refer to the Zea mays phyA sequence). In the phytochrome phy B/D-E data set, 19 sites were identified to be potentially under positive selection along the foreground branch at the 95% level. They are 2S, 216K, 287A, 400H, 446Y, 501Y, 535Q, 640E, 782I, 810N, 851A, 858A, 859S, 860A, 865A, 902S, 945S, 1032S, and 1046Y (the amino acids refer to the Sorghum bicolour phyB sequence).
Given that the old tests had high false positives while the new test did not under some simulation conditions, we expect that the different tests may produce different results in some data sets. It may be worthwhile to use branch-site test 2 to confirm the results if positive selection is detected using the old tests, but the sequences are short and the positive selection is weak (with small estimates of p2 and
2).
| Discussion |
|---|
|
|
|---|
In this paper, we used the modified branch-site model A (table 1) to construct two LRTs to detect positive selection affecting a small subset of sites along prespecified branches. Our computer simulation suggests that test 2 can accurately distinguish between relaxed constraint and positive selection and often has higher power in detecting positive selection than test 1. We thus recommend the use of test 2 in real data analysis. We note that this test was rather successful in our simulations and was in particular able to avoid high false positives in the presence of relaxed selective constraint, in contrast to the previous branch-site tests evaluated by Zhang (2004)
For identifying codons under positive selection in the foreground branches, our simulation suggests that NEB can have high false-positive rates and is unusable. NEB was found to be unreliable in small data sets in site-based analysis as well (Yang, Wong, and Nielsen 2005
). In contrast, BEB appeared to be reliable, with the false-positive rate below 10% or 5% when sites were identified at the 90% or 95% cutoff. However, the power of the analysis can be very low. In our simulations, it is common for the branch-site test of positive selection to provide significant support for the presence of positively selected sites on the foreground lineages, whereas no sites show a posterior probability higher than 95% according to BEB. We note that identifying sites under positive selection is intrinsically more difficult than testing whether such sites exist. When multiple sites are under positive selection with
> 1, the LRT combining evidence from all such sites may be able to infer their presence, but the information at any single site may not be strong enough for the BEB probability to reach high levels (especially as positive selection has affected only one lineage or a very few lineages on the tree). We suggest that it is still useful to be able to detect positive selection acting on the sequence even if the affected sites cannot be reliably inferred. It was also noted that a site could be identified by BEB as being under positive selection with high posterior probability, but the branch-site test of positive selection applied to the same data failed to provide significant support for positive selection. We suggest that in such a case, the results should not be interpreted as providing evidence for positive selection.
The branch-site model (table 1) makes a number of assumptions about the evolutionary process. It is not entirely clear which of the assumptions are the most worrisome and have the greatest impact on the reliability of the tests. We considered the restrictive assumptions about the selective pressures to be important and thus designed the simulation experiment to evaluate the robustness of the analysis to violations of such assumptions. For example, the null and alternative models used in the tests assume no more than three site classes, while many more site classes were used in the simulation. Branch-site model A assumes one site class for positive selection, while schemes U and V have two classes of positively selected sites, with different
s. Similarly, the tests allow only two kinds of branches on the tree, the foreground branches that are likely to be under positive selection and the background branches that are under constraint. We thus evaluated a few cases where some background branches are under positive selection. The simulation results suggest that the tests are robust to these violations of assumptions. Nevertheless, we caution that our simulation is limited in scope and it remains possible that the tests may fail under conditions not explored here. Identification of such conditions will be most useful in helping us improve the test.
| Acknowledgements |
|---|
|
|
|---|
We thank Eddie Holmes and two anonymous referees for a number of comments and suggestions. This study is supported by grants from National Institutes of Health (NIH) to J.Z., the Biotechnological and Biological Sciences Research Council to Z.Y., the Human Frontier Science Program (Y0055/2001-M) to R.N. and Z.Y., and National Science Foundation/NIH (DMS/NIGMS-0201037) to R.N.
| Footnotes |
|---|
Edward Holmes, Associate Editor
| References |
|---|
|
|
|---|
Alba, R., P. M. Kelmenson, M.-M. Cordonnier-Pratt, and L. H. Pratt. 2000. The phytochrome gene family in tomato and the rapid differential evolution of this family in angiosperms. Mol. Biol. Evol. 17:362373.
Chernoff, H. 1954. On the distribution of the likelihood ratio. Ann. Math. Stat. 25:573578.
Huttley, G. A., S. Easteal, M. C. Southey, A. Tesoriero, G. G. Giles, M. R. E. McCredie, J. L. Hopper, and D. J. Venter. 2000. Adaptive evolution of the tumour suppressor BRCA1 in humans and chimpanzees. Nat. Genet. 25:410413.[CrossRef][Web of Science][Medline]
Messier, W. and C.-B. Stewart. 1997. Episodic adaptive evolution of primate lysozymes. Nature 385:151154.[CrossRef][Medline]
Nei, M. and S. Kumar. 2000. Molecular evolution and phylogenetics. Oxford University Press, Oxford.
Nielsen, R. and Z. Yang. 1998. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics 148:929936.
Self, S. G. and K.-Y. Liang. 1987. Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J. Am. Stat. Assoc. 82:605610.[CrossRef][Web of Science]
Stewart, C.-B., J. W. Schilling, and A. C. Wilson. 1987. Adaptive evolution in the stomach lysozymes of foregut fermenters. Nature 330:401404.[CrossRef][Medline]
Suzuki, Y. and T. Gojobori. 1999. A method for detecting positive selection at single amino acid sites. Mol. Biol. Evol. 16:13151328.[Abstract]
Yang, Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13:555556.
Yang, Z. 1998. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol. Biol. Evol. 15:568573.[Abstract]
Yang, Z. 2002. Inference of selection from multiple species alignments. Curr. Opin. Genet. Dev. 12:688694.[CrossRef][Web of Science][Medline]
Yang, Z. and J. P. Bielawski. 2000. Statistical methods for detecting molecular adaptation. Trends Ecol. Evol. 15:496503.[CrossRef][Medline]
Yang, Z. and R. Nielsen. 2002. Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol. Biol. Evol. 19:908917.
Yang, Z., R. Nielsen, N. Goldman, and A.-M. K. Pedersen. 2000. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155:431449.
Yang, Z., W. S. W. Wong, and R. Nielsen. 2005. Bayes empirical Bayes inference of amino acid sites under positive selection. Mol. Biol. Evol. 22:11071118.
Yu, M. and D. M. Irwin. 1996. Evolution of stomach lysozyme: the pig lysozyme gene. Mol. Phylogenet. Evol. 5: 298308.[CrossRef][Web of Science][Medline]
Zhang, J. 2004. Frequent false detection of positive selection by the likelihood method with branch-site models. Mol. Biol. Evol. 21:13321339.
Zhang, J., S. Kumar, and M. Nei. 1997. Small-sample tests of episodic adaptive evolution: a case study of primate lysozymes. Mol. Biol. Evol. 14:13351338.[Web of Science][Medline]
Zhang, J., H. F. Rosenberg, and M. Nei. 1998. Positive Darwinian selection after gene duplication in primate ribonuclease genes. Proc. Natl. Acad. Sci. USA 95:37083713.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. A. Wilder, E. K. Hewett, and M. E. Gansner Molecular Evolution of GYPC: Evidence for Recent Structural Innovation and Positive Selection in Humans Mol. Biol. Evol., December 1, 2009; 26(12): 2679 - 2687. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Viaene, D. Vekemans, V. F. Irish, A. Geeraerts, S. Huysmans, S. Janssens, E. Smets, and K. Geuten Pistillata--Duplications as a Mode for Floral Diversification in (Basal) Asterids Mol. Biol. Evol., November 1, 2009; 26(11): 2627 - 2645. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Moreno-Estrada, K. Tang, M. Sikora, T. Marques-Bonet, F. Casals, A. Navarro, F. Calafell, J. Bertranpetit, M. Stoneking, and E. Bosch Interrogating 11 Fast-Evolving Genes for Signatures of Recent Positive Selection in Worldwide Human Populations Mol. Biol. Evol., October 1, 2009; 26(10): 2285 - 2297. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Matsuno,, V. Compagnon, G. A. Schoch, M. Schmitt, D. Debayle, J.-E. Bassard, B. Pollet, A. Hehn, D. Heintz, P. Ullmann, et al. Evolution of a Novel Phenolic Pathway for Pollen Development Science, September 25, 2009; 325(5948): 1688 - 1692. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang, R. Nielsen, and N. Goldman In defense of statistical methods for detecting positive selection PNAS, September 8, 2009; 106(36): E95 - E95. [Full Text] [PDF] |
||||
![]() |
L. B. Geyer and H.A. Lessios Lack of Character Displacement in the Male Recognition Molecule, Bindin, in Altantic Sea Urchins of the Genus Echinometra Mol. Biol. Evol., September 1, 2009; 26(9): 2135 - 2146. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Moore, D. C. T. Halliday, D. M. Rowell, A. J. Robinson, and J. S. Keogh Positive Darwinian selection results in resistance to cardioactive toxins in true toads (Anura: Bufonidae) Biol Lett, August 23, 2009; 5(4): 513 - 516. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Dong, G. He, S. Zhang, and Z. Zhang Evolution of Olfactory Receptor Genes in Primates Dominated by Birth-and-Death Process Gen Biol Evol, August 20, 2009; 2009(0): 258 - 264. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-A. Christin, E. Samaritani, B. Petitpierre, N. Salamin, and G. Besnard Evolutionary Insights on C4 Photosynthetic Subtypes in Grasses from Genomics and Phylogenetics Gen Biol Evol, August 13, 2009; 2009(0): 221 - 230. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Guirao-Rico and M. Aguade Positive Selection Has Driven the Evolution of the Drosophila Insulin-Like Receptor (InR) at Different Timescales Mol. Biol. Evol., August 1, 2009; 26(8): 1723 - 1732. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Fletcher and Z. Yang INDELible: A Flexible Simulator of Biological Sequence Evolution Mol. Biol. Evol., August 1, 2009; 26(8): 1879 - 1888. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Besnard, A. M. Muasya, F. Russier, E. H. Roalson, N. Salamin, and P.-A. Christin Phylogenomics of C4 Photosynthesis in Sedges (Cyperaceae): Multiple Appearances and Genetic Convergence Mol. Biol. Evol., August 1, 2009; 26(8): 1909 - 1919. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Chattopadhyay, S. J. Weissman, V. N. Minin, T. A. Russo, D. E. Dykhuizen, and E. V. Sokurenko High frequency of hotspot mutations in core genes of Escherichia coli due to short-term positive selection PNAS, July 28, 2009; 106(30): 12412 - 12417. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lefebure and M. J. Stanhope Pervasive, genome-wide positive selection leading to functional divergence in the bacterial genus Campylobacter Genome Res., July 1, 2009; 19(7): 1224 - 1232. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schneider, A. Souvorov, N. Sabath, G. Landan, G. H. Gonnet, and D. Graur Estimates of Positive Darwinian Selection Are Inflated by Errors in Sequencing, Annotation, and Alignment Gen Biol Evol, June 22, 2009; 2009(0): 114 - 118. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. D. O'Connor and N. I. Mundy Genotype-phenotype associations: substitution models to detect evolutionary associations between phenotypic variables and genotypic evolutionary rate Bioinformatics, June 15, 2009; 25(12): i94 - i100. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Ikeda, N. Fujii, and H. Setoguchi Molecular Evolution of Phytochromes in Cardamine nipponica (Brassicaceae) Suggests the Involvement of PHYE in Local Adaptation Genetics, June 1, 2009; 182(2): 603 - 614. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. V. Han, J. P. Demuth, C. L. McGrath, C. Casola, and M. W. Hahn Adaptive evolution of young gene duplicates in mammals Genome Res., May 1, 2009; 19(5): 859 - 867. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Mallick, S. Gnerre, P. Muller, and D. Reich The difficulty of avoiding false positives in genome scans for natural selection Genome Res., May 1, 2009; 19(5): 922 - 933. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Weadick and B. S.W. Chang Molecular Evolution of the {beta}{gamma} Lens Crystallin Superfamily: Evidence for a Retained Ancestral Function in {gamma}N Crystallins? Mol. Biol. Evol., May 1, 2009; 26(5): 1127 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nozawa, Y. Suzuki, and M. Nei Reliabilities of identifying positive selection by the branch-site and the site-prediction methods PNAS, April 21, 2009; 106(16): 6700 - 6705. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Miyake, N. Takebayashi, and D. E. Wolf Possible Diversifying Selection in the Imprinted Gene, MEDEA, in Arabidopsis Mol. Biol. Evol., April 1, 2009; 26(4): 843 - 857. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Wang, L. Yuan, S. J. Rossiter, X. Zuo, B. Ru, H. Zhong, N. Han, G. Jones, P. D. Jepson, and S. Zhang Adaptive Evolution of 5'HoxD Genes in the Origin and Diversification of the Cetacean Flipper Mol. Biol. Evol., March 1, 2009; 26(3): 613 - 622. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Anisimova and C. Kosiol Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models Mol. Biol. Evol., February 1, 2009; 26(2): 255 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-A. Christin, B. Petitpierre, N. Salamin, L. Buchi, and G. Besnard Evolution of C4 Phosphoenolpyruvate Carboxykinase in Grasses, from Genotype to Phenotype Mol. Biol. Evol., February 1, 2009; 26(2): 357 - 365. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Delport, K. Scheffler, and C. Seoighe Models of coding sequence evolution Brief Bioinform, January 1, 2009; 10(1): 97 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Proux, R. A. Studer, S. Moretti, and M. Robinson-Rechavi Selectome: a database of positive selection Nucleic Acids Res., January 1, 2009; 37(suppl_1): D404 - D407. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Cohen, N. D Rubinstein, A. Stern, U. Gophna, and T. Pupko A likelihood framework to analyse phyletic patterns Phil Trans R Soc B, December 27, 2008; 363(1512): 3903 - 3911. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-A. Christin, N. Salamin, A. M. Muasya, E. H. Roalson, F. Russier, and G. Besnard Evolutionary Switch and Genetic Convergence on rbcL following the Evolution of C4 Photosynthesis Mol. Biol. Evol., November 1, 2008; 25(11): 2361 - 2368. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Juarez, I. Comas, F. Gonzalez-Candelas, and J. J. Calvete Evolution of Snake Venom Disintegrins by Positive Darwinian Selection Mol. Biol. Evol., November 1, 2008; 25(11): 2391 - 2407. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. Karn, N. L. Clark, E. D. Nguyen, and W. J. Swanson Adaptive Evolution in Rodent Seminal Vesicle Secretion Proteins Mol. Biol. Evol., November 1, 2008; 25(11): 2301 - 2310. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ali and R. Meier Positive Selection in ASPM Is Correlated with Cerebral Cortex Evolution across Primates but Not with Whole-Brain Size Mol. Biol. Evol., November 1, 2008; 25(11): 2247 - 2250. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Dorus, Z. N. Freeman, E. R. Parker, B. D. Heath, and T. L. Karr Recent Origins of Sperm Genes in Drosophila Mol. Biol. Evol., October 1, 2008; 25(10): 2157 - 2166. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. J. Lynch, A. Tanzer, Y. Wang, F. C. Leung, B. Gellersen, D. Emera, and G. P. Wagner Adaptive changes in the transcription factor HoxA-11 are essential for the evolution of pregnancy in mammals PNAS, September 30, 2008; 105(39): 14928 - 14933. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Li, J. Wang, S. J. Rossiter, G. Jones, J. A. Cotton, and S. Zhang The hearing gene Prestin reunites echolocating bats PNAS, September 16, 2008; 105(37): 13959 - 13964. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Studer, S. Penel, L. Duret, and M. Robinson-Rechavi Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes Genome Res., September 1, 2008; 18(9): 1393 - 1402. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wang, A. Diehl, F. Wu, J. Vrebalov, J. Giovannoni, A. Siepel, and S. D. Tanksley Sequencing and Comparative Analysis of a Conserved Syntenic Segment in the Solanaceae Genetics, September 1, 2008; 180(1): 391 - 408. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Go and Y. Niimura Similar Numbers but Different Repertoires of Olfactory Receptor Genes in Humans and Chimpanzees Mol. Biol. Evol., September 1, 2008; 25(9): 1897 - 1907. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. C. Nickel, D. L. Tefft, K. Goglin, and M. D. Adams An Empirical Test for Branch-Specific Positive Selection Genetics, August 1, 2008; 179(4): 2183 - 2193. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Xu, C.-H. C. Cheng, P. Hu, H. Ye, Z. Chen, L. Cao, L. Chen, Y. Shen, and L. Chen Adaptive Evolution of Hepcidin Genes in Antarctic Notothenioid Fishes Mol. Biol. Evol., June 1, 2008; 25(6): 1099 - 1112. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pantalacci, A. Chaumot, G. Benoit, A. Sadier, F. Delsuc, E. J. P. Douzery, and V. Laudet Conserved Features and Evolutionary Shifts of the EDA Signaling Pathway Involved in Vertebrate Skin Appendage Development Mol. Biol. Evol., May 1, 2008; 25(5): 912 - 928. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Houzelstein, I. R. Goncalves, A. Orth, F. Bonhomme, and P. Netter Lgals6, a 2-Million-Year-Old Gene in Mice: A Case of Positive Darwinian Selection and Presence/Absence Polymorphism Genetics, March 1, 2008; 178(3): 1533 - 1545. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Kelley and W. J. Swanson Dietary Change and Adaptive Evolution of enamelin in Humans and Among Primates Genetics, March 1, 2008; 178(3): 1595 - 1603. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Storz, F. G. Hoffmann, J. C. Opazo, and H. Moriyama Adaptive Functional Divergence Among Triplicated {alpha}-Globin Genes in Rodents Genetics, March 1, 2008; 178(3): 1623 - 1638. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Moreno-Estrada, F. Casals, A. Ramirez-Soriano, B. Oliva, F. Calafell, J. Bertranpetit, and E. Bosch Signatures of Selection in the Human Olfactory Receptor OR5I1 Gene Mol. Biol. Evol., January 1, 2008; 25(1): 144 - 154. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Ramm, P. L. Oliver, C. P. Ponting, P. Stockley, and R. D. Emes Sexual Selection and the Adaptive Evolution of Mammalian Ejaculate Proteins Mol. Biol. Evol., January 1, 2008; 25(1): 207 - 219. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, S. F. Perry, S. Aris-Brosou, C. Selva, and T. W. Moon Characterization and functional divergence of the {alpha}1-adrenoceptor gene family: insights from rainbow trout (Oncorhynchus mykiss) Physiol Genomics, December 19, 2007; 32(1): 142 - 153. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Crespi, K. Summers, and S. Dorus Adaptive evolution of genes underlying schizophrenia Proc R Soc B, November 22, 2007; 274(1627): 2801 - 2810. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Johnson and M. A. Thomas The Monosaccharide Transporter Gene Family in Arabidopsis and Rice: A History of Duplications, Adaptive Evolution, and Functional Divergence Mol. Biol. Evol., November 1, 2007; 24(11): 2412 - 2423. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Bussink, D. Speijer, J. M. F. G. Aerts, and R. G. Boot Evolution of Mammalian Chitinase(-Like) Members of Family 18 Glycosyl Hydrolases Genetics, October 1, 2007; 177(2): 959 - 970. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. L. Clark, G. D. Findlay, X. Yi, M. J. MacCoss, and W. J. Swanson Duplication and Selection on Abalone Sperm Lysin in an Allopatric Population Mol. Biol. Evol., September 1, 2007; 24(9): 2081 - 2090. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. H. Orsi, D. R. Ripoll, M. Yeung, K. K. Nightingale, and M. Wiedmann Recombination and positive selection contribute to evolution of Listeria monocytogenes inlA Microbiology, August 1, 2007; 153(8): 2666 - 2678. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Wagner Rapid Detection of Positive Selection in Genes and Genomes Through Variation Clusters Genetics, August 1, 2007; 176(4): 2451 - 2463. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang PAML 4: Phylogenetic Analysis by Maximum Likelihood Mol. Biol. Evol., August 1, 2007; 24(8): 1586 - 1591. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Resch, L. Carmel, L. Marino-Ramirez, A. Y. Ogurtsov, S. A. Shabalina, I. B. Rogozin, and E. V. Koonin Widespread Positive Selection in Synonymous Sites of Mammalian Genes Mol. Biol. Evol., August 1, 2007; 24(8): 1821 - 1831. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. O'Neill, B. R. Lawton, M. Mateos, D. M. Carone, G. C. Ferreri, T. Hrbek, R. W. Meredith, D. N. Reznick, and R. J. O'Neill From the Cover: Ancient and continuing Darwinian selection on insulin-like growth factor II in placental fishes PNAS, July 24, 2007; 104(30): 12404 - 12409. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nei The new mutation theory of phenotypic evolution PNAS, July 24, 2007; 104(30): 12235 - 12242. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Tarraga, I. Medina, L. Arbiza, J. Huerta-Cepas, T. Gabaldon, J. Dopazo, and H. Dopazo Phylemon: a suite of web tools for molecular evolution, phylogenetics and phylogenomics Nucleic Acids Res., July 13, 2007; 35(suppl_2): W38 - W42. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Barkman, T. R. Martins, E. Sutton, and J. T. Stout Positive Selection for Single Amino Acid Change Promotes Substrate Discrimination of a Plant Volatile-Producing Enzyme Mol. Biol. Evol., June 1, 2007; 24(6): 1320 - 1329. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ruano-Rubio and M. A. Fares Testing the Neutral Fixation of Hetero-Oligomerism in the Archaeal Chaperonin CCT Mol. Biol. Evol., June 1, 2007; 24(6): 1384 - 1396. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. D. Frentiu, G. D. Bernard, C. I. Cuevas, M. P. Sison-Mangus, K. L. Prudic, and A. D. Briscoe Colloquium Papers: Adaptive evolution of color vision as seen through the eyes of butterflies PNAS, May 15, 2007; 104(suppl_1): 8634 - 8640. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Bakewell, P. Shi, and J. Zhang More genes underwent positive selection in chimpanzee evolution than in human evolution PNAS, May 1, 2007; 104(18): 7489 - 7494. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Guo and J. Kim Molecular Evolution of Drosophila Odorant Receptor Genes Mol. Biol. Evol., May 1, 2007; 24(5): 1198 - 1207. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Anisimova and Z. Yang Multiple Hypothesis Testing to Detect Lineages under Positive Selection that Affects Only a Few Sites Mol. Biol. Evol., May 1, 2007; 24(5): 1219 - 1228. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. King, P. N. Goodfellow, A. J. P. Wilkerson, W. E. Johnson, S. J. O'Brien, and J. Pecon-Slattery Evolution of the Male-Determining Gene SRY Within the Cat Family Felidae Genetics, April 1, 2007; 175(4): 1855 - 1867. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Hernandez-Hernandez, L. P. Martinez-Castilla, and E. R. Alvarez-Buylla Functional Diversification of B MADS-Box Homeotic Regulators of Flower Development: Adaptive Evolution in Protein-Protein Interaction Domains after Major Gene Duplication Events Mol. Biol. Evol., February 1, 2007; 24(2): 465 - 481. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Yu and Y.-p. Zhang The Unusual Adaptive Expansion of Pancreatic Ribonuclease Gene in Carnivora Mol. Biol. Evol., December 1, 2006; 23(12): 2326 - 2335. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Podlaha, D. M. Webb, and J. Zhang Accelerated Evolution and Loss of a Domain of the Sperm-Egg-Binding Protein SED1 in Ancestral Primates Mol. Biol. Evol., October 1, 2006; 23(10): 1828 - 1831. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Izagirre, I. Garcia, C. Junquera, C. de la Rua, and S. Alonso A Scan for Signatures of Positive Selection in Candidate Loci for Skin Pigmentation in Humans Mol. Biol. Evol., September 1, 2006; 23(9): 1697 - 1706. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Akey, W. J. Swanson, J. Madeoy, M. Eberle, and M. D. Shriver TRPV6 exhibits unusual patterns of polymorphism and divergence in worldwide populations Hum. Mol. Genet., July 1, 2006; 15(13): 2106 - 2113. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. H. Zakon, Y. Lu, D. J. Zwickl, and D. M. Hillis Sodium channel genes and the evolution of diversity in communication signals of electric fishes: Convergent molecular evolution PNAS, March 7, 2006; 103(10): 3675 - 3680. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
















