MBE Advance Access originally published online on April 21, 2007
Molecular Biology and Evolution 2007 24(7):1562-1574; doi:10.1093/molbev/msm078
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Comparisons of Site- and Haplotype-Frequency Methods for Detecting Positive Selection


* State Key Laboratory of Biocontrol and Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou, China
Graduate School of Natural Sciences, Nagoya City University, Nagoya, Japan
Department of Ecology and Evolution, University of Chicago
E-mail: kzeng{at}uchicago.edu.
| Abstract |
|---|
|
|
|---|
In this report, we compare the differences between various site- and haplotype-frequency tests in their power to detect positive selection by doing computer simulations. Our results are the following. 1) Although haplotype-frequency tests that are conditional on the number of haplotypes (K) were developed for nonrecombining haplotypes, these tests are insensitive to recombination. Such tests, including the EwensWatterson (EW) test, can therefore be applied to recombining haplotypes. 2) Tests conditional on the number of segregating sites (S) become overly conservative in the presence of recombination. 3) The EW test is usually the most powerful test during the sweep phase, especially when the local recombination rate is high. 4) The "extended haplotype homozygosity" test relies heavily on the prior knowledge of the target of selection. With that knowledge, it is the most powerful test, whereas in the absence of this prior information, the test has little power. We also study the sensitivities of the haplotype-frequency tests to background selection and various demographic forces. We find that these tests are sensitive to some forces other than positive selection. To alleviate the problem of low specificity, compound tests, such as the DH test (Zeng et al. 2006), may be a solution. In the companion paper (Zeng K, Shi S, Wu C-I, in preparation), we use the EW test to devise 2 compound tests, which are more powerful in detecting positive selection than DH, but are also relatively insensitive to demography.
Key Words: positive selection haplotype-frequency tests site-frequency tests
| Introduction |
|---|
|
|
|---|
DNA sequence data are a rich source for detecting adaptive evolution. Many statistical methods have been proposed for this purpose (see Fay and Wu 2003
Site-Frequency Methods
These methods require only frequencies of variants at polymorphic nucleotide sites. Linkage phase of these variants is neither required nor used. These methods are in general based on the infinite-site model (Kimura 1969
; Watterson 1975
) and utilize the site-frequency spectrum for detection (Fu 1995
). Tajima's D test (1989
; referred to as the D test) is the first test of this type. Many other D-like tests have since been proposed, including Fu and Li's tests (1993
), Fu's tests (1996
, 1997
), Fay and Wu's H test (2000
), and 2 newly derived onesE and DH (Zeng et al. 2006
). Many papers have examined the properties of these tests (Braverman et al. 1995
; Simonsen et al. 1995
; Przeworski 2002
; Zeng et al. 2006
). There are other site-frequency methods which represent a major departure in methodology from the tests mentioned above and will not be considered in this study (e.g., Kim and Stephan 2002
; Jensen et al. 2005
).
Among these tests, the DH test is of some interest. Its proposal was motivated by the observation that Tajima's D test and Fay and Wu's H test are both powerful in detecting selection, but they are sensitive to different demographic factors and are affected by background selection to different degrees (Zeng et al. 2006
). Therefore, by combining these 2 tests, the sensitivity of either test to a certain demographic factor (e.g., population growth) is counterbalanced by the insensitivity of the other test to the same factor. The compound test thus has high specificity to positive selection.
Haplotype-Frequency Methods
Methods in the second category require additional information on the linkage phase among variant sites and score a haplotype as an allele. They examine the level of haplotype polymorphism using simple summary statistics. The 3 widely used statistics are haplotype homozygosity (F), frequency of the most common haplotype (M), and configuration of haplotype frequencies (C). These measures are expressed either as statistics conditional on the number of haplotypes (K) or as statistics conditional on the number of segregating sites (S), elaborated below.
Statistics Conditional on the Number of Haplotypes (Alleles)
The statistics of F, M, or C conditional on K can be expressed in terms of Ewens' sampling distribution, which was derived under the no-recombination infinite-allele model (Ewens 1972
; Karlin and McGregor 1972
). One of the most remarkable results is the "invariant" propertythe distributions of these conditional statistics (i.e., F |K, M|K, and C|K) are independent of the fundamental parameter of population genetics,
(=4Nu, where N is the effective population size of a diploid organism and u is the neutral mutation rate of the locus). This property is useful as the estimation of
may introduce further uncertainties (Donnelly and Tavaré 1995
). We shall refer to F|K (haplotype homozygosity conditional on the number of haplotypes) as the EwensWatterson (EW) test statistic (Watterson 1978
). The other 2 similar test statistics, M|K and C|K, were proposed by Ewens (1973)
and Slatkin (1994
, 1996)
, respectively. All the 3 test statistics are independent of
and are in fact highly correlated when positive selection is in operation (see Results). Because the invariant property is true only when there is no intragenic recombination, the sensitivity to (or robustness against) recombination is crucial to the applicability of these 3 test statistics to DNA haplotype data and will be addressed in this report. There are several other test statistics, which use the sampling distribution of K directly (Strobeck 1987
; Fu 1996
, 1997
). The null distributions of these statistics are therefore dependent on
. Properties of these tests have been studied elsewhere (Fu 1996
, 1997
) and are not considered here.
Statistics Conditional on the Number of Segregating Sites
These methods are based on the infinite-site model. Similar to the haplotype tests conditional on K mentioned above, these methods usually use F, M, and C to measure levels of variability, but conditional on the number of segregating sites S. Examples include the haplotype partition test of Hudson et al. (1994
; referred to as Hudson's test), the haplotype diversity and haplotype number tests (Depaulis and Veuille 1998
), and the full configuration test (Innan et al. 2005
). Typically, the null distributions of these tests are obtained by doing coalescent simulation with the number of segregating sites fixed (Hudson 1993
; Wall and Hudson 2001
). Recombination is generally not considered in the null distribution.
LD Methods
Most methods in this category incorporate recombination in their null distributions. They use various statistics to summarize patterns of LD, including length of unrecombined segment (Slatkin and Bertorelle 2001
), extent of haplotype sharing (Toomajian et al. 2003
), and pairwise measure of LD (Kelly 1997
; Wall 1999
; Kim and Nielsen 2004
). The most popular test in this category is probably the extended haplotype homozygosity (EHH) test (Sabeti et al. 2002
) and its extensions (Hanchard et al. 2006
; Voight et al. 2006
; Wang et al. 2006). The principle underlying the EHH test is that, under neutrality, a new variant requires a long time to reach high frequency in the population. During this period, substantial recombination would have broken down the haplotype on which the mutation occurred. Therefore, long-range LD surrounding a site is considered as the signature of positive selection. Because the information on the target of selection is not easily available and is not used by other tests, our study of the EHH test is limited to the effect of this information on its power.
The main task of this report is to assess the power of haplotype- and site-frequency methods to detect positive selection (also referred to as genetic hitchhiking or selective sweep; Maynard Smith and Haigh 1974
). Although several previous studies (Fu 1996
, 1997
; Depaulis et al. 2003
, 2005
) have addressed similar issues, there are important differences. First, in this study, we incorporate intragenic recombination and analyze its effects on the applicability and power of the tests. Second, we concentrate on the prefixation (or sweep) phase when the advantageous allele is sweeping through the population. In this phase, the patterns of polymorphism and LD show characteristics that are strongly associated with hitchhiking (Fay and Wu 2000
; Stephan et al. 2006
; Zeng et al. 2006
). In contrast, the postfixation phase is characterized by an excess of low-frequency variants (i.e., accumulation of new mutations). Such excess is a general characteristic among many processes including population growth, background selection, and recovery from bottleneck (e.g., Fu 1997
; Zeng et al. 2006
). In practical terms, it may be difficult to distinguish the postfixation phase of sweep from those processes using the single-locus methods considered here. Third, we compare the sensitivities of the tests to other factors including background selection and demographic changes. We conclude that all tests except for the DH test are sensitive to some forces other than positive selection. In the companion study (Zeng K, Shi S, Wu C-I, in preparation), we develop new compound tests by using the novel properties of the EW test reported in this paper to circumvent the problem of low specificity.
| Methods |
|---|
|
|
|---|
Statistical Tests
Site-Frequency Tests
Two site-frequency tests, Tajima's D test (1989)
|
| (1) |
1) segregating sites; d (X) is the value of the D statistic of sample X, and h (X) is that of the H statistic; dp and hp are critical values such that P { d (XS)
dp } = P { h (XS)
hp } = p. Note that we used the normalized H test here (Zeng et al. 2006
, is called significant if
and
Tests Based on the Ewens Sampling Formula
We focused primarily on the EW test in the simulations because this test shows better statistical properties than the other 2 tests in this category (see Results). The test statistic of the EW test is defined as
|
| (2) |
. Thus, F|K and M|K are also independent of
. Note that our implementation of Slatkin's exact test (1996)
Haplotype Tests Conditional on the Number of Segregating Sites
The Hudson test rejects neutrality by investigating the probability of occurrence of a subset of sequences with low variation, that is, a major "haplotype class" (Hudson et al. 1994
). Here, we used the simplified definition of the test statistic given in Innan et al. (2005)
:
|
| (3) |
The EHH Test
We used a single polymorphic site as the core in the EHH analyses. Because the goal was to look for signal of recent positive sweeps, we used the derived allele at the core as the core haplotype. The EHH statistic is defined as the probability that 2 randomly chosen genes carrying the core haplotype of interest are identical by descent for the entire interval from the core single nucleotide polymorphism to the point x (Sabeti et al. 2002
). EHH values are therefore directional and can be calculated in both the 3' and 5' directions. Here, we adopted the definition of EHH proposed by Hanchard et al. (2006)
, that is, we calculated EHH value at the variant farthest away from the core in the 3' direction and that at the farthest variant in the 5' direction and took the arithmetic mean of the 2 values.
Determining Critical Values
In this study, all tests were one-sided and were conducted at the 5% significance level. The tail of the null distribution, which can maximize a test's power to detect selection or minimize its sensitivity to recombination was used. For example, for tests EW and EHH, values falling into the upper 5% tail were considered significant; for D and H (performed jointly as the DH test), values falling in the lower tail were considered significant.
Site- and Haplotype-Frequency Tests
The null distributions of the EW and related tests were determined by Slatkin's exact enumeration algorithm (1994). Critical values of Tajima's D, the DH test, and haplotype tests conditional on S were obtained by coalescent simulation with the number of polymorphic sites fixed (Hudson 1993
; Wall and Hudson 2001
).
The EHH Test
For a sample with S segregating sites, to determine the level of significance of an EHH value at the given focal site (core), we need to generate neutral samples conditional on the frequency of the mutant allele at the core. To do this, first, we have to know the distribution of population frequency of the mutant allele at the core. Using the diffusion approximation, Griffiths (2003)
has shown that, for a sample of size n, when the number of occurrence of the mutant allele at the core is b, the population frequency x (0 < x < 1) of the mutant allele has the following distribution:
|
| (4) |
In each replica of the simulation, we 1) generated a population frequency x by sampling from equation (4); 2) simulated a trajectory which started at x and ended at 0 using the pseudosampling device; 3) constructed genealogy by using the structured coalescent method with b lineages in the derived background and the rest in the ancestral background (Braverman et al. 1995
; Zeng et al. 2006
); 4) put the remaining S-1 segregating sites on the genealogy according to the shape of the local gene trees; and 5) calculated EHH for this random sample. The critical values were then obtained by examining the empirical distribution produced above. In the simulation above, we also assumed (true) local recombination rate and haplotype phase were known.
Simulation Algorithms
We used the coalescent algorithms implemented in the software package "ms" (Hudson 2002
) to generate random samples under the neutral model with intragenic recombination and under various demographic models. To simulate hitchhiking, the coalescent process with a selective phase was used (Kaplan et al. 1989
; Braverman et al. 1995
; Kim and Stephan 2002
). This model assumes that the fitnesses of the 3 genotypes AA, Aa, and aa at the selected site are 1 + s, 1 + hs, and 1, respectively, where A is the derived allele, s and h are the selection and dominance coefficients. The behavior of the selected allele is mainly governed by the scaled selective pressure
(=2Ns) and h. Our implementation followed the description in Zeng et al. (2006)
with the following modification: frequency trajectories of the selected allele were obtained by using the pseudosampling device described above rather than the approximate deterministic model. This modification takes into account the random effects in the early stage of a sweep. It also allows us to examine the effects of selective sweeps with arbitrary level of dominance.
Background selection was simulated using the 2-locus model described in Hudson and Kaplan (1994)
. This model assumes that the deleterious locus is in mutation-selection equilibrium and is not recombining, but recombination can occur between the neutral locus and the deleterious locus. Here, we extended the Hudson and Kaplan model by allowing intragenic recombination within the neutral locus.
| Results |
|---|
|
|
|---|
Effects of Intragenic Recombination on Haplotype-Frequency Tests
The haplotype-frequency tests considered in this study were derived under the no-recombination neutral model. To apply them to nuclear DNA sequence data, we need to make sure recombination does not inflate type I error. In table 1, we show the rejection probabilities of 3 tests, EW, Hudson's test, and Tajima's D test, generated with various combinations of
and recombination rate (measured by
= 4Nr, where r is the recombination rate of the region every generation). Results of other tests are very similar and are not presented (e.g., the rejection rates of Ewens' test are very similar to those of the EW test). The most striking result is that the type I error rate of the EW test is insensitive to recombination. This conclusion is true over a wide range of combinations of parameter values (see supplementary table S1, Supplementary Material online), except for some rare occasions with extreme
and
values which may be unlikely to cause problems in practice. When the EW test was conducted at other significance levels (e.g., 2%, 1%, and 0.5%), the conclusion still holds (unpublished data). On the contrary, Tajima's D and Hudson's test become very conservative in the presence of intragenic recombination, in agreement with previous reports (Wall 1999
/
is large, in which case the rejection rates of Tajima's D and Hudson's test are effectively zero. This result may imply that, when the effect of recombination is un-negligible in the history of a sample, the EW test may be more powerful than Tajima's D and Hudson's test (see fig. 3B below). The difference in susceptibility to recombination between haplotype tests conditional on K and tests conditional on S may be due to the fact that K and S contain a very different amount of information about the local recombination rate. Multiple studies have shown that K is a good indicator of local recombinational pressure (Wall 2000
as the x axis and S as the y axis, no correlation was observed (unpublished data).
|
|
There is a difference between haplotype-frequency tests and site-frequency tests that we need to pay special attention. For a site-frequency test, either tail of the null distribution responses to recombination in a very similar manner; however, this is not the case for haplotype tests conditional on S. For example, if we conduct the DV test as a 2-sided test, its type I error rate is significantly higher than the nominal significance level in the presence of recombination (Depaulis et al. 2005
Power of the Haplotype-Frequency Tests to Detect Positive Selection
In this section, we compare the differences among haplotype-frequency tests in their ability to detect positive selection. The following results allow us to select the more powerful tests to compare with the 2 site-frequency tests in the next section. We only discuss the prefixation phase of a selective sweep for reasons given in Introduction. An example is given in figure 1. Several features can be observed. First, the power of all tests increases rapidly as a function of the frequency of the advantageous allele. When the frequency of the selected allele reaches 50%, all tests have obtained substantial power. Second, the power curves reach their peaks when the advantageous allele is at very high frequency and then start to decline before fixation. This is probably because at the time around fixation, levels of variation have been significantly reduced and thus, the samples tend to be uninformative. Third, tests conditional on K are generally more powerful than tests conditional on S, both before and after fixation (supplementary fig. S2, Supplementary Material online). This observation is true in most of our simulations (unpublished data; see Discussion).
|
Lastly, tests based on C, M, and F (full haplotype configuration, frequency of the most common haplotype, and haplotype homozygosity, respectively) yield very similar results. This is true no matter they are conditional on K or S. The similarity is probably due to the presence of a predominant haplotype under hitchhiking (Barton 1998
Note that the power curves in figure 1 were generated by assuming that the selected site was in the middle of an otherwise neutrally evolving region. It is well known that the strength of hitchhiking depends on the distance between the selected site and the hitchhiking variants (Maynard Smith and Haigh 1974
; Stephan et al. 1992
; Fay and Wu 2000
; Kim and Stephan 2002
). Even the LD pattern has been shown to depend on this spatial relationship (Kim and Nielsen 2004
; Stephan et al. 2006
). In the online supplementary material (supplementary fig. S3), we show that, while the power decreases when the distance between the selected site and the neutral region increases, the relative performances of these tests remain the same as in figure 1.
Contrasting the Power of Haplotype- and Site-Frequency Tests
In this section, we compare the 2 representative haplotype-frequency tests (EW and Hudson's test) with the 2 site-frequency tests, Tajima's D and the compound test, DH (Zeng et al. 2006
). We first investigate the effects of intragenic recombination on the power of the tests by generating samples under hitchhiking with or without recombination (fig. 2; Tajima's D is less powerful than DH, and is not shown). The EW test is in fact more robust against recombination than the other 2 tests. On the contrary, Hudson's test tends to suffer the most significant loss of power.
|
Figure 3 shows the power properties of the tests when the advantageous allele is at different frequencies. In general, haplotype-frequency tests are much more powerful than site-frequency tests before fixation. The EW test is usually the most powerful test, especially when recombination rate is much higher than mutation rate (fig. 3A vs.3B). Site-frequency tests can outperform haplotype tests only when the advantageous allele is very close to fixation (e.g., the fourth panel in fig 3A). In agreement with the previous report (Zeng et al. 2006
In figure 3A, all tests have good power to detect positive selection when f (frequency of the selected allele) is 0.95 (fourth panel). For this case, we further explore the effects of the distance between the selected site and the neutral region on the power (fig. 4). As expected, the power decreases when the scaled distance Cbet/ s increases, where Cbet is the recombination distance between the site under selection and the left end of the neutral region (we assume the selected locus is on the left-hand side of the neutral region). Most interestingly, the 2 haplotype tests, especially the EW test, are more powerful than the 2 site-frequency tests over a large range of Cbet/ s values. In practical terms, the EW test can detect positive selection further away from the site of selection than the other tests. This conclusion is generally true before fixation (unpublished data). The factor underlying the difference between haplotype- and site-frequency tests in the prefixation phase is likely LD. We know that the loss of diversity and the increase in LD are characteristic for this stage of hitchhiking (e.g., Stephan et al. 2006
; Zeng et al. 2006
). Site-frequency tests hence lose power because they do not consider the increase in level of LD.
|
After fixation, the patterns of polymorphism and LD caused by hitchhiking will linger on for a short period of time, and what follows is a prolonged period when the population shows a deficit of intermediate- to high-frequency variants (Kim and Stephan 2002
= 0.05 (
is time after fixation, measured in units of 4N generations). However, this recovery phase when D and E are most powerful is in fact hardly distinguishable from the recovery phase of some demographic changes (e.g., population growth) or background selection (e.g., Fu 1997
Power of the EHH Test
The main difference between the EHH test and the other tests mentioned above is that EHH requires the prior definition of the focal site (and core haplotype) and knowledge of the local recombination rate. In figure 5, we show power curves of the EHH test conducted in 2 different ways (i.e., with or without using the site under selection as the core). The power of the EHH test is indeed very high when the precise site under selection is used as the core (the top graph in fig. 5), but its power drops precipitously when the next and closest segregating site is used as the core (the bottom graph). The power of the EW test is in between those of the EHH test and is much closer to the one with the selected site as the core (fig. 5). Considering that the EW test does not directly model recombination and does not require the knowledge of the location of the selected site, its performance is indeed quite impressive. Two factors may result in the "core-dependent" property of the EHH test. First, the site under selection is the defining site for the special pattern of LD caused by the rapid fixation of the advantageous allele (Kim and Nielsen 2004
; Stephan et al. 2006
). That is LD between 2 neutral loci can be eliminated by a sweep if they are separated by the selected site, whereas strong LD can be observed if the neutral loci are on the same side of the selected site. Hence, when the selected site is not used as the core, the signal of selection may be weakened. Second, ancestral alleles are much more common in the population than derived ones when a sweep starts. Thus, at the closest polymorphic site, it is more likely for the ancestral allele to hitchhike with the advantageous allele, that is, the use of derived allele at this site causes further loss in power. But, in the search of recent sweeps, there is no point to use ancestral alleles as the core haplotype. Recently, Voight et al. (2006)
proposed an EHH-like test, the integrated haplotype score (IHS) test. They showed that this new test is not dependent on the precise knowledge of location of the selected site. The study of other properties of the EHH test and the IHS test is beyond the scope of this paper.
|
Sensitivities of the Tests to Other Driving Forces
All tests considered so far use the standard neutral model as the null hypothesis. This model makes a number of assumptions (for instance, constant population size). Violation of different assumptions can usually lead to similar test results (e.g., Zeng et al. 2006
Background Selection
Selection against linked deleterious mutations maintained by recurrent mutation is referred to as background selection (Charlesworth et al. 1993
). Typically, background selection results in a reduction in effective population size; it also reduces the levels of polymorphism at linked neutral loci (Charlesworth et al. 1993
; Hudson and Kaplan 1994
). Previous studies suggest that positive selection and background selection can leave similar patterns of polymorphism at linked neutral loci (Charlesworth et al. 1995
; Fu 1997
). Thus, discrimination between these 2 very different modes of selection is important.
Here, we used Hudson and Kaplan's 2-locus model (1994)
to simulate background selection (see Methods). The recombination distance between the deleterious region and the left end of neutral locus is denoted as Cbet. (We assume the deleterious region is on the left-hand side of the neutral region.) The conventional value sh = 0.02 was assumed in the simulations, where s and h are, respectively, selection and dominance coefficients for mutant alleles. The results are shown in table 2. There are 2 general trends that apply to all tests except DH, which is essentially not affected by background selection. First, the sensitivity of the tests is monotonely increasing as U (deleterious mutation rate per diploid genome) increases. When U is small (say, U < 0.05), the tests are not affected even if the population size is relatively small (say, N
5,000). However, as U gets larger, the tests exhibit inflated type I error rates. In this case, larger populations are affected to a lesser extent. The second trend is that, when deleterious mutation rate is high, we can observe very large haplotype blocks with reduced variability. For example, when N is 10,000, U is 0.1, and Cbet is 5 x 104, the rejection rates of the EW test and Hudson's test are still 18.1% and 10.6%, respectively. More significant blocky haplotype structure can be seen when N is smaller (unpublished data).
|
Among the tests, the DH test has the lowest sensitivity. Hudson's test is more sensitive than DH, especially when deleterious mutation rate is high. Tajima's D and the EW test are remarkably similar. The sensitivity of these 2 tests is due to the excess of low-frequency variants in the population (Fu 1997
Population Bottleneck
Population bottleneck is believed to be an important aspect of evolution for species, which may have dispersed from their ancestral range (e.g., the "out-of-Africa" hypothesis of human evolution). Here, we model bottleneck in the following way. The population size at present is N. Going backward in time, the population size reduces exponentially to ßN. The length of this size-reducing period is tb (in units of 4N generations). Then the population size restores to N instantaneously (i.e., at time tb in the past). The effects of mild bottlenecks (say, ß > 0.25) are not detectable by the tests considered (unpublished data). In figure 6, we present results for ß = 0.05. The 4 tests can be divided into 2 groups according to their dynamicsDH and Hudson's test form the first group and the other 2 tests form the second group. Tests in the first group are considerably more conservative than those in the second group. The DH test is the most conservative test. Its rejection rate never goes above 12% (the peak is 11.7% at tb = 0.08). But the highest rejection rates of other tests are all above 19%. The main difference between Tajima's D and the EW test is that the EW test tends to be more powerful in detecting more recent size reduction, whereas Tajima's D is better at detecting older ones.
|
Population Subdivision
In this section, we discuss simulation results obtained by using the symmetric finite island model (Wright 1931
|
| (5) |
0.2 (fig. 7B). Among the tests, Fay and Wu's H has the highest false-positive rate.
|
When sequences sampled from different demes are pooled together in the analysis, the picture is somewhat more complicated. Summarizing our simulation results, we find the following. 1) When sampling is extremely biased (i.e., the vast majority of the sequences are taken from one deme), the behaviors of the tests resemble those shown in figure 7. The tests, except for the H test, usually have rejection rates lower than 10%, if FST
0.2. 2) If the sequences are sampled more or less uniformly across the demes, pooling samples does not have visible effects on the type I error rates of the tests (e.g., supplemental fig. S5, Supplementary Material online).
Joint Effects of Positive Selection and Demography
The results presented above were obtained by considering positive selection and demography separately. In reality, these 2 driving forces may have occurred concurrently. It is thus desirable to understand the effects of 2 interfering processes on the power of the tests. As an example, we assume that a selective sweep occurs in a population, which experienced a recent size expansion (fig. 8). The joint effects of sweep and growth on the EW test are quite complex (fig. 8A). The power curve of the EW test obtained under the interfering model (i.e., positive selection and growth together) often falls below those obtained under simpler models (i.e., only positive selection or only growth). In particular, there is a dip in power at tg
0.03. This is because frequency of the selected allele at this time is moderately high (about 40%), and thus, linked variants are likely to have hitchhiked to intermediate frequency (due to size expansion, most of the variants initially linked to the selected allele are at low frequency). Note that the EW test is sensitive to both population growth and positive selection. Therefore, when EW rejects neutrality, it is hard to tell which factor causes the rejection. Similar results were obtained when other simple tests such as Tajima's D were considered (unpublished data). On the other hand, the DH test, which is sensitive to selection but insensitive to growth, is much less affected when both factors are present (fig. 8B). Based on these results and results obtained by considering different interfering models (unpublished data), we conclude that DH is usually able to distinguish between positive selection and demography.
|
| Discussion |
|---|
|
|
|---|
The EW Test in the Presence of Recombination
In this study, we have showed that the EW test, albeit derived under the no-recombination neutral model, is applicable to DNA sequences with recombination. Previously, Hudson (1983)
and
in a complex way, but in general the effects of recombination do not seem to be very significant. Our results suggest that, averaging over all possible values of K, recombination does not affect the type I error rate of the EW test appreciably. Because K contains information on the local mutation and recombination rates, it may not be surprising that haplotype tests conditional on K have higher power to detect positive selection than those conditional on S. In particular, results of figure 3 B indicate that the EW and related tests may be useful for analyzing data from organism like Drosophila whose recombination rate is several fold higher than the mutation rate.
Site-Frequency Tests versus Haplotype-Frequency Tests
By summarizing simulation results obtained under various combinations of parameter values (10
n
120, 0.5
30, 0
/
15, and 50
1000), we find that the most important factor determining the relative performance between the DH test (representing site-frequency tests) and the EW test (representing haplotype-frequency tests) is the absolute level of polymorphism of the sample (i.e., values of S and n). For example, when there is only one polymorphic site in the sample whose derived allele occurs in 48 of the 50 samples, the P value of the DH test is 0.005, but that of the EW test is 0.344. In other words, when S and/or n are small (say, S < 10 and/or n
10), the EW test may not be the most powerful test among the tests considered. In those cases, we recommend the DH test for 2 reasons. First, the DH test does not require the identification of linkage phase. Second, the DH test is relatively insensitive to forces other than directional selection.
The Effects of Other FactorsDominance and Haplotype Inference
So far, we have assumed the genic selection model for the advantageous allele (i.e., h = 0.5). Recent study has showed that the level of dominance can have important impact on the effect of hitchhiking (Teshima and Przeworski 2006
). Based on some exploratory simulation studies with h = 0.1 or 0.9, we find that recessive advantageous alleles are indeed more difficult to detect than dominant ones, in agreement with the results of Teshima and Przeworski. Nevertheless, the relative performance of these tests reported above remains true. Furthermore, when the neutral region is very close to the selected site, the effect of dominance tends to be weak (unpublished data). Our analysis also assumes that the haplotypes are known. But, in many nonmodel organisms, they can only be inferred. When the phase inference is incorrect, type I error rates of the haplotype tests may be inflated (unpublished data).
The Issue of Specificity
We summarize qualitatively the power (or sensitivity) of the tests to detect positive selection, background selection, and various demographic scenarios in table 3. It is clear that all simple tests can be sensitive to more than one driving forces. In comparison, the compound test, DH, is affected by factors other than positive selection to a much lesser extent. This is because the 2 component tests of DH, Tajima's D and Fay and Wu's H, are both powerful in detecting selection but are sensitive to other driving forces in a somewhat mutually exclusive way (table 3; Zeng et al. 2006
). Furthermore, figure 8B indicate that the DH test is reasonably powerful even when positive selection occurs in a nonequilibrium background. These results suggest that the compound test DH provides a novel way of separating positive selection from demography. On the other hand, we note that the EW test and Tajima's D are similar in many respects and that the EW test is very powerful in detecting positive selection. These observations suggest that, by combining the EW test and H or DH, we may be able to construct new compound tests with improved performance. We will explore this possibility in the companion paper (Zeng K, Shi S, Wu C-I, in preparation).
|
| Supplementary Material |
|---|
|
|
|---|
Supplementary table S1 and figures S1, S2, S3, S4, and S5 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank Drs R. Hudson, T. Nagylaki, J. Pritchard, and 2 anonymous reviewers for their helpful comments. K.Z. is supported by Sun Yat-sen University and the Kaisi Fund. S.S. is supported by grants from the National Natural Science Foundation of China (30230030, 30470119, 30300033, and 30500049). C.-I.W. is supported by National Institutes of Health grants and an OOCS grant from the Chinese Academy of Sciences.
| Footnotes |
|---|
Jianzhi Zhang, Associate Editor
| References |
|---|
|
|
|---|
Barton NH. The effect of hitch-hiking on neutral genealogies. Genet Res (1998) 72:123133.[CrossRef][Web of Science]
Biswas S, Akey JM. Genomic insights into positive selection. Trends Genet (2006) 22:437446.[CrossRef][Web of Science][Medline]
Braverman JM, Hudson RR, Kaplan NL, Langley CH, Stephan W. The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics (1995) 140:783796.[Abstract]
Charlesworth B, Morgan MT, Charlesworth D. The effect of deleterious mutations on neutral molecular variation. Genetics (1993) 134:12891303.[Abstract]
Charlesworth D, Charlesworth B, Morgan MT. The pattern of neutral molecular variation under the background selection model. Genetics (1995) 141:16191632.[Abstract]
Coop G, Griffiths RC. Ancestral inference on gene trees under selection. Theor Popul Biol (2004) 66:219232.[CrossRef][Web of Science][Medline]
Depaulis F, Mousset S, Veuille M. Power of neutrality tests to detect bottlenecks and hitchhiking. J Mol Evol (2003) 57(Suppl 1):S190S200.[CrossRef][Web of Science][Medline]
Depaulis F, Mousset S, Veuille M. Detecting selective sweeps with haplotype tests. In: Selective sweepNurminsky D, ed. (2005) Georgetown (DC): Landes Bioscience. 3354.
Depaulis F, Veuille M. Neutrality tests based on the distribution of haplotypes under an infinite-site model. Mol Biol Evol (1998) 15:17881790.[Web of Science][Medline]
Donnelly P, Tavaré S. Coalescents and genealogical structure under neutrality. Annu Rev Genet (1995) 29:401421.[CrossRef][Web of Science][Medline]
Ewens WJ. The sampling theory of selectively neutral alleles. Theor Popul Biol (1972) 3:87112.[CrossRef][Web of Science][Medline]
Ewens WJ. Testing for increased mutation rate for neutral alleles. Theor Popul Biol (1973) 4:251258.[CrossRef][Web of Science][Medline]
Ewens WJ. Mathematical population genetics (2004) Berlin (Germany): Springer-Verlag.
Fay JC, Wu CI. Hitchhiking under positive Darwinian selection. Genetics (2000) 155:14051413.
Fay JC, Wu CI. Sequence divergence, functional constraint, and selection in protein evolution. Annu Rev Genomics Hum Genet (2003) 4:213235.[CrossRef][Web of Science][Medline]
Fu YX. Statistical properties of segregating sites. Theor Popul Biol (1995) 48:172197.[CrossRef][Web of Science][Medline]
Fu YX. New statistical tests of neutrality for DNA samples from a population. Genetics (1996) 143:557570.[Abstract]
Fu YX. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics (1997) 147:915925.[Abstract]
Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics (1993) 133:693709.[Abstract]
Griffiths RC. The frequency spectrum of a mutation, and its age, in a general diffusion model. Theor Popul Biol (2003) 64:241251.[CrossRef][Web of Science][Medline]
Hanchard NA, Rockett KA, Spencer C, Coop G, Pinder M, Jallow M, Kimber M, McVean G, Mott R, Kwiatkowski DP. Screening for recently selected alleles by analysis of human haplotype similarity. Am J Hum Genet (2006) 78:153159.[CrossRef][Web of Science][Medline]
Hudson RR. Properties of a neutral allele model with intragenic recombination. Theor Popul Biol (1983) 23:183201.[CrossRef][Web of Science][Medline]
Hudson RR. The how and why of generating gene genealogies. In: Mechanisms of molecular evolutionTakahata N, Clark AG, eds. (1993) Sunderland (MA): Sinauer Associates. 2336.
Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics (2002) 18:337338.
Hudson RR, Bailey K, Skarecky D, Kwiatowski J, Ayala FJ. Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster. Genetics (1994) 136:13291340.[Abstract]
Hudson RR, Kaplan NL. Gene trees with background selection. In: Non-neutral evolution: theories and molecular dataGolding B, ed. (1994) London: Chapman & Hall. 140153.
Innan H, Zhang K, Marjoram P, Tavaré S, Rosenberg NA. Statistical tests of the coalescent model based on the haplotype frequency distribution and the number of segregating sites. Genetics (2005) 169:17631777.
Jensen JD, Kim Y, DuMont VB, Aquadro CF, Bustamante CD. Distinguishing between selective sweeps and demography using DNA polymorphism data. Genetics (2005) 170:14011410.
Kaplan NL, Hudson RR, Langley CH. The "hitchhiking effect" revisited. Genetics (1989) 123:887899.
Karlin S, McGregor J. Addendum to a paper of W. Ewens. Theor Popul Biol (1972) 3:113116.[CrossRef][Web of Science][Medline]
Kelly JK. A test of neutrality based on interlocus associations. Genetics (1997) 146:11971206.[Abstract]
Kim Y, Nielsen R. Linkage disequilibrium as a signature of selective sweeps. Genetics (2004) 167:15131524.
Kim Y, Stephan W. Detecting a local signature of genetic hitchhiking along a recombining chromosome. Genetics (2002) 160:765777.
Kimura M. The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. Genetics (1969) 61:893903.
Kimura M. Average time until fixation of a mutant allele in a finite population under continued mutation pressure: studies by analytical, numerical, and pseudo-sampling methods. Proc Natl Acad Sci USA (1980) 77:522526.
Maynard Smith J, Haigh J. The hitch-hiking effect of a favourable gene. Genet Res (1974) 23:2335.[Web of Science][Medline]
Nielsen R. Molecular signatures of natural selection. Annu Rev Genet (2005) 39:197218.[CrossRef][Web of Science][Medline]
Przeworski M. The signature of positive selection at randomly chosen loci. Genetics (2002) 160:11791189.
Sabeti PC, Reich DE, Higgins JM, et al, (17 co-authors). Detecting recent positive selection in the human genome from haplotype structure. Nature (2002) 419:832837.[CrossRef][Medline]
Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lander ES. Positive natural selection in the human lineage. Science (2006) 312:16141620.
Simonsen KL, Churchill GA, Aquadro CF. Properties of statistical tests of neutrality for DNA polymorphism data. Genetics (1995) 141:413429.[Abstract]
Slatkin M. Inbreeding coefficients and coalescence times. Genet Res (1991) 58:167175.[Web of Science][Medline]
Slatkin M. An exact test for neutrality based on the Ewens sampling distribution. Genet Res (1994) 64:7174.[Web of Science][Medline]
Slatkin M. A correction to the exact test based on the Ewens sampling distribution. Genet Res (1996) 68:259260.[Web of Science][Medline]
Slatkin M, Bertorelle G. The use of intraallelic variability for testing neutrality and estimating population growth rate. Genetics (2001) 158:865874.
Spencer CC, Coop G. SelSim: a program to simulate population genetic data with natural selection and recombination. Bioinformatics (2004) 20:36733675.
Stephan W, Song YS, Langley CH. The hitchhiking effect on linkage disequilibrium between linked neutral loci. Genetics (2006) 172:26472663.
Stephan W, Wiehe THE, Lenz MW. The effect of strongly selected substitutions on neutral polymorphismanalytical results based on diffusion-theory. Theor Popul Biol (1992) 41:237254.[CrossRef][Web of Science]
Strobeck C. Average number of nucleotide differences in a sample from a single subpopulation: a test for population subdivision. Genetics (1987) 117:149153.
Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics (1989) 123:585595.
Teshima KM, Przeworski M. Directional positive selection on an allele of arbitrary dominance. Genetics (2006) 172:713718.
Toomajian C, Ajioka RS, Jorde LB, Kushner JP, Kreitman M. A method for detecting recent selection in the human genome from allele age estimates. Genetics (2003) 165:287297.
Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS Biol (2006) 4:e72.[CrossRef][Medline]
Wall JD. Recombination and the power of statistical tests of neutrality. Genet Res (1999) 74:6579.[CrossRef][Web of Science]
Wall JD. A comparison of estimators of the population recombination rate. Mol Biol Evol (2000) 17:156163.
Wall JD, Hudson RR. Coalescent simulations and statistical tests of neutrality. Mol Biol Evol (2001) 18:11341135.
Wang ET, Kodama G, Baldi P, Moyzis RK. Global landscape of recent inferred Darwinian selection for Homo sapiens. Proc Natl Acad Sci USA (2006) 103:135140.
Watterson GA. On the number of segregating sites in genetical models without recombination. Theor Popul Biol (1975) 7:256276.[CrossRef][Web of Science][Medline]
Watterson GA. The homozygosity test of neutrality. Genetics (1978) 88:405417.
Wright S. Evolution in Mendelian populations. Genetics (1931) 16:97159.
Zeng K, Fu YX, Shi S, Wu CI. Statistical tests for detecting positive selection by utilizing high-frequency variants. Genetics (2006) 174:14311439.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
D. Garrigan, R. Lewontin, and J. Wakeley Measuring the Sensitivity of Single-locus "Neutrality Tests" Using a Direct Perturbation Approach Mol. Biol. Evol., January 1, 2010; 27(1): 73 - 89. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Jackson The Evolution of Amastin Surface Glycoproteins in Trypanosomatid Parasites Mol. Biol. Evol., January 1, 2010; 27(1): 33 - 45. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Chen, T.-J. Chuang, B.-Y. Liao, and F.-C. Chen Scanning for the Signatures of Positive Selection for Human-Specific Insertions and Deletions Gen Biol Evol, November 23, 2009; 2009(0): 415 - 419. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Eckert, J. L. Wegrzyn, B. Pande, K. D. Jermstad, J. M. Lee, J. D. Liechty, B. R. Tearse, K. V. Krutovsky, and D. B. Neale Multilocus Patterns of Nucleotide Diversity and Divergence Reveal Positive Selection at Candidate Genes Related to Cold Hardiness in Coastal Douglas Fir (Pseudotsuga menziesii var. menziesii) Genetics, September 1, 2009; 183(1): 289 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Lohmueller, C. D. Bustamante, and A. G. Clark Methods for Human Demographic Inference Using Haplotype Patterns From Genomewide Single-Nucleotide Polymorphism Data Genetics, May 1, 2009; 182(1): 217 - 231. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zhai, R. Nielsen, and M. Slatkin An Investigation of the Statistical Power of Neutrality Tests Based on Comparative and Population Genetic Data Mol. Biol. Evol., February 1, 2009; 26(2): 273 - 283. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Rosenberg and M. Jakobsson The Relationship Between Homozygosity and the Frequency of the Most Frequent Allele Genetics, August 1, 2008; 179(4): 2027 - 2036. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Llopart and J. M. Comeron Recurrent Events of Positive Selection in Independent Drosophila Lineages at the Spermatogenesis Gene roughex Genetics, June 1, 2008; 179(2): 1009 - 1020. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lu, Y. Fu, S. Kumar, Y. Shen, K. Zeng, A. Xu, R. Carthew, and C.-I Wu Adaptive Evolution of Newly Emerged Micro-RNA Genes in Drosophila Mol. Biol. Evol., May 1, 2008; 25(5): 929 - 938. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Zeng, S. Shi, and C.-I Wu Compound Tests for the Detection of Hitchhiking Under Positive Selection Mol. Biol. Evol., August 1, 2007; 24(8): 1898 - 1908. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||











