MBE Advance Access originally published online on June 8, 2007
Molecular Biology and Evolution 2007 24(8):1898-1908; doi:10.1093/molbev/msm119
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Compound Tests for the Detection of Hitchhiking Under Positive Selection

* State Key Laboratory of Biocontrol and Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou, China
Department of Ecology and Evolution, University of Chicago
E-mail: kzeng{at}uchicago.edu.
| Abstract |
|---|
|
|
|---|
Many statistical tests have been developed for detecting positive selection. Most of these tests draw conclusions based on significant deviations from the patterns of polymorphism predicted by the neutral model. However, many non-equilibrium forces may cause similar deviations, and thus the tests usually have low statistical specificity to positive selection. The main challenge is hence to construct test statistics that are reasonably powerful in detecting positive selection, but are relatively insensitive to other forces. Recently, Zeng et al. (2006) proposed a new test, DH, which is a compound of Tajima's D and Fay and Wu's H, and showed that DH has reasonably high statistical specificity to positive selection. In this report, we expand the idea of a compound test by combining Fay and Wu's H or DH with the Ewens-Watterson (EW) test. We refer to these 2 new tests as HEW and DHEW, respectively. Compared to the DH test, HEW and DHEW are more robust against the presence of recombination, and are also more powerful in detecting positive selection. Furthermore, the DHEW test, similar to DH, is also relatively insensitive to background selection and demography. The HEW test, on the other hand, tends to be somewhat less conservative than DH and DHEW in some cases.
Key Words: compound tests positive selection demography background selection
| Introduction |
|---|
|
|
|---|
Clarifying the roles of genetic drift and natural selection in the course of evolution is an issue of fundamental importance in the study of evolutionary biology (Kimura 1983
Many methods have been proposed for specifically detecting positive selection. Broadly speaking, these methods are derived by using one of the 4 general approaches described below. The first approach, loosely speaking, is to develop methods in the HKA framework (Hudson et al. 1987
; Innan 2006
). The underlying principle is that positive selection tends to affect only a small number of loci that are closely linked to the advantageous allele, but demographic changes affect all loci in the genome equally. Usually, demographic changes are directly modeled, and genomic data are then used to estimate the parameters. Genes are considered as targets of positive selection if their patterns of genetic variation are incompatible with those predicted by the estimated demographic parameters (Wall et al. 2002
; Akey et al. 2004
; Li and Stephan 2006
; Thornton and Andolfatto 2006
). The difficulty with this approach is that it requires the prior knowledge of the demographic history of the species. But this information is usually not available, or is known only vaguely. Furthermore, it is not clear whether we can obtain the same results if we assume a different demographic model.
The second approach is more empirical. It scans patterns of variation over many loci, and considers loci in the tails of the empirical distribution as candidate targets of selection (Akey et al. 2002
; Carlson et al. 2005
; Voight et al. 2006
; Wang et al. 2006
). This approach is appealing in that it avoids specifying a complex demographic model. Nonetheless, the support for positive selection is usually obtained by comparing the empirical distribution with distributions obtained by simulating various demographic models. Furthermore, a recent study suggests that this kind of analysis may suffer from high rates of false positives and false negatives (Teshima et al. 2006
).
The third approach relies on a composite likelihood method, which treats nucleotide sites as though they were statistically independent. Methods derived by this approach differ from those mentioned above in that the null hypothesis is not a specific population genetic model, but is derived from the background pattern of variation in the data itself (Jensen et al. 2005
; Nielsen et al. 2005
).
The fourth approach, which is the focus of this paper, has a simple structure. It requires no estimation of parameters, and is most suitable to data from a single locus. The first test derived by this approach is called the DH test (Zeng et al. 2006
), which is a compound test combining Tajima's D (Tajima 1989
) and Fay and Wu's H (Fay and Wu 2000
). The underlying idea is that Tajima's D and Fay and Wu's H are both powerful in detecting positive selection, but are sensitive to other processes in a mutually exclusive manner (Zeng et al. 2006
). Therefore, by properly combining these 2 tests, the resulting test (DH) obtains relatively high statistical specificity to positive selection.
The DH test, Tajima's D, and Fay and Wu's H only use the site-frequency spectrum (Fu 1995
) for detection. It is known that these methods tend to be too conservative in the presence of intragenic recombination (Wall 1999
; Przeworski et al. 2001
; Zeng et al. 2007
). In the companion study (Zeng et al. 2007
), we have shown that these site-frequency methods are usually not as powerful in detecting positive selection as methods based on the haplotype-frequency spectrum such as the Ewens-Watterson (EW) test (Watterson 1978
). Furthermore, we have shown that the EW test is insensitive to recombination, and is usually very powerful in detecting positive selection especially when recombination rate is high. In the light of these recent results, we expand the idea of a compound test by constructing new test statistics which make use of both site- and haplotype-frequency spectra.
In this paper, we propose two new tests by combining Fay and Wu's H or the DH test with the Ewens-Watterson (EW) test. We refer to these two tests as HEW (H + EW) and DHEW (DH + EW), respectively. The effect of positive selection is studied using the model first introduced by Maynard Smith and Haigh (1974)
. By doing extensive computer simulations, we try to address the following questions: (1) Are HEW and DHEW more robust against the presence of intragenic recombination than is the DH test? (2) Are HEW and DHEW more powerful than DH in detecting positive selection? (3) Are HEW and DHEW sensitive to background selection and demographic changes?
| Methods |
|---|
|
|
|---|
Statistical Tests
We first describe 4 previously developed tests (Tajima's D, Fay and Wu's H, Ewens-Watterson's test, and the DH test), and then we describe 2 new tests, HEW and DHEW.
Tajima's D and Fay and Wu's H
Tajima's well-known test statistic is defined as:
|
| (1) |

(Tajima 1983
W (Watterson 1975
(= 4Nu, where N is the effective population size and u is the neutral mutation rate of the locus).
Fay and Wu's H contrasts the abundance of high-frequency variants with the abundance of intermediate-frequency ones (Fay and Wu 2000
). Here we use the normalized H statistic, defined as:
|
| (2) |
L above is a new unbiased estimator of
, and by using
L, the variance term Var(
–
L) can be derived (Zeng et al. 2006
The Ewens-Watterson test
The test statistic of the Ewens-Watterson (EW) test is:
|
| (3) |
.
The DH test
We denote an observed sample with S (
1) segregating sites as XS,obs. We say XS,obs is rejected by the DH test at the significance level p if
|
| (4) |
|
| (5) |
|
| (6) |
The HEW test
Similar to the DH test, for a given significance level p, the rejection region (RR) for HEW is:
|
| (7) |
,neu represents random samples generated with a given
, EW(X) is the value of the test statistic of the EW test for a sample X, and the critical values (H
,cri and EW
,cri) satisfying
|
| (8) |
|
| (9) |
is unknown. In our implementation, we set
=
W (Watterson 1975
The DHEW test
The rejection region (RR) for DHEW for a given significance level p is:
![]() | (10) |
Similarly, the critical values satisfy the following conditions:
![]() | (11) |
![]() | (12) |
Simulating the null distributions of the tests
We obtain the null distribution of the EW test by using Slatkin's algorithm to enumerate all possible haplotype-frequency configurations for a given K (Slatkin 1994
). For D, H, HEW and DHEW, we conduct standard coalescent simulation without recombination (Hudson 1990
) and with
estimated by Watterson's estimator (Watterson 1975
). Finally, the critical values of the DH test are determined using coalescent simulation with the number of segregating sites fixed (Hudson 1993
; Wall and Hudson 2001
).
All tests are one-sided and are conducted at the 5% significance level. The tail of the null distribution of a test statistic which can maximize its power to detect positive selection is used. Thus, for the EW test, values falling into the upper 5% tail are considered significant, and for Tajima's D and Fay and Wu's H, values falling into the lower 5% tail are considered significant.
Simulating evolutionary processes
We simulate various evolutionary processes using the coalescent method (Hudson 1983
, 1990
) and the infinite-site model (Kimura 1969
). Intragenic recombination is included in all the simulations. The intensity of recombination is measured by
= 4Nr, where r is the recombination rate of the locus per generation. We use the coalescent process with a selective phase to generate random samples under the hitchhiking model. This model assumes that a beneficial mutation arises on a single chromosome, and that the fitnesses of the 3 genotypes (BB, Bb, and bb) are 1 + s, 1 + h·s, and 1, respectively, where B is the advantageous allele, b is the wild type allele, and s and h are the selection and dominance coefficients. The behavior of the selected allele is mainly determined by the scaled selection coefficient
(= 2Ns) and h. The trajectory of the advantageous allele is obtained by using the time-reversal property of the diffusion process (Ewens 2004
), and the pseudo-sampling device (Kimura 1980
). This procedure takes into account the stochastic effects on the change in frequency of the selected allele. It also allows us to explore selected alleles with different levels of dominance. (Only results obtained by assuming h = 0.5 have been presented. Other results obtained by assuming h = 0.1 or 0.9 are qualitatively similar.) The genealogy is then generated using the structured coalescent method (Braverman et al. 1995
; Zeng et al. 2006
). Background selection is simulated using the 2-locus model described in Hudson and Kaplan (1994)
. This model assumes that the deleterious locus is in mutation-selection equilibrium and is not recombining, but that recombination can occur between the neutral locus and the deleterious locus. Here we extend the Hudson and Kaplan model by allowing intragenic recombination within the neutral locus. Finally, we use the algorithms implemented in the software package ms (Hudson 2002
) to simulate various demographic scenarios.
| Results |
|---|
|
|
|---|
In the companion study, we have shown that the EW test is more powerful than Tajima's D in detecting selection before and around the fixation of an advantageous allele, while under background selection or other demographic perturbations these 2 tests behave similarly (Zeng et al. 2007
Null distributions of the compound tests
Statistically, a compound test transforms a sequence sample into a vector, and uses this vector as the test statistic. The elements of the vector are p-values of the component tests. An example of such a transformation is shown in figure 1 for the DH test, generated with n = 50 and S = 20. By definition, the 5% rejection region (the shaded area) is in the lower-left corner of the plot. Interestingly, under neutrality the vectors are largely uniformly distributed, suggesting that the p-values of D and H may be only weakly correlated (the left panel of figure 1). In contrast, when positive selection is in operation, the vectors tend to cluster in the lower-left corner, resulting in rejections of neutrality (the right panel of figure 1). In table 1, we show the significance levels of the component tests (i.e., p*s defined by equations (6), (9) and (12)) of the 3 compound tests under various combinations of parameter values. In all cases, p* is several-fold higher than p, the nominal significance level of a compound test. The DHEW test has the highest p* value, because this test is constrained by 3 different tests. The actual rejection rates of the 3 compound tests under neutrality are presented in table 2. When there is no recombination (
/
= 0), the actual rejection rates of the tests are very close to the nominal significance level (5% in table 2), suggesting that they are indeed legitimate statistical tests. When intragenic recombination is included, the DH test becomes overly conservative, in agreement with previously reports (Wall 1999
; Zeng et al. 2007
). However, the other 2 tests (and especially HEW) are affected by recombination to a much lesser extent. This gain in robustness is due to the EW test whose rejection rate has been shown to be insensitive to a change in local recombination rate (Zeng et al. 2007
).
|
|
|
Power of the compound tests to detect positive selection
In this section we study the power of the compound tests to detect positive selection, and the differences between compound tests and simple site-frequency tests (i.e., Tajima's D and Fay and Wu's H). In the simulations, we incorporate intragenic recombination, and assume that
= 300, h = 0.5. Results obtained by using other parameter values are qualitatively comparable (Supplemental Figure S1). Figure 2 shows the power of the tests as a function of the size of the neutral region when the advantageous allele is still segregating in the population. In this plot, the selected site is located in the middle of the neutral region. It is clear that HEW and DHEW are more powerful than the other 3 tests which rely solely on the site-frequency spectrum. This suggests that before fixation the haplotype-frequency spectrum contains useful information about the hitchhiking process, and that ignoring this information would cause a loss of power. At this stage of a sweep, the power of a test to detect positive selection decreases in the following order: HEW (the most powerful), DHEW, H, DH, and D. However, the difference between H and DH is not significant. Increasing the local recombination rate does not change this order, but does increase the advantage of HEW and DHEW over the other tests (figure 2 versus Supplemental figure S2). In general, recombination reduces the size of the region that a selective sweep affects, and lowers the power of the tests, especially that of Tajima's D (e.g., compare the fourth panels in figures 2 and S2). HEW is less powerful than the EW test when the selected allele is at low to intermediate frequency (e.g., less than 70%), but otherwise outperforms it.
|
After the advantageous allele has been fixed, relative performances of the tests are not always the same as those before fixation. When recombination rate is of the same order as the mutation rate (
/
= 1; figure 3A), Tajima's D tends to be the most powerful test. In this case, the DHEW test outperforms the other 2 compound tests, and the HEW test is somewhat better than DH. However, if recombination rate is high (
/
= 5; figure 3B), the relative performances of the tests resemble those before fixation, i.e. HEW tends to be the most powerful, followed by DHEW, and then the site-frequency tests. Although the power of the H test decreases quickly after fixation (Kim and Stephan 2002
(time after fixation, measured in units of 4N generations) becomes larger, the special patterns of polymorphism and LD (linkage disequilibrium) due to hitchhiking disappear, and the population exhibits an excess of low- and intermediate-frequency mutations (Zeng et al. 2006
|
It has been shown that the patterns of polymorphism and LD caused by hitchhiking are highly dependent on the distance between the neutral region and the site under selection (Kim and Stephan 2002
= 10, and place this locus in regions at various distances from the site under selection (figure 4). In this plot, the frequency of the selected allele is 90%, and the distance between the neutral locus and the selected locus is measured by Cbet/s, where Cbet is the recombination distance between the 2 loci, and s is the selection coefficient. The 2 new compound tests (i.e., HEW and DHEW) are in most cases more powerful than DH, Tajima's D, and Fay and Wu's H. In practical terms, HEW and DHEW are able to detect a sweep further away than are the other tests. A similar picture is observed shortly after fixation (Supplemental figure S4).
|
Sensitivity to background selection
Purifying selection against linked deleterious mutations maintained by recurrent mutation is referred to as background selection (Charlesworth et al. 1993
|
The effects of population demography
Demographic changes may be the most visible violation of the neutral assumptions. Therefore, discrimination between rejections due to demography and those due to positive selection is very important.
Population growth
We use the following model to study the effect of population growth: the population size at present is N; going backward in time, the population size decreases instantly to N0 at time tg (in units of 4N generations). In figure 5, we show the results obtained by assuming N0/N = 0.05. It is clear that the compound tests are largely unaffected. In contrast, the power of the D test can be as high as 60%. The EW test and Tajima's D behave similarly in this case (unpublished data).
|
Population bottleneck
Here we model bottleneck in the following way. The population size at present is N. Going backward in time the population size reduces exponentially to
N. The length of this size-reducing period is tb (in units of 4N generations). Then the population size is restored to N instantaneously (i.e., at time tb in the past). Summarizing the results of our simulations, we find the following general trends. (1) The DH and DHEW tests behave similarly, and they are in most cases the most conservative tests. (2) When the reduction in population size is mild (e.g.,
0.25), none of the tests have a rejection rate higher than 10%. (3) For an intermediate level of size contraction (e.g., 0.001 <
< 0.25), the compound tests may show limited sensitivity. In this case, HEW tends to be less conservative than DH and DHEW (e.g., figure 6A). However, Tajima's D, and to a lesser extent, Fay and Wu's H, are usually much more sensitive than the compound tests (figure 6A). (4) Severe bottlenecks (e.g.,
0.001) have no significant effect on the compound tests, although Fay and Wu's H and Tajima's D are both highly sensitive (e.g., figure 6B). It has been argued that patterns of variation caused by a bottleneck may be hard to distinguish from those caused by hitchhiking (Barton 1998
|
Population subdivision
In this section, we study the effects of population structure on the tests using the simple symmetric island model (Wright 1931
0.25 (this is also true under other sampling schemes; unpublished data). HEW, on the other hand, is not as robust as DH and DHEW, though it is affected to a lesser extent than Fay and Wu's H. Usually HEWs rejection rate is reasonably low when FST
0.2. In the Supplement, we explore the effects of pooling samples taken from different demes on the rejection rates of the tests (figure S5). We find that even when sampling is quite biased, the type I error rates of the tests are usually unaffected.
|
Differences between compound tests and composite likelihood methods
The composite likelihood test due to Nielsen et al. (2005
becomes sufficiently large (e.g.,
= 50). Two factors may account for this difference in power. First, the performance of the CL test may rely heavily on the accurate inference of the background pattern of variation. However, this inference may require a large amount of data which may not be always available. Second, the lower power of the CL test could also be explained by our not having incorporated recombination when simulating CLs null distribution. However, we have been cautious about including recombination in the simulation, since it may make the CL test less robust against demographic changes (Nielsen et al. 2005
|
| Discussion |
|---|
|
|
|---|
In this study, we develop two new compound tests—HEW and DHEW—by using the novel properties of the Ewens-Watterson (EW) test reported in the companion paper (Zeng et al. 2007
Compared with Fay and Wu's H, the compound tests are better in the following ways: (1) HEW and DHEW are more robust against the presence of intragenic recombination. (2) HEW and DHEW are more powerful in detecting positive selection. (3) The power of all 3 compound tests decreases at a much lower rate after fixation. (4) The compound tests, especially DH and DHEW, are less sensitive to population bottleneck and population subdivision. Another important issue is that the use of Fay and Wu's H requires the identification of ancestral/derived alleles at each polymorphic site. This is usually done by using sequences from a closely related species. However, the inference may be inaccurate due to the violation of the infinite-site model. Incorrect inference of the ancestral state usually results in a significantly negative H statistic (Baudry and Depaulis 2003
). The effect of an incorrect inference on the 3 compound tests is reduced due to the inclusion of Tajima's D and/or the EW test, both of which are not dependent on the separation of ancestral/derived alleles (table 6). Among the compound tests, the DHEW test seems to be the most robust. In general, if the probability of an incorrect inference per site is lower than 5%, none of the tests tend to be seriously affected.
|
In the companion study (Zeng et al. 2007
It is of some interest to compare the compound tests with tests based on patterns of linkage disequilibrium (e.g., Sabeti et al. 2002
). In the companion paper (Zeng et al. 2007
), we have shown that when the site under positive selection is known and is used as the core, the extended haplotype homozygosity (EHH) test (Sabeti et al. 2002
) is very powerful; whereas when this prior information about the selected site is not available, the test has little power. Therefore, the compound tests should be more powerful than the EHH test, unless the site under selection is known. Recently, Voight et al. (2006)
proposed the integrated haplotype score (iHS) test, which can be seen as an extension of the EHH test. Unlike the EHH test, the iHS test has power to detect positive selection when the site under selection is not included in the data set (Voight et al. 2006
). However, some preliminary results suggest that the iHS test can be less conservative than the compound tests under certain demographic schemes. We are currently preparing a manuscript to further investigate the differences between LD-based methods and the compound tests.
| Conclusion |
|---|
|
|
|---|
We have proposed 2 new compound tests, HEW and DHEW, by combining Fay and Wu's H or DH with the Ewens-Watterson (EW) test. Compared to the DH test, these 2 new tests are: (1) more robust against recombination, (2) more powerful in detecting positive selection. Summarizing the results, we conclude that: (1) the 3 compound tests (DH, HEW and DHEW) are most suitable for detecting ongoing or recently fixed selective sweeps; (2) DH and DHEW tend to have higher specificity to positive selection than HEW. In practice, when only site-frequency data is available, we suggest using the DH test due to its high specificity. If haplotype phase is also known, the DHEW test may be a good choice since it automatically takes into account the effect of recombination, and also has high specificity to positive selection. The HEW test, however, should be used when the population does not show much geographical structure and is not greatly affected by recent bottleneck events.
| Supplementary Material |
|---|
|
|
|---|
Supplementary figures S1 through S5 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org).
| Acknowledgements |
|---|
|
|
|---|
We are very grateful to Dr. Richard Hudson for his help during the course of this study. We thank Dr. Thomas Nagylaki for his helpful comments. Thanks are also due to Drs Supriya Kumar, Graham Coop, Shuhei Mano, Eric Hungate, and 2 anonymous reviewers who helped us improve the manuscript. K.Z. is supported by Sun Yat-sen University and the Kaisi Fund. S.S. is supported by grants from the National Natural Science Foundation of China (30230030, 30470119, 30300033, and 30500049). C.W. is supported by National Institutes of Health grants and an OOCS grant from the Chinese Academy of Sciences.
| Footnotes |
|---|
1 Present address: 1101 E 57th Street, Chicago, IL 60637
Jianzhi Zheng, Associate Editor
| References |
|---|
|
|
|---|
Akey JM, Eberle MA, Rieder MJ, Carlson CS, Shriver MD, Nickerson DA, Kruglyak L. Population history and natural selection shape patterns of genetic variation in 132 genes. PLoS Biol (2004) 2:e286.[CrossRef][Medline]
Akey JM, Zhang G, Zhang K, Jin L, Shriver MD. Interrogating a high-density SNP map for signatures of natural selection. Genome Res. (2002) 12:1805–1814.
Andolfatto P, Przeworski M. Regions of lower crossing over harbor more rare variants in African populations of Drosophila melanogaster. Genetics (2001) 158:657–665.
Barton NH. The effect of hitch-hiking on neutral genealogies. Genet. Res. (1998) 72:123–133.[CrossRef][Web of Science]
Baudry E, Depaulis F. Effect of misoriented sites on neutrality tests with outgroup. Genetics (2003) 165:1619–1622.
Biswas S, Akey JM. Genomic insights into positive selection. Trends Genet. (2006) 22:437–446.[CrossRef][Web of Science][Medline]
Braverman JM, Hudson RR, Kaplan NL, Langley CH, Stephan W. The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics (1995) 140:783–796.[Abstract]
Carlson CS, Thomas DJ, Eberle MA, Swanson JE, Livingston RJ, Rieder MJ, Nickerson DA. Genomic regions exhibiting positive selection identified from dense genotype data. Genome Res. (2005) 15:1553–1565.
Charlesworth B. Background selection and patterns of genetic diversity in Drosophila melanogaster. Genet Res. (1996) 68:131–149.[Web of Science][Medline]
Charlesworth B, Morgan MT, Charlesworth D. The effect of deleterious mutations on neutral molecular variation. Genetics (1993) 134:1289–1303.[Abstract]
Charlesworth D, Charlesworth B, Morgan MT. The pattern of neutral molecular variation under the background selection model. Genetics (1995) 141:1619–1632.[Abstract]
Depaulis F, Mousset S, Veuille M. Power of neutrality tests to detect bottlenecks and hitchhiking. J Mol Evol (2003) 57(Suppl. 1):S190–200.[CrossRef][Web of Science][Medline]
Ewens WJ. The sampling theory of selectively neutral alleles. Theor Popul Biol. (1972) 3:87–112.[CrossRef][Web of Science][Medline]
Ewens WJ. Mathematical population genetics (2004) Berlin: Springer-Verlag. 188–191.
Fay JC, Wu C-I. Hitchhiking under positive Darwinian selection. Genetics (2000) 155:1405–1413.
Fay JC, Wu C-I. Sequence divergence, functional constraint, and selection in protein evolution. Annu Rev Genomics Hum Genet. (2003) 4:213–235.[CrossRef][Web of Science][Medline]
Fu Y-X. Statistical properties of segregating sites. Theor Popul Biol. (1995) 48:172–197.[CrossRef][Web of Science][Medline]
Fu Y-X. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics (1997) 147:915–925.[Abstract]
Gillespie JH. The cause of molecular evolution (1991) Oxford: Oxford University Press.
Golding B. The effect of purifying selection on genealogies. In: Progress in population genetics and human evolution—Donnelly P, Tavaré S, eds. (1997) New York, NY: Springer-Verlag. 271–285.
Hudson RR. Properties of a neutral allele model with intragenic recombination. Theor Popul Biol. (1983) 23:183–201.[CrossRef][Web of Science][Medline]
Hudson RR. Gene genealogies and the coalescent process. In: Oxford surveys in evolutionary biology—Futuyma D, Antonovics J, eds. (1990) New York, NY: Oxford University Press. 1–44.
Hudson RR. The how and why of generating gene genealogies. In: Mechanisms of molecular evolution—Takahata N, Clark AG, eds. (1993) Sunderland, MA: Sinauer. 23–36.
Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics (2002) 18:337–338.
Hudson RR, Kaplan NL. Gene trees with background selection. In: Non-neutral evolution: theories and molecular data—Golding B, ed. (1994) London: Chapman & Hall. 140–153.
Hudson RR, Kaplan NL. Deleterious background selection with recombination. Genetics (1995) 141:1605–1617.[Abstract]
Hudson RR, Kreitman M, Aguade M. A test of neutral molecular evolution based on nucleotide data. Genetics (1987) 116:153–159.
Innan H. Modified Hudson-Kreitman-Aguade test and two-dimensional evaluation of neutrality tests. Genetics (2006) 173:1725–1733.
Jensen JD, Kim Y, DuMont VB, Aquadro CF, Bustamante CD. Distinguishing between selective sweeps and demography using DNA polymorphism data. Genetics (2005) 170:1401–1410.
Karlin S, McGregor J. Addendum to a paper of W. Ewens. Theor Popul Biol. (1972) 3:113–116.[CrossRef]
Kim Y, Nielsen R. Linkage disequilibrium as a signature of selective sweeps. Genetics (2004) 167:1513–1524.
Kim Y, Stephan W. Detecting a local signature of genetic hitchhiking along a recombining chromosome. Genetics (2002) 160:765–777.
Kimura M. The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. Genetics (1969) 61:893–903.
Kimura M. Average time until fixation of a mutant allele in a finite population under continued mutation pressure: Studies by analytical, numerical, and pseudo-sampling methods. Proc Natl Acad Sci USA (1980) 77:522–526.
Kimura M. The neutral theory of molecular evolution (1983) Cambridge: Cambridge University Press.
Li H, Stephan W. Inferring the demographic history and rate of adaptive substitution in Drosophila. PLoS Genet (2006) 2:e166.[CrossRef][Medline]
Maynard Smith J, Haigh J. The hitch-hiking effect of a favourable gene. Genet Res. (1974) 23:23–35.[Web of Science][Medline]
Nielsen R. Molecular signatures of natural selection. Annu Rev Genet. (2005) 39:197–218.[CrossRef][Web of Science][Medline]
Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Res. (2005) 15:1566–1575.
Przeworski M. The signature of positive selection at randomly chosen loci. Genetics (2002) 160:1179–1189.
Przeworski M, Charlesworth B, Wall JD. Genealogies and weak purifying selection. Mol Biol Evol. (1999) 16:246–252.[Abstract]
Przeworski M, Wall JD, Andolfatto P. Recombination and the frequency spectrum in Drosophila melanogaster and Drosophila simulans. Mol Biol Evol. (2001) 18:291–298.
Sabeti PC, Reich DE, Higgins JM, et al. Detecting recent positive selection in the human genome from haplotype structure. Nature (2002) 419:832–837.[CrossRef][Medline]
Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lander ES. Positive natural selection in the human lineage. Science (2006) 312:1614–1620.
Simonsen KL, Churchill GA, Aquadro CF. Properties of statistical tests of neutrality for DNA polymorphism data. Genetics (1995) 141:413–429.[Abstract]
Slatkin M. An exact test for neutrality based on the Ewens sampling distribution. Genet Res. (1994) 64:71–74.[Web of Science][Medline]
Stephan W, Song YS, Langley CH. The hitchhiking effect on linkage disequilibrium between linked neutral loci. Genetics (2006) 172:2647–2663.
Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics (1983) 105:437–460.
Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics (1989) 123:585–595.
Teshima KM, Coop G, Przeworski M. How reliable are empirical genomic scans for selective sweeps? Genome Res. (2006) 16:702–712.
Thornton K, Andolfatto P. Approximate Bayesian inference reveals evidence for a recent, severe bottleneck in a Netherlands population of Drosophila melanogaster. Genetics (2006) 172:1607–1619.
Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS Biol (2006) 4:e72.[CrossRef][Medline]
Wall JD. Recombination and the power of statistical tests of neutrality. Genet Res. (1999) 74:65–79.[CrossRef][Web of Science]
Wall JD. A comparison of estimators of the population recombination rate. Mol Biol Evol. (2000) 17:156–163.
Wall JD, Andolfatto P, Przeworski M. Testing models of selection and demography in Drosophila simulans. Genetics (2002) 162:203–216.
Wall JD, Hudson RR. Coalescent simulations and statistical tests of neutrality. Mol Biol Evol. (2001) 18:1134–1135. author reply 1136–1138.
Wang ET, Kodama G, Baldi P, Moyzis RK. Global landscape of recent inferred Darwinian selection for Homo sapiens. Proc Natl Acad Sci USA (2006) 103:135–140.
Watterson GA. On the number of segregating sites in genetical models without recombination. Theor Popul Biol. (1975) 7:256–276.[CrossRef][Web of Science][Medline]
Watterson GA. The homozygosity test of neutrality. Genetics (1978) 88:405–417.
Wright S. Evolution in Mendelian populations. Genetics (1931) 16:97–159.
Zeng K, Fu Y.-X, Shi S, Wu C-I. Statistical tests for detecting positive selection by utilizing high-frequency variants. Genetics (2006) 174:1431–1439.
Zeng K, Mano S, Shi S, Wu C-I. Comparisons of site- and haplotype-frequency methods for detecting positive selection. Mol Biol Evol (2007) 24:1562–1574.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C.-H. Chen, T.-J. Chuang, B.-Y. Liao, and F.-C. Chen Scanning for the Signatures of Positive Selection for Human-Specific Insertions and Deletions Gen Biol Evol, November 23, 2009; 2009(0): 415 - 419. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Zeng and B. Charlesworth Estimating Selection Intensity on Synonymous Codon Usage in a Nonequilibrium Population Genetics, October 1, 2009; 183(2): 651 - 662. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Eckert, J. L. Wegrzyn, B. Pande, K. D. Jermstad, J. M. Lee, J. D. Liechty, B. R. Tearse, K. V. Krutovsky, and D. B. Neale Multilocus Patterns of Nucleotide Diversity and Divergence Reveal Positive Selection at Candidate Genes Related to Cold Hardiness in Coastal Douglas Fir (Pseudotsuga menziesii var. menziesii) Genetics, September 1, 2009; 183(1): 289 - 298. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Yu, T. Tang, Q. Qian, Y. Wang, M. Yan, D. Zeng, B. Han, C.-I Wu, S. Shi, and J. Li Independent Losses of Function in a Polyphenol Oxidase in Rice: Differentiation in Grain Discoloration between Subspecies and the Role of Positive Selection under Domestication PLANT CELL, November 1, 2008; 20(11): 2946 - 2959. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lu, Y. Fu, S. Kumar, Y. Shen, K. Zeng, A. Xu, R. Carthew, and C.-I Wu Adaptive Evolution of Newly Emerged Micro-RNA Genes in Drosophila Mol. Biol. Evol., May 1, 2008; 25(5): 929 - 938. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||












