Molecular Biology and Evolution 18:214-222 (2001)
© 2001 Society for Molecular Biology and Evolution
ARTICLE |
Global Patterns of Human DNA Sequence Variation in a 10-kb Region on Chromosome 1
*Department of Ecology and Evolution, University of Chicago;
Human Genetics Center, University of Texas at Houston;
Neurology Research, Phoenix, Arizona;
§Department of Human Genetics, South African Institute for Medical Research, Johannesburg, South Africa;
||Department of Biology, University of Oulu, Finland;
¶Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary; and
**Department of Human Genetics, University of Utah
| Abstract |
|---|
|
|
|---|
Human DNA variation is currently a subject of intense research because of its importance for studying human origins, evolution, and demographic history and for association studies of complex diseases. A
10-kb region on chromosome 1, which contains only four small exons (each <155 bp), was sequenced for 61 humans (20 Africans, 20 Asians, and 21 Europeans) and for 1 chimpanzee, 1 gorilla, and 1 orangutan. We found 52 polymorphic sites among the 122 human sequences and 382 variant sites among the human, chimpanzee, gorilla, and orangutan sequences. For the introns sequenced (8,991 bp), the nucleotide diversity (
) was 0.058% among all sequences, 0.076% among the African sequences, 0.047% among the Asian sequences, and 0.045% among the European sequences. A compilation of data revealed that autosomal regions have, on average, the highest
value (0.091%), X-linked regions have a somewhat lower
value (0.079%), and Y-linked regions have a very low
value (0.008%). The lower polymorphism in the present region may be due to a lower mutation rate and/or selection in the gene containing these introns or in genes linked to this region. The present region and two other 10-kb noncoding regions all show a strong excess of low-frequency variants, indicating a relatively recent population expansion. This region has a low mutation rate, which was estimated to be 0.74 x 10 per nucleotide per year. An average estimate of
12,600 for the long-term effective population size was obtained using various methods; the estimate was not far from the commonly used value of 10,000. Fu and Li's tests rejected the assumption of an equilibrium neutral Wright-Fisher population, largely owing to the high proportion of low-frequency variants. The age of the most recent common ancestor of the sequences in our sample was estimated to be more than 1 Myr. Allowing for some unrealistic assumptions in the model, this estimate would still suggest an age of more than 500,000 years, providing further evidence for a genetic history of humans much more ancient than the emergence of modern humans. The fact that many unique variants exist in Europe and Asia also suggests a fairly long genetic history outside of Africa and argues against a complete replacement of all indigenous populations in Europe and Asia by a small Africa stock. Moreover, the ancient genetic history of humans indicates no severe bottleneck during the evolution of humans in the last half million years; otherwise, much of the ancient genetic history would have been lost during a severe bottleneck. We suggest that both the "Out of Africa" and the multiregional models are too simple to explain the evolution of modern humans. | Introduction |
|---|
|
|
|---|
Human DNA variation is currently a subject of intense research for several reasons. First, DNA variation within and between human populations is of great interest to human geneticists and evolutionists. Second, there is much interest in the utility of single nucleotide polymorphisms (SNPs) in molecular medicine, because SNP markers may be useful in association studies of complex diseases, assessment of individuals' predisposition to diseases, and tailoring of therapies. Third, the availability of long genomic sequences generated by the Human Genome Project and the advent of inexpensive DNA sequencing techniques have made large-scale population studies feasible. In fact, there are now many large-scale population studies of human DNA variation. These include regions containing the genes for, respectively, ß-globin (Harding et al. 1997
We have been pursuing human DNA variation studies in noncoding regions for two purposes. First, we wish to establish a genomewide and worldwide neutrality standard of nucleotide diversity. By "neutrality standard," we mean the level of nucleotide diversity expected in a region in which all mutations are neutral and not directly subject to natural selection. This standard will be a very useful reference, especially for comparison with the levels of nucleotide diversity in coding regions. Obviously, such a standard requires data from many genomic regions, because the level of nucleotide diversity in a region is subject to strong stochastic effects. Second, we wish to study the origin and evolution of modern humans. DNA sequence data from noncoding regions may more accurately reflect human history than data from coding regions, because noncoding regions are not directly subject to natural selection. The majority of past studies on human DNA variation, which are mainly from mitochondrial (mt) DNA, microsatellite DNA, and the Y chromosome, have largely given the impression of a relatively shallow genetic history of humans. These observations have been taken as evidence for the Out of Africa model for the origin of modern humans, which postulates that a founder group of modern humans emigrated from Africa about 100,000 years ago to Europe and Asia and completely replaced all the indigenous populations outside of Africa (Cann, Stoneking, and Wilson 1987
; Stringer and Andrew 1988
). However, recent studies of the ß-globin and the PDHA1 gene regions (Harding et al. 1997
; Harris and Hey 1999
) and a 10-kb noncoding region on chromosome 22 (Zhao et al. 2000
) have revealed an ancient genetic history of humans and suggested that human evolution has been more complex than depicted by the simple Out of Africa model. To attain a better understanding of this issue, it is necessary to obtain sequence data from other noncoding regions.
For the above purposes, we selected a
10-kb region on chromosome 1 and obtained sequence data from worldwide populations. This region contains mostly introns, although it also includes four short exons. The new data were compared with the data from Xq13.3 and 22q11.2 and other regions to study the features of sequence variation within and between populations and were used to infer the genetic history of human evolution.
| Materials and Methods |
|---|
|
|
|---|
Region Selection and Populations Sampled
A 12-kb region corresponding to nucleotide positions 1833330332 in locus HS125H23 on human chromosome 1q24 was selected (GenBank accession number Z94054) because no gene was registered in this region at GenBank. After excluding a region containing a polyA segment and a region containing an MER33, an incomplete AluSx repeat, and an MIR2 repeat, a total of
10 kb of nucleotide sites were selected for sequencing. Initially, no potential coding region was detected in the 12-kb region by XGRAIL. However, upon reexamination, three strong potential coding regions (exons) and a weak potential coding region were detected by both GenScan and GRAIL-EXP (see Results). As these potential exons are short (
155 bp), the region selected covers largely introns. Sixty-one individuals were collected worldwide from 14 human populations in three major geographic areas: 20 Africans (5 South African Bantu speakers, 1 !Kung, 2 Mbuti Pygmies, 2 Biaka Pygmies, 5 Nigerians, 5 Kenyans), 20 Asians (8 Chinese, 3 Japanese, 6 Indians, 3 Yakuts), and 21 Europeans (6 Swedes, 2 Finns, 5 French, 5 Hungarians, 3 Italians). One chimpanzee, one gorilla, and one orangutan were used as outgroups.
PCR Amplification and DNA Sequencing
Five primer pairs were designed to amplify three overlapping fragments covering positions 96141 and two overlapping fragments covering positions 688011022 in the 12-kb region. Touchdown PCR (Don et al. 1991
) was used, and the reactions were carried out under the conditions described in Zhao et al. (2000)
. The PCR products were purified with the Wizard PCR Preps DNA Purification Resin Kit (Promega). Sequencing reaction was performed according to the protocol of ABI Prism BigDye Terminator Sequencing Kits (Perkin Elmer) modified by quarter reaction. The extension products were purified by Sephadex G-50 (DNA grade, Pharmacia) and run on an ABI 377XL DNA sequencer using 4.25% gels (Sooner Scientific).
ABI DNA Sequence Analysis 3.0 was used for lane tracking and base calling. The data were then proofread; the fluorescence traces were reread manually and heterozygous sites were detected as double peaks. The segment sequences were assembled automatically using SeqMan in DNASTAR. The assembled files were carefully checked manually using the same program, and variant sites were identified in the aligned sequences in MegAlign in DNASTAR. All of the nucleotides in each segment were sequenced at least once in both directions. Furthermore, all singletons and doubletons, which are defined as variants that appear, respectively, only once and twice in the total sample, were verified by reamplifying the region containing the variant site and resequencing the region in both directions using new internal primers that were close to the site.
Data Analysis
The sequences were aligned by MegAlign in the DNASTAR software package. The human consensus sequence was obtained from the alignment using DNASTAR. The human ancestral sequence was inferred by comparing the human sequences with the outgroup sequences using the maximum-parsimony principle.
For a DNA sequence subject to no natural selection, the mutation rate per sequence per generation (µ) is estimated by
![]() | (1) |
20 years for humans). Watterson's (1975)
= 4Neµ, where Ne is the effective population size.
Tajima's (1989)
test and Fu and Li's (1993)
tests were used to test the selective neutrality of the region studied; a program is available at http://hgc.sph.uth.tmc.edu/fu. The critical points (values) for the neutrality tests were obtained from 5,000 simulated samples. Fu's (1996)
and Fu and Li's (1997)
methods were used to estimate the age of the most recent common ancestor (MRCA) of the DNA sequences in a sample. We computed both the mode and mean of the age (T) of the MRCA in years.
Note that all of the above computations require only segregating site data but do not require haplotype data.
| Results and Discussion |
|---|
|
|
|---|
Sequence Data
We sequenced
9,626 nucleotide sites in the selected region in 61 humans (AF310265AF310325), 1 chimpanzee (AF310683), 1 gorilla (AF310682), and 1 orangutan (AF310681). The human consensus and ancestral sequences were obtained as explained in Materials and Methods. The GC contents of the human consensus, human ancestral, chimpanzee, gorilla, and orangutan sequences were
31.5%, which is much lower than the genome average of 42%. Thus, the region studied was GC-poor.
Three strong potential coding regions (exons) in the selected 12-kb segment were predicted by GenScan (each with a probability >95.5%) and GRAIL-EXP; these potential exons were at sites 2352223632, 2613126285, and 2754227679 in locus HS125H23 (Z94054), respectively. In fact, a BLAST search showed that the amino acid sequence translated from these three exons was 86% similar to a segment of a human membrane protein CH1 (GenBank accession number AF097535). Later on a BLAST search of the GenBank with the nucleotide sequence of the
10 kb region indicated that four exons are similar to the human membrane protein CH1 (similarity > 99%) and the amino acid sequence translated from these four exons is identical to that translated from this gene. These exons are from site 21,773 to 21,888, site 26,128 to 26,285, site 27,542 to 27,679, and site 27,902 to 28,056 in locus HS125H23 (Z94054), respectively.
Pattern of Sequence Variation
A total of 48 variant sites were found in the alignment of human sequences; 19 of them were observed only once (i.e., singletons), 7 were observed twice (i.e., doubletons), and 22 were observed more than twice (i.e., others) (table 1
). Two variant sites (one synonymous and one nonsynonymous singleton) were observed in the second exon, while all of the remaining 46 variant sites were found in introns. All singletons and doubletons were verified as explained in Materials and Methods, and no error was found. In addition to the 48 single-nucleotide variants, we found 4 insertions/deletions (indels) among the 122 human sequences. On average,
5 variant sites per 1,000 bp were found in the region studied.
|
The numbers of variant sites (excluding indels) in the African, Asian, and European sequences were 29, 20, and 16, respectively (table 1 ). Thus, Africans had the largest number of variants. The pattern of sequence variation in Africans differed somewhat from that in non-Africans. For example, less than one third of the variant sites in Africans were singletons, whereas close to one half of the variant sites in non-Africans were singletons. Interestingly, the proportions of unique variant sites in each of the three continents were high: 20 (69%, including 7 singletons), 11 (55%, 8 singletons), and 8 (50%, 4 singletons) among the African, Asian, and European sequences, respectively. This observation suggests a substantial degree of isolation between continents.
Table 1
also includes the patterns of sequence variation in the 10-kb noncoding regions on chromosomes X and 22 (Kaessmann et al. 1999
; Zhao et al. 2000
). Note that among the African sequences, the number of low-frequency variants (i.e., singletons and doubletons) was smaller than that of high-frequency variants (i.e., others) in the present region, whereas the opposite was true for the two other regions. On the other hand, among non-African sequences, the number of low-frequency variants was larger than that of the high-frequency variants in the present region, whereas the opposite was true for the other two regions. Thus, although a stronger excess of low-frequency variants in Africans than in non-Africans was observed in the previous two regions, the opposite was found in the present region. This difference in pattern notwithstanding, two features common to the three regions were noted. First, there were more variants in the African sample than in the non-African sample, despite the number of Africans studied being less than half that of non-Africans studied. Thus, Africans were considerably more polymorphic than non-Africans, in agreement with previous observations (Cann, Stoneking, and Wilson 1987
; Kaessmann et al. 1999
; Zhao et al. 2000
). Second, the number of low-frequency variants (singletons and doubletons) in the total sample was larger than the number of high-frequency variants, e.g., 26 versus 22 in the present region. This excess of low-frequency variants in all three regions is in sharp contrast to the situations for the dystrophin and PDHA1 genes (Zietkiewicz et al. 1998
; Harris and Hey 1999
) and suggests a relatively recent population expansion, because such an excess is not expected from an equilibrium Wright-Fisher population.
The present region was considerably less polymorphic than the region on chromosome 22it had fewer high-frequency variants, especially among non-Africans, and a much smaller number of doubletons. The relatively low polymorphism may be due to a lower mutation rate (see below) and selection in the gene containing these introns or in genes linked to this region. The X-linked region was even less variable (table 1
). This may be because an X-linked region has a smaller effective population size than an autosomal region (3Ne/4 vs. Ne) and because the X region has a low recombination rate (Kaessmann et al. 1999
), so that compared to the other two regions, it is subject to stronger background selection (i.e., effects of deleterious mutations in genes linked to the region) or selective sweep (effects of positive selection in genes linked to the region) (Begun and Aquadro 1992
; Charlesworth 1994
).
A comparison of all sequences, including the chimpanzee, gorilla, and orangutan sequences, revealed 382 variant sites, 44 of which were indels. The 382 variant sites were evenly distributed in this region (
2 = 14.4, df = 9, P = 0.11) (table 2
). The number of variant sites in human populations was also evenly distributed (
2 = 9.9, df = 9, P = 0.36).
|
Among the 44 indels in our data, 13, 7, 10, and 10 were 1-, 2-, 3-, and 4-nt indels, respectively. The remaining 4 indels involved 5, 8, or >10 nt. Therefore, the majority of these indels were short.
Mutation Pattern
Comparing the human, chimpanzee, gorilla, and orangutan sequences, we were able to infer the direction of 172 mutations (table 3 ); the proportion of transitional changes was 66%. For the 169 mutations for which the direction could not be inferred, the proportion of transitions was 65% (table 3
). This proportion was between the values (59% and 70%) observed in pseudogenes (Li, Wu, and Luo 1984
) and for the 10-kb region in 22q11.2 (Zhao et al. 2000
). For those mutations whose direction could be inferred, the number of G/C-to-A/T mutations was 57, while that of A/T-to-G/C mutations was 90. According to the GC and AT contents of 68.5% and 31.5%, the expected numbers of G/C-to-A/T mutations and A/T-to-G/C mutations are 100.7 and 46.3, respectively, and a comparison with the observed numbers gives
2 = 3.61 and P = 0.057, which is close to significant. This result suggests that G/C-to-A/T mutations might occur more frequently than A/T-to-G/C mutations, similar to the situation for mammalian pseudogenes (G/C-to-A/T, 64.5%; A/T-to-G/C, 35.5%) (Li 1997
).
|
Neutrality Tests
The assumption that the region under study is subject to no natural selection was tested using the sequence data. Using the critical points obtained from the 5,000 samples we simulated, we found that Tajima's and Fu and Li's tests could not reject the neutral Wright-Fisher model when each continent was considered separately (table 4 ). However, when we used data from more than one continent, the results became different. Although Tajima's test remained nonsignificant, Fu and Li's tests for non-Africans and for all samples were significant (table 4 ); the conclusion became even stronger when indels were included in the tests. The rejection of neutrality was largely due to the high proportion of low-frequency variants. As the region studied was largely noncoding, the rejection of neutrality may be due to two factors: (1) a relatively recent population expansion, which can increase the number of low-frequency variants, and (2) natural selection in the exons or in genes linked to this region. In fact, the region is
44 kb and
110 kb away from a functional gene at its 5' and 3' ends, respectively.
|
As the rejection of the neutrality assumption was largely due to the excess of low-frequency variants, one might wonder whether the excess was due to pooling of data from different populations. However, data pooling actually tends to increase, rather than decrease, the proportion of low-frequency variants, as implied by the result of Fu (1996)
= 4Neµ, where Ne is the effective size of the entire population and µ is the rate of mutation per sequence per generation. The results are shown in table 5
, in which each entry is the ratio of the expected sum of length of external branches and the expected total tree length of the simulated sequence genealogy; the ratio is independent of
. Under the assumption of a single random mating population with the same
, the expected sums of lengths of external branches and all branches are 1 and 1 +
+ ... + 1/(n - 1), respectively, where n is the sample size. As can be seen from table 5
, the more subpopulations or the less migration, the lower the proportion of singletons relative to the total number of mutations in the sample. Clearly, population subdivision tends to reduce the proportion of low-frequency variants.
|
Nucleotide Diversity
Nucleotide diversity (
) is defined as the average number of nucleotide differences per site between two randomly chosen sequences from the population. The
value was calculated using the program DNASP, version 3.0 (Rozas and Rozas 1999
value was 0.058% among all sequences, 0.076% among the African sequences, 0.047% among the Asian sequences, and 0.045% among the European sequences (table 6
). Thus, the
value was largest among Africans. These
values were considerably lower than those for the 10-kb noncoding region in 22q11.2 (Zhao et al. 2000
|
Table 6 presents a list of the
values for various noncoding regions. The
values are usually highest in Africans but are quite similar in Asians and Europeans. For example, the average
values for the autosomal regions are 0.093%, 0.081%, and 0.076% for the Africans, Asians, and Europeans, respectively. These values are somewhat lower than the average
value (0.11%) at fourfold-degenerate sites in 49 genes. However, the number of noncoding regions studied is small, so no general conclusion can be drawn yet.
A large variation in
is seen among regions (table 6
). In particular, the 5' and 3' flanking regions of the ß-globin gene and the ß-globin replication origin initiation region (IR) have high
values (Harding et al. 1977; Fullerton et al. 2000
). The high
value in the IR has been speculated to be due to a high mutation rate because of the peculiar feature of the DNA unwinding element in the IR (Fullerton et al. 2000
). On average, the autosomal regions have the highest
value (0.091%), the X-linked regions have a somewhat lower
value (0.079%), and the Y-linked regions have a very low
value (0.008%). These differences may be partly due to the fact that the relative effective population sizes are Ne, 3Ne/4, and Ne/4 for an autosomal, an X-linked, and a Y-linked sequence, respectively. However, the extremely low
value for Y-linked sequences may be mainly due to background selection and selective sweep, because there is no recombination in the Y chromosome except for the pseudoautosomal region. Background selection and selective sweep should have, on average, stronger effects on an X-linked region than on an autosomal region because of a lower average recombination rate in the X chromosome than in an autosome, partly accounting for the lower
value for X-linked regions. The number of Alu sequences studied is small, but the data suggest a higher average
value for Alus than for other regions. This is not surprising, because Alus have higher mutation rates due to the presence of a high frequency of CpG dinucleotides.
Mutation Rate,
, Ne
The average numbers of nucleotide substitutions per site were 0.62% between human and chimpanzee sequences, 1.07% between human and gorilla sequences, and 2.44% between human and orangutan sequences; here, we exclude the four exons. The mutation rates (v) were estimated to be 0.52 x 10-9, 0.67 x 10-9, and 1.02 x 10-9 per nucleotide site per year based on divergence times of 6 Myr between humans and chimpanzees, 8 Myr between humans and gorillas, and 12 Myr between humans and orangutans, respectively; other divergence dates are also considered in table 7
. The first two values are considerably smaller than the third, but the differences may be largely due to stochastic fluctuations. The average for the three estimates is 0.74 x 10-9. This value is considerably lower than the estimate (1.15 x 10-9) obtained from the 10-kb region on chromosome 22, but it is consistent with the lower nucleotide diversity in this region than in the 10-kb region on chromosome 22. It is possible that this region has a lower (neutral) mutation rate.
|
Several methods are available for estimating the parameter
= 4Neµ, where µ = vgL, g is the generation time (20 years), and L = 8,991 bp (see eq. 1). If we use the commonly used value 10,000 for Ne (Takahata 1993
varies with the assumption of divergence dates (table 7
). For the average mutation rate obtained above, we obtained
= 5.32. For the two commonly used methods, one known as Watterson's (1975)
= 8.55 and 5.26, respectively. Note that these two methods are not optimal in terms of minimizing variance. The minimum variance estimator (BLUE) by Fu (1994)
= 11.50. BLUE and Watterson's (1975)
values, mainly because there was an excess of singletons (table 1
). To avoid the effect of the excess of low-frequency variants, we excluded singletons and doubletons from analysis and obtained the BLUE estimate of
= 6.30, which is similar to the other two estimates.
If we know the mutation rate, we can estimate the effective population size Ne from
. As the estimate of the mutation rate varies with the assumption of the divergence dates, the estimate of Ne also varies (table 7
). Moreover, it also depends on the estimation methods used (table 7
). If we use the average mutation rate obtained above and the average (6.70) of the
values estimated by Watterson's, Tajima's, and the BLUE methods, we obtain Ne = 12,600, which is not far from the commonly used value of 10,000 (Takahata 1993
).
Age of the MRCA
To estimate the age (T) of the MRCA of the sequences in a sample, the values of both Ne and mutation rate per sequence per generation (u) are required. As mentioned above, the estimate of mutation rate depends on the species pair and the divergence dates used. For simplicity, we shall use the average mutation rate obtained above, i.e., v = 0.74 x 10-9 changes per site per year and u = 1.33 x 10-4 changes per sequence per generation in humans; the estimate of T increases with decreasing u. Table 8
presents the estimates of T for several values of effective population sizes for the entire sample, the subsample of sequences from Africa only, and the subsample of non-African sequences only, respectively. If the commonly used Ne = 10,000 was assumed, the mode estimate (Tmode) and mean estimate (Tmean) were, respectively, 1,376,000 and 1,559,000 years for the entire sample. These estimates were comparable to our previous estimates based on the polymorphism data from a 10-kb region on chromosome 22 (Zhao et al. 2000
). The estimates based on the African sample were only somewhat smaller than those based on the entire sample, while those based on the non-African sample were the smallest. This pattern was also consistent with our previous study (Zhao et al. 2000
).
|
It should be pointed out that the method used to estimate the age of the MRCA makes the assumption that the sample was taken from a random-mating population with a constant effective population size. However, the fact that there are many unique variants in each of the three continents (see above) suggests that this assumption may not be appropriate. The existence of more singletons than expected would tend to inflate our estimates of the age of the MRCA. Methods that make a better use of segregating sites of different types are needed for correcting such biases, and one of us is in the process of developing such methods. Note that the alternative method developed by Griffiths and Tavaré (1994)
In summary, like the data from the PDHA1 locus (Harris and Hey 1999
) and the 10-kb region on chromosome 22 (Zhao et al. 2000
), the present data also provide evidence for a genetic history of humans that is much more ancient than the emergence of modern humans. The observation that both the region on chromosome 22 and the present region show an ancient genetic history outside of Africa argues against a complete replacement of all indigenous populations in Europe and Asia by an African stock. Moreover, the ancient genetic history of humans indicates no severe bottleneck during the evolution of humans in the last half million years, because much of the ancient genetic history would have been lost during a severe bottleneck. On the other hand, the fact that most available nuclear DNA variation data, as well as mitochondrial DNA data, show a considerably shallower genetic history in Asia and Europe than in Africa suggests that human evolution has not occurred in parallel in different parts of the Old World, as depicted by the multiregional model. Thus, both the Out of Africa and the multiregional models appear to be too simple to explain the evolution of modern humans.
| Acknowledgements |
|---|
|
|
|---|
We thank Drs. J. B. Clegg, Marie Lin, and Maryellen Ruvolo for DNA samples. This work was supported by NIH grants GM55759 (W.-H.L.), GM50428 (Y.-X.F.), GM59290 (L.B.J.) and NSF DEB9707567 (Y.-X.F.).
| Footnotes |
|---|
Keith Crandall, Reviewing Editor
1 Keywords: nucleotide diversity
DNA variation
human evolution
unique variants ![]()
2 Address for correspondence and reprints: Wen-Hsiung Li, Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, Illinois 60637. E-mail: whli{at}uchicago.edu ![]()
| literature cited |
|---|
|
|
|---|
Begun, D. J., and C. F. Aquadro. 1992. Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melangaster. Nature 356:519520.
Cann, R. L., M. Stoneking, and A. C. Wilson. 1987. Mitochondrial DNA and human evolution. Nature 325:3136.
Charlesworth, B. 1994. The effect of background selection against deleterious alleles on weakly selected, linked variants. Genet. Res. 63:213228.[Web of Science][Medline]
Don, R. H., P. T. Cox, B. J. Wainwright, K. Baker, and J. S. Mattick. 1991. Touchdown PCR to circumvent spurious priming during gene amplification. Nucleic Acids Res. 19:4008.
Fu, Y. X. 1994. Estimating effective population size or mutation rate using the frequencies of mutations of various classes in a sample of DNA sequences. Genetics 138:13751386.
. 1996. Estimating the age of the common ancestor of a DNA sample using the number of segregating sites. Genetics 144:829838.
Fu, Y. X., and W. H. Li. 1997. Estimating the age of the common ancestor of a sample of DNA sequences. Mol. Biol. Evol. 14:195199.[Abstract]
. 1993. Statistical tests of neutrality of mutations. Genetics 133:693709.
Fullerton, S. M., J. Bond, J. A. Schneider, B. Hamilton, R. M. Harding, A. J. Boyce, and J. B. Clegg. 2000. Polymorphism and divergence in the ß-globin replication origin initiation region. Mol. Biol. Evol. 17:179188.
Griffiths, R. C., and S. Tavaré. 1994. Ancestral inference in population genetics. Stat. Sci. 9:307319.
Harding, R. M., S. M. Fullerton, R. C. Griffiths, J. Bond, M. J. Cox, J. A. Schneider, D. S. Moulin, and J. B. Clegg. 1997. Archaic African and Asian lineages in the genetic ancestry of modern humans. Am. J. Hum. Genet. 60:772789.[Web of Science][Medline]
Harris, E. E., and J. Hey. 1999. X chromosome evidence for ancient human histories. Proc. Natl. Acad. Sci. USA 96:33203324.
Jaruzelska, J., E. Zietkiewicz, M. Batzer, D. E. C. Cole, J.-P. Moisan, R. Scozzari, S. Tavare, and D. Labuda. 1999. Spatial and temporal distribution of the neutral polymorphisms in the last ZFX intron: analysis of the haplotype structure and genealogy. Genetics 152:10911101.
Jaruzelska, J., E. Zietkiewicz, and D. Labuda. 1999. Is selection responsible for the low level of variation in the last intron of the ZFY locus? Mol. Biol. Evol. 16(11):16331640. J. Mol. Evol. 21:5871.
Kaessmann, H., F. Heißig, A. von Haeseler, and S. Pääbo. 1999. DNA sequence variation in a non-coding region of low recombination on the human X chromosome. Nat. Genet. 22:7881.[Web of Science][Medline]
Li, W. H. 1997. Molecular evolution. Sinauer, Sunderland, Mass.
Li, W. H., and L. Sadler. 1991. Low nucleotide diversity in man. Genetics 129:513523.
Li, W. H., C.-I. Wu, and C.-C. Luo. 1984. Nonrandomness of point mutation as reflected in nucleotide substitutions in pseudogenes and its evolutionary implication. J. Mol. Evol. 21:5871.[Web of Science][Medline]
Nickerson, D. A., S. L. Taylor, K. M. Weiss, A. G. Clark, R. G. Hutchinson, J. Stengard, V. Salomaa, E. Vartiainen, E. Boerwinkle, and C. F. Sing. 1998. DNA sequence diversity in a 9.7-kb region of the human lipoprotein gene. Nat. Genet. 19:233240.[Web of Science][Medline]
Rieder, M. J., S. L. Taylor, A. G. Clark, and D. A. Nickerson. 1999. Sequence variation in the human angiotensin converting enzyme. Nat. Genet. 22:5962.[Web of Science][Medline]
Rozas, J., and R. Rozas. 1999. DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174175.
Stringer, C. B., and P. Andrews. 1988. Genetic and fossil evidence for the origin of modern humans. Science 139:12631268.
Tajima, F. 1983. Evolution relationship of DNA sequences in finite populations. Genetics 105:437460.
. 1989. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585595.
Takahata, N. 1993. Allelic genealogy and human evolution. Mol. Biol. Evol. 10:222.[Abstract]
Watterson, G. A. 1975. On the number of segregation sites. Theor. Popul. Biol. 7:256276.[Web of Science][Medline]
Whitfield, L. S., J. E. Sulston, and P. N. Goodfellow. 1995. Sequence variation of the human Y chromosome. Nature 378:379380.
Zhao, Z., J. Li, Y.-X. Fu et al. (13 co-authors). 2000. Worldwide DNA sequence variation in a 10 kb noncoding region on human chromosome 22. Proc. Natl. Acad. Sci. USA 97:1135411358.
Zietkiewicz, E., V. Yotova, M. Jarnik, et al. (11 co-authors). 1998. Genetic structure of the ancestral population of modern humans. J. Mol. Evol. 47:146155.[Web of Science][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
R. Burgess and Z. Yang Estimation of Hominoid Ancestral Population Sizes under Bayesian Coalescent Models Incorporating Mutation Rate Variation and Sequencing Errors Mol. Biol. Evol., September 1, 2008; 25(9): 1979 - 1994. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Ebersberger, P. Galgoczy, S. Taudien, S. Taenzer, M. Platzer, and A. von Haeseler Mapping Human Genetic Ancestry Mol. Biol. Evol., October 1, 2007; 24(10): 2266 - 2276. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. Evans, N. Mekel-Bobrov, E. J. Vallender, R. R. Hudson, and B. T. Lahn Evidence that the adaptive allele of the brain size gene microcephalin introgressed into Homo sapiens from an archaic Homo lineage PNAS, November 28, 2006; 103(48): 18178 - 18183. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Foll and O. Gaggiotti Identifying the Environmental Factors That Determine the Genetic Structure of Populations Genetics, October 1, 2006; 174(2): 875 - 891. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Cutter, S. E. Baird, and D. Charlesworth High Nucleotide Polymorphism and Rapid Decay of Linkage Disequilibrium in Wild Populations of Caenorhabditis remanei Genetics, October 1, 2006; 174(2): 901 - 913. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhao, N. Yu, Y.-X. Fu, and W.-H. Li Nucleotide Variation and Haplotype Diversity in a 10-kb Noncoding Region in Three Continental Human Populations Genetics, September 1, 2006; 174(1): 399 - 409. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Cutter Nucleotide Polymorphism and Linkage Disequilibrium in Wild Populations of the Partial Selfer Caenorhabditis elegans Genetics, January 1, 2006; 172(1): 171 - 184. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Carlson, D. J. Thomas, M. A. Eberle, J. E. Swanson, R. J. Livingston, M. J. Rieder, and D. A. Nickerson Genomic regions exhibiting positive selection identified from dense genotype data Genome Res., November 1, 2005; 15(11): 1553 - 1565. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Ray, M. Currat, P. Berthier, and L. Excoffier Recovering the geographic origin of early modern humans by realistic and spatially explicit simulations Genome Res., August 1, 2005; 15(8): 1161 - 1167. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wang, S. D. Thomas, and J. Zhang Relaxation of selective constraint and loss of function in the evolution of human bitter taste receptor genes Hum. Mol. Genet., November 1, 2004; 13(21): 2671 - 2678. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Bosch, M. E. Hurles, A. Navarro, and M. A. Jobling Dynamics of a Human Interparalog Gene Conversion Hotspot Genome Res., May 1, 2004; 14(5): 835 - 844. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Hughes and J. M. Coffin Human endogenous retrovirus K solo-LTR formation and insertional polymorphisms: Implications for human and viral evolution PNAS, February 10, 2004; 101(6): 1668 - 1672. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. T. Marth, E. Czabarka, J. Murvai, and S. T. Sherry The Allele Frequency Spectrum in Genome-Wide Human Variation Data Reveals Signals of Differential Demographic History in Three Large World Populations Genetics, January 1, 2004; 166(1): 351 - 372. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Zhou, Y. Zhai, X. Dong, X. Zhang, F. He, K. Zhou, Y. Zhu, H. Wei, Z. Yao, S. Zhong, et al. Haplotype Structure and Evidence for Positive Selection at the Human IL13 Locus Mol. Biol. Evol., January 1, 2004; 21(1): 29 - 36. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. F. Hammer, F. Blackmer, D. Garrigan, M. W. Nachman, and J. A. Wilder Human Population Structure and Its Effects on Sampling Y Chromosome Sequence Variation Genetics, August 1, 2003; 164(4): 1495 - 1509. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Rannala and Z. Yang Bayes Estimation of Species Divergence Times and Ancestral Population Sizes Using DNA Sequences From Multiple Loci Genetics, August 1, 2003; 164(4): 1645 - 1656. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Kitano, C. Schwarz, B. Nickel, and S. Paabo Gene Diversity Patterns at 10 X-Chromosomal Loci in Humans and Chimpanzees Mol. Biol. Evol., August 1, 2003; 20(8): 1281 - 1289. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Subramanian and S. Kumar Neutral Substitutions Occur at a Faster Rate in Exons Than in Noncoding DNA in Primate Genomes Genome Res., May 1, 2003; 13(5): 838 - 844. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Akey, K. Zhang, M. Xiong, and L. Jin The Effect of Single Nucleotide Polymorphism Identification Strategies on Estimates of Linkage Disequilibrium Mol. Biol. Evol., February 1, 2003; 20(2): 232 - 242. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Webster, N. G. C. Smith, and H. Ellegren Compositional Evolution of Noncoding DNA in the Human and Chimpanzee Genomes Mol. Biol. Evol., February 1, 2003; 20(2): 278 - 286. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang Likelihood and Bayes Estimation of Ancestral Population Sizes in Hominoids Using Data From Multiple Loci Genetics, December 1, 2002; 162(4): 1811 - 1823. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhang, D. M. Webb, and O. Podlaha Accelerated Protein Evolution and Origins of Human-Specific Features: FOXP2 as an Example Genetics, December 1, 2002; 162(4): 1825 - 1835. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Saunders, M. F. Hammer, and M. W. Nachman Nucleotide Variability at G6pd and the Signature of Malarial Selection in Humans Genetics, December 1, 2002; 162(4): 1849 - 1861. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Yu, Y.-X. Fu, and W.-H. Li DNA Polymorphism in a Worldwide Sample of Human X Chromosomes Mol. Biol. Evol., December 1, 2002; 19(12): 2131 - 2141. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Eyre-Walker, P. D. Keightley, N. G. C. Smith, and D. Gaffney Quantifying the Slightly Deleterious Mutation Model of Molecular Evolution Mol. Biol. Evol., December 1, 2002; 19(12): 2142 - 2149. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Fan, E. Linardopoulou, C. Friedman, E. Williams, and B. J. Trask Genomic Structure and Evolution of the Ancestral Chromosome Fusion Site in 2q13-2q14.1 and Paralogous Regions on Other Human Chromosomes Genome Res., November 1, 2002; 12(11): 1651 - 1662. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. V. Rockman and G. A. Wray Abundant Raw Material for Cis-Regulatory Evolution in Humans Mol. Biol. Evol., November 1, 2002; 19(11): 1991 - 2004. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Laporte and B. Charlesworth Effective Population Size and Population Subdivision in Demographically Structured Populations Genetics, September 1, 2002; 162(1): 501 - 519. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Bamshad, S. Mummidi, E. Gonzalez, S. S. Ahuja, D. M. Dunn, W. S. Watkins, S. Wooding, A. C. Stone, L. B. Jorde, R. B. Weiss, et al. A strong signature of balancing selection in the 5' cis-regulatory region of CCR5 PNAS, August 6, 2002; 99(16): 10539 - 10544. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Toomajian and M. Kreitman Sequence Variation and Haplotype Structure at the Human HFE Locus Genetics, August 1, 2002; 161(4): 1609 - 1623. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Yu, F.-C. Chen, S. Ota, L. B. Jorde, P. Pamilo, L. Patthy, M. Ramsay, T. Jenkins, S.-K. Shyue, and W.-H. Li Larger Genetic Differences Within Africans Than Between Africans and Eurasians Genetics, May 1, 2002; 161(1): 269 - 274. [Abstract] [Full Text] [PDF] |
||||
![]() |
X.-S. Zhang, J. Wang, and W. G. Hill Pleiotropic Model of Maintenance of Quantitative Genetic Variation at Mutation-Selection Balance Genetics, May 1, 2002; 161(1): 419 - 433. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Betran, W. Wang, L. Jin, and M. Long Evolution of the Phosphoglycerate mutase Processed Gene in Human and Chimpanzee Revealing the Origin of a New Primate Gene Mol. Biol. Evol., May 1, 2002; 19(5): 654 - 663. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. A. Payseur and M. W. Nachman Gene Density and Human Nucleotide Polymorphism Mol. Biol. Evol., March 1, 2002; 19(3): 336 - 340. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Doris Hypertension Genetics, Single Nucleotide Polymorphisms, and the Common Disease:Common Variant Hypothesis Hypertension, February 1, 2002; 39(2): 323 - 331. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. I. Jensen-Seaman, A. S. Deinard, and K. K. Kidd Modern African Ape Populations as Genetic and Demographic Models of the Last Common Ancestor of Humans, Chimpanzees, and Gorillas J. Hered., November 1, 2001; 92(6): 475 - 480. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.B. Jorde, W.S. Watkins, and M.J. Bamshad Population genomics: a bridge from evolutionary history to genetic medicine Hum. Mol. Genet., October 1, 2001; 10(20): 2199 - 2207. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. D. Makova, M. Ramsay, T. Jenkins, and W.-H. Li Human DNA Sequence Variation in a 6.6-kb Region Containing the Melanocortin 1 Receptor Promoter Genetics, July 1, 2001; 158(3): 1253 - 1268. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







