Skip Navigation


MBE Advance Access originally published online on July 17, 2007
Molecular Biology and Evolution 2007 24(9):2132-2138; doi:10.1093/molbev/msm143
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/9/2132    most recent
msm143v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Turner, T. L.
Right arrow Articles by Hahn, M. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Turner, T. L.
Right arrow Articles by Hahn, M. W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Locus- and Population-Specific Selection and Differentiation between Incipient Species of Anopheles gambiae

Thomas L. Turner* and Matthew W. Hahn{dagger}

* Center for Population Biology, University of California, Davis
{dagger} Department of Biology and School of Informatics, Indiana University

E-mail: tlturner{at}ucdavis.edu.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Anopheles gambiae, the primary mosquito vector of malaria in sub-Saharan Africa, is divided into 2 sympatric incipient species known as M form and S form. Recent genomic analysis of each form revealed that differentiation between forms is clustered into 3 unlinked regions of the genome. Here, we expand the investigation of these "genomic islands of speciation" to multiple populations, including all of the genes across one of the islands. Differentiation between the M and S forms in 2 of the islands is complete across all individuals in all populations, confirming that the M and S forms are reproductively isolated taxa. Differentiation at the third island (on chromosome 2R) is limited to Cameroon populations. There is reduced variation in the M form in Cameroon at this location and increased divergence to the outgroup Anopheles arabiensis, supporting an association of adaptation with reproductive isolation.

Key Words: speciation • mosquito • population genetics • natural selection


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Patterns of molecular variation within populations are shaped by many processes, including genetic drift, natural selection, demographic history, and migration. Although selective forces vary greatly among loci, forces such as drift, demography, and migration are generally considered to act across the genome (Hudson et al. 1987Go; Begun and Whitley 2000Go; Glinka et al. 2003Go; Akey et al. 2004Go; Stajich and Hahn 2005Go). However, when migration occurs between incipient species, it may interact with selection to create different amounts of realized gene flow among loci (Machado et al. 2002Go; Emelianov et al. 2004Go; Payseur et al. 2004Go). This pattern can occur when hybrid genotypes at some loci are disadvantageous, whereas alleles at other loci are selectively neutral with respect to the different species or populations (Barton and Gale 1993Go; Wu and Ting 2004Go).

Previously, we found such a pattern of heterogeneous gene flow between 2 partially isolated, sympatric populations of Anopheles gambiae (Turner et al. 2005Go). Using Affymetrix microarrays, we were able to characterize differentiation across the genomes of the so-called M and S forms of A. gambiae in Cameroon at over 142,000 markers. We identified 3 unlinked regions that showed no evidence for gene flow (speciation islands), even as the other 99% of the genome showed little differentiation between the M and S forms (Turner et al. 2005Go). These differentiated regions are therefore expected to contain loci which are disadvantageous in a hybrid genetic background but advantageous within each form. However, we do not know which genes are the targets of selection or whether the alleles underlying these selective effects are found throughout the ranges of each form. Because A. gambiae is the primary vector of human malaria, understanding how M and S are different is also relevant to human health. These forms mate assortatively in nature (Tripet et al. 2001Go), implicating behavioral differences between them, but other phenotypic and ecological differences have not been conclusively demonstrated. Finding the genes that differentiate M and S may aid in this investigation by generating testable hypotheses from the bottom up.

Therefore, in this paper, we present an analysis of sequence variation and divergence between the M and S forms from Cameroon and Mali. We have also included individuals with and without the 2Rb inversion: this inversion is polymorphic in both M and S and spans one of the candidate speciation islands in Cameroon (Coluzzi et al. 2002Go). We have 2 main aims: 1) to investigate whether loci within the speciation islands show signatures of positive selection which could be related to speciation and 2) to determine whether the speciation islands identified previously in Cameroon are also present in Mali populations.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Anopheles gambiae samples were collected in the towns of Buea, Mutanguene, and Tiko in Cameroon in 2003 and in Selenkenyi in Mali in 2004 by the lab of G. C. Lanzaro (University of California, Davis), who graciously shared DNA from karyotyped mosquitoes for this study. Samples of A. gambiae M and S from Cameroon all possess the +/+ (standard) karyotype on chromosome 2R; Mali 2R karyotypes used in this study are as follows: Mopti (M form) individuals are bc/bc, u/+, and bc/+; Savanna (S form) individuals are cu/cu and b/b; Bamako (S form) individuals are jcu/jcu, jbcu/jcu, and jbcu/jbcu (following Touré et al. 1998Go). Samples of the congeneric species Anopheles arabiensis are from the SENN DDT colony, and Anopheles quadriannulatus sp. A are from the SKUQUA colony; both colonies are maintained in South Africa by M. Coetzee. These outgroup individuals were not karyotyped, but M. Coetzee (personal communication) reports that they both possess the "standard" karyotype for their species. DNA was extracted from A. gambiae following Post et al. (1993)Go and from A. arabiensis and A. quadriannulatus sp. A (hereafter A. quadriannulatus)using a Qiagen DNeasy kit; standard polymerase chain reaction (PCR) diagnostics were used to differentiate A. gambiae from A. arabiensis (Scott et al. 1993Go) and to differentiate M and S forms (Fanello et al. 2003Go). Polytene chromosome preparation and analysis of karyotypes (conducted in the lab of G. C. Lanzaro) are as described in Hunt (1973)Go. Primers for PCR and sequencing were designed from the A. gambiae genome annotation (http://www.ensembl.org/Anopheles_gambiae/index.html) and are available in supplementary table S1 (Supplementary Material online), which contains detailed information about each locus.

For 6 loci, on chromosome 2L (LIM and Subtilase) and 1 on chromosome 2R (GPRor38), sequence data from Cameroon populations of A. gambiae come from the previous study of Turner et al. (2005)Go. We sequenced 3 of these loci and 1 additional locus (MSH5) from individuals in Mali as well as in the related species A. arabiensis and A. quadriannulatus. We also sequenced 5 additional loci on chromosome 2R, in populations from Cameroon. All new sequences have been deposited in GenBank under accession numbers DQ425111–DQ425208, DQ436826–DQ436911, and EF426141-EF426244. PCR products were directly sequenced in both directions for all loci except NADH1b, pepM19, bkinase, and cd59, which were cloned using an Invitrogen TOPO TA kit. Sequence chromatographs were assembled and edited in CodonCode (www.codoncode.com), which uses ABI quality scores and Phred/Phrap to call bases and find heterozygous single-nucleotide polymorphisms (SNPs). Sequences were aligned using ClustalW (Higgins et al. 1994Go). Upon further analysis, the locus UNK1 reported in Turner et al. (2005)Go proved to be suspect due to close paralogous sequences in the A. gambiae genome, so this locus was not used in the current analysis.

Calculation of measures of population variation including Tajima's D, a measure of the skew of the frequency spectrum of mutations (Tajima 1989Go); {pi}, the average number of differences between sequences (Tajima 1983Go); and FST, a measure of population differentiation (Wright 1951Go) were done using DNAsp v 4.00.5 (Rozas et al. 2003Go). All values of FST less than 0 were set to 0 in the text. Calculation of nonsynonymous and synonymous polymorphism and divergence—in order to perform the test of McDonald and Kreitman (1991)Go—and average levels of divergence, Dxy, were also done using DNAsp. HKA tests (Hudson et al. 1987Go) were performed using Jody Hey's HKA program available at http://lifesci.rutgers.edu/~heylab/HeylabSoftware.htm. Significance of HKA tests was determined by conducting 10,000 coalescent simulations as implemented in the HKA software (Kliman et al. 2000Go). These coalescent simulations were also used to assess the significance of Tajima's D values (Kliman et al. 2000Go).


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Summary statistics of levels of polymorphism and differentiation are shown in table 1 (for loci on 2L) and table 2 (for loci on 2R). Individuals of the S form from Mali include both the Bamako and Savanna "chromosomal forms" (sensu Touré et al. 1998Go), which are combined as "S form" in table 1; all M form individuals sequenced from Mali are the Mopti chromosomal form. Differentiation between M and S along the X chromosome has been well documented (Wang et al. 2001Go; Stump et al. 2005Go), so X-chromosome loci were not sequenced from Mali. Because all individuals are identified as M or S based on a PCR diagnostic which queries this region, however, the X-chromosome genotype is known. Loci on 2R and 2L were chosen to compare sequence variation within and outside the differentiated regions originally found in the Cameroon population.


View this table:
[in this window]
[in a new window]

 
Table 1 DNA Variation and Differentiation in 2 Loci on Chromosome 2L

 

View this table:
[in this window]
[in a new window]

 
Table 2 DNA Variation and Differentiation in 10 Loci on Chromosome 2R

 
Chromosome 2L
Data from 2 loci on chromosome 2L were sequenced in Mali: LIM was previously found to lie within the region of no gene flow, and Subtilase serves as a nearby control locus. The pattern of differentiation in Mali and Cameroon is the same, with fixed differences and no shared polymorphisms between M and S at the LIM locus, and shared polymorphisms without fixed differences at the Subtilase locus (table 1). The 4 SNPs fixed between forms in Cameroon are fixed between forms in all individuals sequenced regardless of geography or chromosomal form, indicating perfect correspondence of these SNPs with the M and S form diagnostic region on the X chromosome. The difference in number of fixed and shared polymorphisms between the differentiated and undifferentiated regions is highly significant in both Cameroon and Mali (Fisher's exact test; Cameroon: P = 0.0010; Mali: P = 0.0014). Although this difference could, in principle, be due to lineage sorting between loci after the cessation of gene flow, the inference of locus-specific selection against gene flow within the 2L speciation island is supported by the observation of continued hybridization in Mali (Tripet et al. 2001Go) and by the coalescent simulations of Turner et al. (2005)Go. As expected from the pattern of fixed and shared polymorphisms, differentiation is high between M and S at the LIM locus (FST = 0.950 in Mali, 0.909 in Cameroon) and much lower at the Subtilase locus (FST = 0.110 in both populations). Differentiation within each form between countries is low (table 3), with FST values ranging from 0.091 to 0 at both loci on chromosome 2L.


View this table:
[in this window]
[in a new window]

 
Table 3 Differentiation between Cameroon and Mali

 
Levels of polymorphism are also similar between Cameroon and Mali: the LIM locus shows 10–12 times lower values of {pi} than the Subtilase locus in both populations (table 1). The much lower levels of polymorphism within nonintrogressing regions are one possible indication of ongoing natural selection against hybrid individuals at a linked locus. Another potential indication of linked selection is the significantly negative values of Tajima's D at the LIM locus in the Cameroon population of the S form (–1.83), which is the only locus in the differentiated region with more than 2 polymorphisms in either population (we did not calculate Tajima's D for loci with 2 or fewer polymorphisms). To further investigate possible patterns of natural selection, we sequenced LIM and Subtilase in 2 additional species of the A. gambiae sensu lato complex: A. arabiensis and A. quadriannulatus. Using the outgroup sequences, we find that of the 4 nt fixed between M and S, 2 are derived in M, 1 is derived in S, and 1 is ambiguous (A. gambiae M matches the A. quadriannulatus genotype and A. gambiae S matches A. arabiensis). We compared polymorphism and divergence at the LIM and Subtilase loci using the HKA test (Hudson et al. 1987Go). This test would have low power in a comparison of the M and S forms because of extremely low divergence between forms (Ford and Aquadro 1996Go); instead, we compared each form with the closest outgroup, A. arabiensis. The test was significant for both the M form (P = 0.021) and the S form (P = 0.032). It should be noted, however, that the proximity of LIM locus to the centromere—where there is likely reduced recombination (Turner et al. 2005Go)—could result in reduced polymorphism and a skew in the frequency spectrum due to linked selection on genes unrelated to reproductive isolation (Maynard Smith and Haigh 1974Go; Kaplan et al. 1989Go).

Chromosome 2R
In contrast to the highly differentiated 2L and X islands, the region identified on chromosome 2R was only marginally significant in the previous whole-genome microarray analysis (Turner et al. 2005Go). A differentiated region was predicted on chromosome 2R between megabase 24.847 and megabase 24.884—a 37-kb span—but confidence intervals on this estimate are unknown.

To precisely define this region, we sequenced 10 loci across the boundaries of the previously defined island of speciation in populations from Cameroon (table 2). The sequenced loci include portions of all annotated genes between megabase 24.850 and megabase 24.908, which includes all of the genes within the boundaries of the previously identified speciation island. As shown in figure 1, there is a small region of differentiation coincident with reduced variation in the M form very close to the predicted island. Of the 10 loci, 2 show fixed differences with no shared polymorphisms between M and S individuals (MSH5 and i8), whereas a third (GPRor38) has no fixed differences but no shared polymorphisms either. All 3 of these loci have average values of FST above 0.6 between M and S forms. One of these highly differentiated loci lies within the island of differentiation defined in Turner et al. (2005)Go; the other 2 lie just outside the previously defined boundaries (fig. 1). The differences in the fixed:shared polymorphism ratio between these 3 loci and the adjacent 7 loci are highly significant (Fisher's exact test; P = 3.9 x 10–8).


Figure 1
View larger version (18K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Variation and differentiation at 10 loci sequenced in Cameroon. The region predicted to be differentiated by Turner et al. (2005)Go is shown between the gray lines.

 
In order to examine the geographical extent of differentiation on chromosome 2R, we sequenced GPRor38 and MSH5 in populations from Mali. In contrast to the patterns found in Cameroon, these 2 loci have no polymorphisms fixed between forms and share many polymorphisms across M and S in Mali (table 2; dividing the Mali sample into groups with and without the 2Rb inversion does not change this result). Along with the lack of fixed differences, there is an approximately 6-fold reduction in the values of FST between forms across both loci. As expected from the patterns just described, FST between Mali and Cameroon is higher at loci on 2R than at loci on 2L (table 3).

Levels of polymorphism at the loci in and around the 2R speciation island are much lower in the M form than in the S form in Cameroon, suggesting that a selective sweep occurred within the M form (fig. 1). To test for the signature of a selective sweep in this region, we again conducted multilocus HKA tests to the closest outgroup, A. arabiensis, using all 4 loci we have sequences for in A. arabiensis (LIM, Subtilase, GPRor38, and MSH5). An HKA test between the M form and A. arabiensis is significant (P = 0.014), but the same test comparing the S form and A. arabiensis is not significant (P = 0.122), as would be expected if there has been a selective sweep only in the M form lineage since its split with the S form. We also conducted the M form HKA test excluding the LIM locus (which may be subject to linked selection; see above), and the test remained significant (P = 0.017). The significant result is therefore due to low polymorphism in M at GPRor38 (0.003) and MSH5 (0.001) compared with Subtilase (0.012) and all the 3 loci in S (0.010–0.013). High divergence at GPRor38 (5.7%) may also contribute to the significant result (all other loci in M and S = 1.1%–2.3%). Although the Tajima's D value of GPRor38 in the M form in Cameroon was only marginally significant (0.05 < P < 0.10) in a model-based test (Tajima 1989Go), it was found to be significant based on coalescent simulations (P = 0.036), providing further support for a selective sweep near this locus in the M form. The low values of heterozygosity in the M form in Cameroon are not seen in Mali, suggesting that the selective sweep has occurred only in the Cameroon population of the M form.

The percent divergence (Dxy) between the M form in Cameroon and the 2 outgroups at GPRor38 (5.7% to A. arabiensis, 7.2% to A. quadriannulatus) is much greater than the range of divergence from the other 3 loci in M for which we have data (1.4–2.1% to A. arabiensis, 1.3–2.9% to A. quadriannulatus). We used the test suggested by McDonald and Kreitman (1991)Go to determine whether the high levels of divergence at the GPRor38 locus in the Cameroon M form could be due to directional selection on the protein sequence of this olfactory receptor. Using A. arabiensis as an outgroup, this test revealed a marginally significant departure from neutral expectations, despite low power, with only 13 sites considered (synonymous: 7 polymorphic, 1 fixed; nonsynonymous: 1 polymorphic, 4 fixed; 1-tailed P = 0.032). The same test to the more distant outgroup A. quadriannulatus was not significant (synonymous: 7 polymorphic, 7 fixed; nonsynonymous: 1 polymorphic, 5 fixed; 1-tailed P = 0.185). Because the comparison to the more distant A. quadriannulatus includes the continued fixation of synonymous mutations, any signal of recent positive selection would be harder to detect. Together, the tests presented here suggest that adaptive natural selection has acted on the protein sequence of this highly differentiated olfactory receptor. There is no evidence of this adaptation in the S form in Cameroon (synonymous: 7 polymorphic, 1 fixed; nonsynonymous: 5 polymorphic, 1 fixed; P = 0.987).


    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Some models of speciation with gene flow predict that the loci responsible for ecological, behavioral, and other isolating traits will not introgress freely between diverging taxa (Barton and Gale 1993Go; Wu and Ting 2004Go). Within A. gambiae, we have the opportunity to test this prediction with multiple sympatric populations of the M and S forms. We previously found 3 regions of the genome that were highly differentiated between M and S in Cameroon: one each on chromosomes 2L, 2R, and X (Turner et al. 2005Go). The differentiation in these speciation islands indicates that gene flow between forms is reduced relative to the rest of the genome, so these regions are hypothesized to contain genes involved in reproductive isolation (i.e., genes that have different fitness effects in the genomic backgrounds of M and S). We have now found that the island on chromosome 2L is present in both Mali and Cameroon: the 4 nt differences at the LIM locus on chromosome 2L are fixed between M and S in all 52 chromosomes sequenced, including those from both Mali and Cameroon. Because all of these individuals were identified as M or S based on their X-chromosome genotype, this means that the X and 2L regions are in complete association in our sample. This is significant, as the existence of 2 differentiated regions between M and S, each on an independently assorting chromosome, provides strong support for the existence of 2 partially isolated taxa within A. gambiae. The control locus just outside the island on 2L remains undifferentiated in both populations, providing further evidence of locus-specific selection against gene flow across A. gambiae.

We infer the action of positive selection in the 2L island as evidenced by reduced polymorphism and significant HKA tests in both forms, and a significant value of Tajima's D at the only locus with more than 2 polymorphisms; similar evidence has previously been found in the X-chromosome island (Stump et al. 2005Go). In both cases, however, the linked selection may be unrelated to speciation: both of these islands are near centromeres and are likely to have reduced recombination. Regions of reduced recombination show lower levels of polymorphism and/or an excess of low-frequency mutations in almost every organism that has been examined (e.g., Begun and Aquadro 1992Go; Stephan and Langley 1998Go; Nachman 2001Go; Tenaillon et al. 2001Go; Stajich and Hahn 2005Go). It is thought that this pattern is due to linked positive selection (Kaplan et al. 1989Go) and/or background selection (Charlesworth et al. 1993Go). Therefore, our results on chromosome 2L, as well as those of others on the X chromosome (Stump et al. 2005Go), should not be taken as evidence for selection directly associated with reproductive isolation between M and S in A. gambiae.

The third island, on chromosome 2R, shows a quite different pattern. This region is unremarkable in the Mali population, with levels of {pi}, FST, and divergence that are similar to the Subtilase control locus and other loci from chromosomes 2R (this study) and 3R (Turner et al. 2005Go). In the Cameroon population, however, several observations implicate this region as being involved in reproductive isolation. We had previously discovered fixed differences on 2R at the UNK1 locus (Turner et al. 2005Go), but these data are difficult to interpret given that this locus may be a multicopy-transposable element. However, we have now discovered 5 more fixed differences at the MSH5 and i8 loci—which lie just outside the previously defined speciation island—confirming that this region is strongly differentiated between M and S in Cameroon (table 2).

Reduced polymorphism, significant Tajima's D, and significant HKA test all point to a selective sweep in the M form. A significant McDonald–Kreitman test indicates that directional selection on the protein sequence of GPRor38 could be the target of this selective sweep, though this result is tentative because we have sequenced only approximately a third of this gene. Despite considerable effort, we have been unable to clone to rest of this locus from our samples; because the MSH5 locus has also been resistant to sequencing, we suspect that there may be differences between our samples and the reference assembly in this region. In the v36 genome assembly, the only genes predicted to lie in or near the differentiated region are 5; we have sequence data from GPRgr13, GPRor38, GPRor39, MSH5, and CD59 and 2 predicted genes, which appear to be transposons (data not shown). In release 42 of the genome annotation, these 2 predicted transposons were removed, and a small protease inhibitor (locus i8) was further predicted, which we have also sequenced. Despite having sequenced portions of all the genes in this predicted island and successfully defined the extent of differentiation, strong inferences about the target of selection would require resequencing of an approximately 100-kb region (which might reveal differences with the current assembly), and functional studies of interesting candidate substitutions.

Unlike the 2L and X islands, the 2R island is not expected to lie in a region of reduced recombination. The observation of selection at the GPRor38 locus is therefore more likely to be related to reproductive isolation. It is very interesting that differentiation and reduced variation on chromosome 2R are not seen in Mali: this seems to indicate that an isolating barrier was lost in Mali or gained in Cameroon without spreading to Mali. Whatever isolating mechanism is maintained in this chromosomal region in Cameroon is therefore not necessary to maintain isolation in Mali. Hybridization between forms has been directly estimated to occur approximately 1% of the time in Mali (Tripet et al. 2001Go), and hybrid larvae and adults have been found (della Torre et al. 2005Go). No M/S hybrids have been found in Cameroon despite significant sampling effort, perhaps because reproductive isolation is stronger between forms in Cameroon (della Torre et al. 2005Go). Previous studies have found polymorphic isolating barriers between allopatric species, and between species that are sympatric over only some of their range (Ortiz-Barrientos et al. 2004Go; Reed and Markow 2004Go; Kopp and Frank 2005Go). The present results should be taken as further evidence for the lability of isolating barriers across a species range, even between 2 populations that are both sympatric (although this conclusion will remain tentative until additional evidence implicates this region in reproductive isolation). In the speciation model described by Wu and Ting (2004)Go, incipient species become reproductively isolated locus by locus; because of selection against hybrid genotypes at 1 or a few loci, realized gene flow is reduced across the genome, shifting the balance between selection and gene flow toward differentiation at other loci. It is possible that the differentiated region found only in Cameroon on chromosome 2R is such an emerging speciation island, whose differentiation has been facilitated because of selection against hybrids at the 2L and X islands.

Although support for the existence of isolated groups within A. gambiae has existed for some time, the current work helps to clarify what exactly defines these groups. Much confusion has arisen from the switch in focus from inversion frequencies (chromosomal forms sensu Touré et al. 1998Go) to SNPs (molecular forms, Favia et al. 2001Go), and some doubt has remained about whether either form represents a valid taxonomic unit. It was the chromosomal forms that were discovered first; the discovery of marker SNPs on the X chromosome followed, and these SNPs were subsequently investigated throughout the species range (della Torre et al. 2001Go; Gentile et al. 2001Go, 2002Go; della Torre et al. 2005Go). Genotyping of molecular markers is much less ambiguous than determining inversion karyotypes as the actual number of hybrid karyotypes is difficult to resolve (Touré et al. 1998Go). Our discovery of a second differentiated region on chromosome 2L, which is in perfect association with the X-chromosome genotype, makes it clear that the 2L and X SNPs are true indicators of a taxonomic split within A. gambiae. Because M and S form individuals with the same karyotype can be compared in Cameroon, this country is the preferred location for future work on speciation: discoveries made in this country can then be investigated in the more complicated populations in Mali, as was done here. Regardless of which population is studied, it is clear that A. gambiae is a very interesting model organism for the study of speciation with gene flow.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Supplementary table S1 is available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/). Sequence data from this article have been deposited with the GenBank data library under accession numbers: DQ425111–DQ425208, DQ436826–DQ436911, and EF426141-EF426244.


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
We are grateful to G. C. Lanzaro and F. Tripet for providing DNA from karyotyped mosquitoes for this work and to S. V. Nuzhdin, D. J. Begun, and many others for advice and support. This work was supported by National Institutes of Health grant RO1 GM61773–01 and by National Science Foundation grant DEB-0316513 (to S. V. Nuzhdin), by National Institutes of Health grant AI40308 (to G. C. Lanzaro), by the Center for Population Biology at the University of California, Davis (to T.L.T.), and by National Science Foundation grant MCB-0528465 (to M.W.H.).


    Footnotes
 
Michael Nachman, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 

    Akey JM, Eberle MA, Rieder MJ, Carlson CS, Shriver MD, Nickerson DA, Kruglyak L. Population history and natural selection shape patterns of genetic variation in 132 genes. PLoS Biol (2004) 2:e286.[CrossRef][Medline]

    Barton NH, Gale KS. Genetic analysis of hybrid zones. In: Hybrid zones and the evolutionary process—Harrison RG, ed. (1993) New York: Oxford University Press. 13–45.

    Begun DJ, Aquadro CF. Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster. Nature (1992) 356:519–520.[CrossRef][Medline]

    Begun DJ, Whitley P. Reduced X-linked nucleotide polymorphism in Drosophila simulans. Proc Natl Acad Sci USA (2000) 97:5960–5965.[Abstract/Free Full Text]

    Charlesworth B, Morgan MT, Charlesworth D. The effect of deleterious mutations on neutral molecular variation. Genetics (1993) 134:1289–1303.[Abstract]

    Coluzzi M, Sabatini A, della Torre A, Di Deco MA, Petrarca V. A polytene chromosome analysis of the Anopheles gambiae species complex. Science (2002) 298:1415–1418.[Abstract/Free Full Text]

    della Torre A, Fanello C, Akogbeto M, Dossou-yovo J, Favia G, Petrarca V, Coluzzi M. Molecular evidence of incipient speciation within Anopheles gambiae s.s. in West Africa. Insect Mol Biol (2001) 10:9–18.[CrossRef][Web of Science][Medline]

    della Torre A, Tu Z, Petrarca V. On the distribution and genetic differentiation of Anopheles gambiae s.s. molecular forms. Insect Biochem Mol Biol (2005) 35:755–769.[CrossRef][Web of Science][Medline]

    Emelianov I, Marec F, Mallet J. Genomic evidence for divergence with gene flow in host races of the larch budmoth. Proc R Soc Lond B (2004) 271:97–105.[Medline]

    Fanello C, Carneiro I, Ilboudo-Sanogo E, Cuzin-Ouattara N, Badalo A, Curtis CF. Comparative evaluation of carbosulfan- and permethrin-impregnated curtains for preventing house-entry by the malaria vector Anopheles gambiae in Burkina Faso. Med Vet Entomol (2003) 17:333–338.[CrossRef][Web of Science][Medline]

    Favia G, Lanfrancotti A, Spanos L, Siden-Kiamos I, Louis C. Molecular characterization of ribosomal DNA polymorphisms discriminating among chromosomal forms of Anopheles gambiae s.s. Insect Mol Biol (2001) 10:19–23.[CrossRef][Web of Science][Medline]

    Ford MJ, Aquadro CF. Selection on X-linked genes during speciation in the Drosophila athabasca complex. Genetics (1996) 144:689–703.[Abstract]

    Gentile G, della Torre A, Maegga B, Powell JR, Caccone A. Genetic differentiation in the African malaria vector, Anopheles gambiae s.s. and the problem of taxonomic status. Genetics (2002) 161:1561–1578.[Abstract/Free Full Text]

    Gentile G, Slotman M, Ketmaier V, Powell JR, Caccone A. Attempts to molecularly distinguish cryptic taxa in Anopheles gambiae s.s. Insect Mol Biol (2001) 10:25–32.[CrossRef][Web of Science][Medline]

    Glinka S, Ometto L, Mousset S, Stephan W, De Lorenzo D. Demography and natural selection have shaped genetic variation in Drosophila melanogaster: a multi-locus approach. Genetics (2003) 165:1269–1278.[Abstract/Free Full Text]

    Higgins D, Thompson J, Gibson T, Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res (1994) 22:4673–4680.[Abstract/Free Full Text]

    Hudson RR, Kreitman M, Aguade M. A test of neutral molecular evolution based on nucleotide data. Genetics (1987) 116:153–159.[Abstract/Free Full Text]

    Hunt RH. A cytological technique for the study of Anopheles gambiae complex. Parassitologia (1973) 15:137–139.[Medline]

    Kaplan NL, Hudson RR, Langley CH. The "hitchhiking effect" revisited. Genetics (1989) 123:887–899.[Abstract/Free Full Text]

    Kliman RM, Andolfatto P, Coyne JA, Depaulis F, Kreitman M, Berry AJ, McCarter J, Wakeley J, Hey J. The population genetics of the origin and divergence of the Drosophila simulans complex species. Genetics (2000) 156:1913–1931.[Abstract/Free Full Text]

    Kopp A, Frank AK. Speciation in progress? A continuum of reproductive isolation in Drosophila bipectinata. Genetica (2005) 125:55–68.[CrossRef][Web of Science][Medline]

    Machado CA, Kliman RM, Markert JA, Hey J. Inferring the history of speciation from multilocus DNA sequence data: the case of Drosophila pseudoobscura and close relatives. Mol Biol Evol (2002) 19:472–488.[Abstract/Free Full Text]

    Maynard Smith J, Haigh J. The hitch-hiking effect of a favorable gene. Genet Res (1974) 23:23–35.[Web of Science][Medline]

    McDonald JH, Kreitman MK. Adaptive protein evolution at the Adh locus in Drosophila. Nature (1991) 351:652–654.[CrossRef][Medline]

    Nachman MW. Single nucleotide polymorphisms and recombination rate in humans. Trends Genet (2001) 17:481–485.[CrossRef][Web of Science][Medline]

    Ortiz-Barrientos D, Counterman BA, Noor MAF. The genetics of speciation by reinforcement. PLoS Biol (2004) 2:e416.[CrossRef][Medline]

    Payseur BA, Krenz JG, Nachman MW. Differential patterns of introgression across the X chromosome in a hybrid zone between two species of house mice. Evolution (2004) 58:2064–2078.[CrossRef][Web of Science][Medline]

    Post RJ, Flook PK, Millest AL. Methods for the preservation of insects for DNA studies. Biochem Syst Ecol (1993) 21:85–92.[CrossRef][Web of Science]

    Reed LK, Markow TA. Early events in speciation: polymorphism for hybrid male sterility in Drosophila. Proc Natl Acad Sci USA (2004) 101:9009–9012.[Abstract/Free Full Text]

    Rozas J, Sánchez-DelBarrio JC, Messeguer X, Rozas R. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics (2003) 19:2496–2497.[Abstract/Free Full Text]

    Scott JA, Brogdon WG, Collins FH. Identification of single specimens of the Anopheles gambiae complex by the polymerase chain reaction. Am J Trop Med Hyg (1993) 49:520–529.[Abstract/Free Full Text]

    Stajich JE, Hahn MW. Disentangling the effects of demography and selection in human history. Mol Biol Evol (2005) 22:63–73.[Abstract/Free Full Text]

    Stephan W, Langley CH. DNA polymorphism in Lycopersicon and crossing-over per physical length. Genetics (1998) 150:1585–1593.[Abstract/Free Full Text]

    Stump AD, Fitzpatrick MC, Lobo NF, Traoré SF, Sagnon NF, Constantini C, Collins FH, Besansky NJ. Centromere-proximal differentiation and speciation in Anopheles gambiae. Proc Natl Acad Sci USA (2005) 102:15930–15935.[Abstract/Free Full Text]

    Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics (1983) 105:437–460.[Abstract/Free Full Text]

    Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics (1989) 123:585–595.[Abstract/Free Full Text]

    Tenaillon MI, Sawkins MC, Long AD, Gaut RL, Doebley JF, Gaut BS. Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc Natl Acad Sci USA (2001) 98:9161–9166.[Abstract/Free Full Text]

    Touré YT, Petrarca V, Traoré SF, Coulibaly A, Maiga HM, Sankaré O, Sow M, Di Deco MA, Coluzzi M. The distribution and inversion polymorphism of chromosomally recognized taxa of the Anopheles gambiae complex in Mali, West Africa. Parassitologia (1998) 40:477–511.[Medline]

    Tripet F, Toure YT, Taylor CE, Norris DE, Dolo G, Lanzaro GC. DNA analysis of transferred sperm reveals significant levels of gene flow between molecular forms of Anopheles gambiae. Mol Ecol (2001) 10:1725–1732.[CrossRef][Medline]

    Turner TL, Hahn MW, Nuzhdin SV. Genomic islands of speciation in Anopheles gambiae. PLoS Biol (2005) 3:e285.[CrossRef][Medline]

    Wang R, Zheng LB, Toure YT, Dandekar T, Kafatos FC. When genetic distance matters: measuring genetic differentiation at microsatellite loci in whole-genome scans of recent and incipient mosquito species. Proc Natl Acad Sci USA (2001) 98:10769–10774.[Abstract/Free Full Text]

    Wright S. The genetical structure of populations. Ann Eugen (1951) 15:323–354.

    Wu CI, Ting CT. Genes and speciation. Nat Rev Genet (2004) 5:114–122.[CrossRef][Web of Science][Medline]

Accepted for publication July 12, 2007.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Phil Trans R Soc BHome page
R. K Butlin, J. Galindo, and J. W Grahame
Sympatric, parapatric or allopatric: the most important way to classify speciation?
Phil Trans R Soc B, September 27, 2008; 363(1506): 2997 - 3007.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. C. Manoukis, J. R. Powell, M. B. Toure, A. Sacko, F. E. Edillo, M. B. Coulibaly, S. F. Traore, C. E. Taylor, and N. J. Besansky
A test of the chromosomal theory of ecotypic speciation in Anopheles gambiae
PNAS, February 26, 2008; 105(8): 2940 - 2945.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/9/2132    most recent
msm143v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Turner, T. L.
Right arrow Articles by Hahn, M. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Turner, T. L.
Right arrow Articles by Hahn, M. W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?