Skip Navigation


MBE Advance Access originally published online on December 7, 2007
Molecular Biology and Evolution 2008 25(2):417-437; doi:10.1093/molbev/msm272
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
25/2/417    most recent
msm272v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Eveno, E.
Right arrow Articles by Garnier-Géré, P. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Eveno, E.
Right arrow Articles by Garnier-Géré, P. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

"Contrasting Patterns of Selection at Pinus pinaster Ait. Drought Stress Candidate Genes as Revealed by Genetic Differentiation Analyses"

Emmanuelle Eveno*, Carmen Collada{dagger}, M. Angeles Guevara{dagger}, Valérie Léger*, Alvaro Soto{dagger}, Luis Díaz{dagger}, Patrick Léger*, Santiago C. González-Martínez{dagger}, M. Teresa Cervera{dagger}, Christophe Plomion* and Pauline H. Garnier-Géré*

* Institut National de la Recherche Agronomique, Unité Mixte de Recherche 1202 Biodiversity Genes & Communities, Cestas, France
{dagger} Department of Forest Systems and Resources, Forest Research Institute, Centro de Investigación Forestral-Instituto Nacional de Investigació y Tecnología Agraria y Alimentaria, Madrid, Spain

E-mail: pauline{at}pierroton.inra.fr.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher FST values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower FST values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. Outlier patterns are discussed in relation to the different genes' putative involvement in drought tolerance responses, from published results in transcriptomics and association mapping in P. pinaster and other related species. These genes clearly constitute relevant candidates for future association studies in P. pinaster.

Key Words: drought stress • candidate genes • adaptive evolution • Pinus pinaster


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
Natural selection is a powerful force that promotes adaptive phenotypic differentiation between natural populations for fitness-related traits (Schemske 1984Go; Endler 1986Go; Linhart and Grant 1996Go; Kittelson and Maron 2001Go; Latta 2003Go; Kawecki and Ebert 2004Go). Variation in climatic conditions, such as rainfall or temperature, is thought to confer strong selective pressure on plant local adaptation to different environments (Joshi et al. 2001Go; Thomas 2005Go). This can generate clinal variation at fitness-related traits (Huey et al. 2000Go; García-Gil et al. 2003Go; Jump et al. 2006Go). In particular, for outcrossing species, the persistence and repetition of clines at particular genes illustrate the action of spatially varying selection that counterbalances the homogenizing effects of gene flow (Rehfeldt et al. 1999Go; Storz 2002Go; Baines et al. 2004Go). In tree species, and despite interaction with demographic effects, the role of natural selection in promoting local adaptation has been largely demonstrated in common garden experiments (provenance, progeny, and clonal tests) (e.g., Hurme et al. [1997]Go on bud set and frost resistance in Pinus sylvestris; Garnier-Géré and Ades [2001]Go on growth and survival in Eucalyptus delegatensis; Aitken and Hannerz [2001]Go on cold adaptation in conifers; González-Martínez et al. [2002]Go on survival and growth for Pinus pinaster).

At the molecular level, it is much less clear how selection affects patterns of nucleotide variation as demographic factors, such as drift and variation in population size, can produce similar patterns (Tajima 1989Go; Fay and Wu 1999Go; Galtier et al. 2000Go; Nielsen 2001Go; Heuertz et al. 2006Go). Unraveling the molecular basis of adaptation thus still remains a challenging task and, despite being a long-term goal of population genetics and evolutionary biology (Howe and Brunner 2005Go; Wright and Gaut 2005Go), is still a largely untouched area (Schlötterer 2002aGo; Ehrenreich and Purugganan 2006Go). Fundamental questions include the nature, number, and location of genes or mutations involved in adaptation (Barton and Keightley 2002Go) and also the distribution and magnitude of their allelic effects on phenotypes (Storz 2005Go; Wright and Gaut 2005Go).

Traditionally, the approach used to search for genes underlying adaptive trait variation has been to map quantitative trait loci (QTLs), based on significant statistical associations between molecular markers and trait variation, either in known pedigrees (Borevitz and Chory 2004Go) or more rarely in undomesticated natural populations (Slate 2005Go). This approach suffers from large confidence intervals (CIs) around the estimated QTL location, which potentially include tens to hundreds of different candidate genes (Flint and Mott 2001Go; Christians and Keightley 2002Go; Masle et al. 2005Go). This is due to the large blocks of linkage disequilibrium (LD) between markers, generated by the small number of recombination events that typically occurs in 2 or 3 generations' pedigrees.

In the past decade, progress in both public genomic resources and population genomics methods has dramatically changed the prospects to better understand the genetic basis of adaptation (Ford 2002Go; Luikart et al. 2003Go; Wright and Gaut 2005Go). In tree species, there has been a considerable increase in genes' sequence data (Kado et al. 2003Go; Brown et al. 2004Go; Krutovsky and Neale 2005Go; González-Martínez, Krutovsky, and Neale 2006Go). Also, the development of a large body of coalescence-based methods allowing analysis and comparison of nucleotide diversity patterns within and among species (or populations) enables insightful studies of adaptation at the molecular level to be performed (Charlesworth et al. 2001Go; Nielsen 2005Go; Sabeti et al. 2006Go). Intense debates are currently being raised regarding which demographical or historical processes and which forms of selection mainly affect population genomic patterns of variation (Fay and Wu 2000Go; Andofalto 2001Go; Hellmann et al. 2003Go; Haddrill et al. 2005Go). Ideally, population genomics requires to study as many genes or markers along chromosomes as possible, with the aim of distinguishing locus-specific effects from genome-wide patterns due to species structure and demographic histories (Cavalli-Sforza 1966Go; Schlötterer 2002aGo; Luikart et al. 2003Go). These locus-specific effects could either show a higher than expected differentiation due to diversifying selection between populations or a lower than expected differentiation due to balanced selection across populations.

There has been a recent interest in studies involving a larger sampling of distinct populations for the search of adaptive evolution in plants (Schaal and Olsen 2000Go; Schlötterer 2002bGo) as local adaptation may occur on an appropriate time scale for selection detection with population genomics methods (Wright and Gaut 2005Go). Many tree species would constitute good models for this purpose due to their large-sized natural populations with little confounding substructure and domestication history, outcrossing mating systems leading to high gene flow, sufficient nucleotide diversity, and long history of ecological and quantitative genetics research (Howe and Brunner 2005Go; reviewed in Savolainen and Pyhäjärvi 2007Go). In conifers in particular, genome scans are not yet feasible due to their large genome size (e.g., ~25.5 pg/C for P. pinaster as estimated by Chagné et al. [2002Go], i.e., about 170 times the size of the Arabidopsis genome) and rapid decay of LD (reviewed in Neale and Savolainen 2004Go), calling for candidate multigene approaches, which have been advocated in recent reviews (Tabor et al. 2002Go; Vasemägi and Primmer 2005Go). In such approaches, "neutral" expectations accounting for structure and demographic history can be both model and empirically based, valuing previous information available on neutral markers in the targeted species (Beaumont and Nichols 1996Go; Storz 2005Go).

In this study, we report a multilocus scan of differentiation among phenotypically contrasted populations of P. pinaster using recently identified structural candidate genes for water-deficit response, which were identified from expression studies (Dubos and Plomion 2003Go; Dubos et al. 2003Go; Watkinson et al. 2003Go; González-Martínez et al. 2006Go). Looking across the species full range, physiological and gene expression studies in P. pinaster showed large differences in the genes and mechanisms involved in drought tolerance (Nguyen-Queyrens and Bouchet-Lannat 2003Go; Dubos and Plomion 2003Go; Dubos et al. 2003Go), whereas a QTL mapping study (Brendel et al. 2002Go) identified 4 QTLs in a French west coast population for {delta}13C, a time integrated estimate of water use efficiency (Farquhar et al. 1989Go). Pinus pinaster also presents a fragmented geographic distribution, with a relatively high genetic differentiation among populations at putative neutral markers (mean GST varying from 0.10 to 0.17 for range-wide studies using different markers: Petit et al. 1995Go, 2005Go; Bucci et al. 2007Go), compared with other conifer species (e.g., mean GST of around 5% in P. sylvestris [Waldmann et al. 2005Go]). Considering phenotypic variation in P. pinaster, higher levels of population divergence have been observed on adaptive traits such as height and survival (QST of around 0.7 from multisite Spanish provenance trials, González-Martínez et al. 2002Go) that could be explained by contrasted ecological conditions and local adaptation (Alía et al. 1995Go, 1997Go). The discrepancy between molecular and quantitative levels of differentiation suggests the importance of diversifying selection among populations (McKay and Latta 2002Go; Le Corre and Kremer 2003Go).

Sampling in phenotypically distinct populations across the species full range, we compared different coalescent-based approaches for testing whether nucleotide differentiation patterns at both single nucleotide polymorphisms (SNPs) within genes and whole-candidate genes could depart from neutral patterns and result from natural selection. One approach was based on an infinite island model and produced simulated null FST distributions conditional on heterozygosity, using frequentist-like tests to infer outlier patterns (Beaumont and Nichols 1996Go). Null distributions were shown by the authors to be robust to a large range of population structure models and interlocus variation for mutation rates, but we found that this method was less robust in the case of biallelic loci, especially for limited sample sizes, and thus proposed instead a specific method for SNPs using similar coalescent simulations. These 2 methods, however, assume an island model of migration, thus another approach based on a hierarchical "Bayesian" model was applied (Beaumont and Balding 2004Go), in order to accommodate the common situation of heterogeneous migration rates among natural populations. We demonstrated significant outlier patterns at 5 genes potentially involved in different mechanisms of drought stress response, which can be interpreted in terms of divergent or homogenizing selection, variation in migration rates, and population isolation.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
Populations' Sampling and Climatic Data
Seeds were collected from 20 to 30 trees in 24 populations from France, Spain, Morocco, and Tunisia, which are representative of the natural fragmented distribution of P. pinaster (fig. 1). Among them, 10 populations were chosen for DNA sequencing due to their contrasting climates, as described by a set of 6 variables based on observations for 23 years for temperature and 37 years for rainfall (except for northern African populations, table 1). Each of these variables was chosen so that it explained less than 30% of variation of any other, based on Pearson correlation coefficient (table 1). The annual total mean rainfall (ANMR) ranges from 348 mm for Oria (Spain) to 1343 mm for Pineta (Corsica). A total rainfall of the driest month (RDRYM) of 2 mm for Oria and 3 mm for Pinia (Corsica) illustrates more severe summer drought periods for these populations (compared with an average of 11.5 mm for the others). The variation in ANMT between populations is consistent with climates along their latitudinal range: from averages of 11.6° for Pleucadec (North–West of France) to 17.2° and 17.9° for Tamrabta (Morroco) and Tabarka (Tunisia), respectively. The Pineta population in Corsica shows a larger variance of ANMT across the years compared with other populations (between 3 and 8 times higher). Finally, the mean temperatures of the coldest and warmest months (MTCM and MTWM) are, respectively, lower for Coca and Oria and higher for Arenas de San Pedro ("Arenas" hereafter) and San Cipriano than for other populations (table 1).


Figure 1
View larger version (26K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Distribution of Pinus pinaster (gray area, after Burban and Petit [2003]Go). Ten native populations chosen for sequencing on the basis of contrasted climatic conditions are indicated by an arrow.

 

View this table:
[in this window]
[in a new window]

 
Table 1 Geographical Coordinates and Climatic Data for 10 Contrasted Pinus pinaster Populations

 
Candidate Gene Selection, Sequencing, and Polymorphisms Detection
A set of 13 candidate genes (table 2) were chosen considering previous gene expression studies using various drought stress treatments in maritime pine and other conifer species (Chang et al. 1996Go; Padmanabhan et al. 1997Go; Costa et al. 1998Go; Dubos and Plomion 2003Go; Chaumeil 2006Go), nucleotide diversity analyses in related species (González-Martínez et al. 2006Go), and also for their putative role during drought stress responses: 1) cell wall reinforcement for 2 lignification genes, CCoAOMT and COMT, 2) maintenance of osmotic balance under dehydration conditions related to sugar metabolism for Glucan and Ino3, 3) protection of cytoplasmic structures for dhn-1 and dhn-2, 4) signal transduction processes induced by stress conditions for pp2c, and 5) degradation of proteins damaged by drought effects for rd21A.


View this table:
[in this window]
[in a new window]

 
Table 2 Description of the 13 Candidate Genes

 
Publicly available sequences were blasted in gene databases such as National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/BLAST/) and The Institute for Genomic Research (TIGR) (http://compbio.dfci.harvard.edu/tgi/cgi-bin/tgi/gimain.pl?gudb=pine) to search for homologs in P. pinaster or orthologs in close species (Pinus taeda, Pinus radiata, and P. sylvestris) (table 2). After alignment with Bioedit v. 7.0.5 (http://www.mbio.ncsu.edu/BioEdit/bioedit.html) to get an overview of gene structure and prediction of in silico polymorphisms, primers were designed, using both Primer 3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi) and OligoCalc (http://www.basic.northwestern.edu/biotools/OligoCalc.html) for polymerase chain reaction (PCR) amplification of candidate genes fragments with an expected length of 500 to 900 base pairs (bp) (supplementary table S1, Supplementary Material online). These fragments preferentially included the 3' untranslated regions (UTRs) to avoid amplification of different multigene family members or covered (for short length genes such as dehydrins) the full length of the gene.

DNA was extracted from haploid tissue of megagametophyte seeds as described by Plomion et al. (1995)Go for a discovery panel of 5 to 31 individuals in each of the 10 contrasted populations. Purified PCR products were sequenced using either the DYEnamic ET Terminator kit (Amersham Biosciences Inc., Uppsala, Sweden) or the BigDye Terminator v 3.1 Cycle Sequencing kit (Applied Biosystems, Foster City, CA) and separated using capillary electrophoresis (MegaBACE 1000, Amersham Biosciences Inc., or ABI 3130 XL, ABI 3700, and ABI 3730, Applied Biosystems). Base calling was performed from the raw chromatograms using Sequence Analyser (Amersham Biosciences Inc.) or Seqscape v. 2.5 (Applied Biosystems). Sequence alignments and quality attribution were processed with either CodonCode Aligner v. 1.5.1 (Codon Code Corporation, Dedham, MA) or Chromas Pro ver. 2.31 (Technelysium Pty Ltd, Tewantin, Queensland, Australia). All detected polymorphisms (SNPs and insertion–deletions [indels]) were visually checked in CodonCode and further validated using Phred scores (quality threshold) above 20.

Nuclear Microsatellites (nuSSRs) Genotyping and Diversity Estimates
Putatively neutral nuSSRs were used to account for genetic differentiation caused by demographic and other processes not related with selection (e.g., genetic drift resulting from geographic isolation or population expansion); thus, no detailed analysis of nuSSRs diversity is presented here. Rather, their use was for identifying homogeneous gene pools and computation of genetic differentiation estimates that could represent a neutral reference. Eight polymorphic nuSSRs were selected from those previously developed by Chagné et al. (2004)Go: NZPR413, NZPR1078, ctg64; Mariette et al. (2001)Go: ctg275, FRPP91, FRPP94, ITPH4516, and Guevara et al. (2005)Go: A6F03. Criteria of choice included their localization to different linkage groups, their allelic richness (at least 4 alleles), and their potential for multiplex amplification. Genotyping was performed on genomic DNA isolated from needles from the complete set of 24 sampled populations (using 20–30 individuals per population and primer pairs as described in supplementary table S2 [Supplementary Material online]). Genetic diversity and inbreeding coefficient (FIS) were estimated as detailed in supplementary tables and figures (Supplementary Material online) : mean diversity estimates were consistent with previously reported values in either pedigrees or different samples of natural provenances (Mariette et al. 2001Go; Derory et al. 2002Go; Chagné et al. 2004Go; Guevara et al. 2005Go) (supplementary table S2, Supplementary Material online). FIS values were close to zero (mean = 0.07 across the 24 populations) except for FRPP91 and ITPH4516, which presented significant heterozygote deficits within several populations (supplementary table S2, Supplementary Material online). Populations were clustered using 2 different methods in order to define homogeneous gene pools: one was based on the pairwise FST matrix among populations (table 3) and the other was based on the Bayesian algorithm proposed in Structure v. 2.0 (Pritchard et al. 2000Go). More details on the methods are given in supplementary tables and figures (Supplementary Material online). Both clustering methods gave consistent outputs and allowed the identification of 6 groups (table 4 and supplementary figs. S1 and S2 [Supplementary Material online]): Corsica (group C), continental France (group F), northwestern and central Spain (group S) and 3 clusters with single populations: Oria (southern Spain, group O), Tamrabta (Morocco, group M), and Tabarka (Tunisia, group T).


View this table:
[in this window]
[in a new window]

 
Table 3 Matrix of Pairwise FST Averaged across 8 Nuclear SSRs between the 10 Populations

 

View this table:
[in this window]
[in a new window]

 
Table 4 Matrix of Pairwise FST Averaged across 8 Nuclear SSRs between the 6 Homogeneous Gene Pools

 
Candidate Genes and SNPs Statistical Analyses
Molecular Diversity and Differentiation
For each candidate gene, nucleotide diversity was estimated with both {theta}{pi} (Nei 1987Go), based on the average number of pairwise differences between sequences, and {theta}S (Watterson 1975Go), based on the number of segregating sites, on the total number of polymorphic sites, nonsynonymous and silent sites (including synonymous sites in coding regions and sites in noncoding regions). Both SNPs and indels were included in diversity estimates, with indels in coding regions classified as nonsynonymous sites. Haplotype diversity was estimated using Hd statistics of Nei (1987)Go. Diversity estimates were computed in Arlequin v. 3.01 (Excoffier et al. 2005Go), which allows for missing data, and DNAsp v. 4.10 (Rozas et al. 2003Go), which does not. Additionally, the number of small linkage disequilibrium blocks (SLDB), defined as contiguous sets of nucleotides in complete LD that included at least 2 polymorphisms, was described for each gene, with their mean length and mean number of SNPs. Possible departures of {theta}{Pi} values from those expected under neutrality were tested using coalescent simulations under both no and free recombination in Proseq v. 2.91 (Filatov 2002Go), which allows for subdivided populations (10 demes were used). Molecular genetic differentiation was then assessed among the 10 populations for each candidate gene at both haplotype and SNP levels, with an analysis of molecular variance (AMOVA) on the distance (number of differences) matrix between "haplotypes" or sequence pairs, using Arlequin. In the AMOVA analysis, the among-population component of total variation represents the nucleotide differentiation estimate NST (Nei and Kumar 2000Go) that accounts for genetic relatedness between haplotypes in a general framework allowing for missing data. Classical Weir and Cockerham (1984)Go FST parameters were also estimated along the genes for each polymorphic site having a minor allele frequency (MAF) above 5% across populations, and also for each gene by considering each haplotype as an independent allele (after excluding missing data).

Neutrality Tests Based on Genetic Differentiation
To search for possible signatures of natural selection, we applied a first approach (implemented in "Fdist2," Beaumont and Nichols 1996Go), that derives the expected neutral distribution of FST values for many loci conditional on their heterozygosity (and thus accounting for variation in allele frequency), using coalescent simulations under a symmetrical island migration model and assuming migration-drift equilibrium. We will refer to it as the Fdist2 method. Around 40,000 independent loci were generated for 100 demes (as an approximation to the infinite island model), using the mean FST values observed in nuSSRs as the neutral expectation. The correspondence between simulated and observed FST means was checked with a precision of +/– 0.01%. We considered a 2-sided test at an {alpha}% significance level ({alpha} up to 10% not to be too conservative) for detecting either signatures of diversifying selection by identifying outliers showing FST values greater than the (1–1/2{alpha})% quantile of the simulated null distribution, or for detecting balancing, or homogenizing selection from outliers with FST values lower than the 1/2{alpha}% quantile of the distribution. The 1/2{alpha}% and (1–1/2{alpha})% quantiles were computed following Beaumont and Nichols (1996)Go. Infinite allele and stepwise mutation models (IAMs and SMMs, respectively) were considered to be well suited to candidate genes (assuming independent haplotypes) and nuSSRs (which were also tested for neutrality using this approach), respectively. Both models produced very similar distributions for our experimental sample sizes (10–80 gametes) but were not suited to SNPs, mostly biallelic, for which we designed a method based on coalescent simulations that were performed with Simcoal2 ver. 2.1.2 (Laval and Excoffier 2004Go). In this method, biallelic SNPs and a similar island migration model to that used in Fdist2 were simulated. We will refer to this as the "FstSNP" method. We calculated the 1/2{alpha}% and (1–1/2{alpha})% quantiles for 11 discrete classes of heterozygosity, each covering a heterozygosity range of 0.05, using the R package (http://www.r-project.org/). Because different numbers of haploid sequences were available for different genes, simulations were applied to: 1) a first group of genes (PR-AGP4, CCoAOMT, GRP3, Glucan, and dhn-1) for which the sample size ranged from 9 to 14 gametes per population (whether using Fdist2 without missing data in sequences or FstSNP allowing for missing data) and 2) a second group (erd3, dhn-2, lp3-1, lp3-3, pp2c, and rd21A) for which the sample size was around 6 gametes per population. Loci (genes and SNPs) with FST values falling outside the 1/2{alpha}% or the (1–1/2{alpha})% quantiles of the envelope boundaries were considered as significant outliers. SNPs in complete LD were excluded from the analyses to limit redundant information, as well as SNPs with a MAF below 5%. To account for multiple testing in the 2 frequentist methods (Fdist2 and FstSNP), adjusted P values were first computed using the Bonferroni procedure where the family-wise error rate (FWER) is divided by the number of tests performed (Sokal and Rohlf 1995Go). This procedure only controls the chance of making even a single type I error and is recognized to be overly conservative as the overall interpretation might be greatly affected by a large number of type II errors (Verhoeven et al. 2005Go). A recent method based on the false discovery rate (FDR) allows instead to control the FDR (expected proportion of false positives among all significant tests) under a certain level, giving individual q values measures of significance (Storey and Tibshirani 2003Go). This method was shown to be more powerful than methods controlling the FWER and has thus been applied using the Qvalue R library (http://faculty.washington.edu/~jstorey/qvalue/).

To account for likely departures from a symmetrical migration model in our sampling of populations (see, e.g., González-Martínez et al. 2007Go for Oria population), a third approach was also used. This approach was based on a hierarchical Bayesian model and implemented via Markov Chain Monte Carlo (MCMC) simulations (Beaumont and Balding 2004Go). The underlying method for estimating differentiation allows for heterogeneous migration rates among populations, which could have resulted from reduced immigration in some isolated populations, and is thus robust to a number of demographic models in structured populations (Beaumont 2005). After having expressed the likelihood for the allele counts as a function of FSTij, defined for each locus i and population j, the method consists in regressing log[FSTij/(1–FSTij)] onto a locus effect ({alpha}i) and a population effect (βj) (interaction effects [{lambda}ij] were first tested and found not significant in the full data set), which posterior distributions are obtained from approximately 2,000 uncorrelated outputs of the total MCMC outputs (including at least 14,000 iterations [up to 22,000] and at least 27,0000 iterations [up to 450,000], respectively, for the burn-in and post-burn-in periods). The mean and standard deviation of prior distributions were set by default at 0 and 1 for {alpha}i, and –2 (in order to have FSTij as a growing function of βj) and 1 for βj. For each locus, {alpha}i was said to be significant at a P value when its 1-tailed 100[1–(P value)]% posterior interval excluded zero (Beaumont and Balding 2004Go). Significant and positive values of {alpha}i can then be interpreted as diversifying selection among populations, whereas negative values would be indicative of spatially homogenizing selection (e.g., balancing selection in each population). Similarly, a population effect was considered to be significant if 100[1–(P value)]% of its posterior distribution was shifted from the prior mean. A positive (respectively negative) outlier could be due to a population showing less gene flow (respectively more gene flow) comparing with others or to one with a smaller effective population size Ne (respectively larger Ne). Compared with the previous approach, loci corresponding to different sample sizes could be analyzed simultaneously because the method accounts for sample sizes variation at each locus. Different values of standard deviations were tested for the {alpha}i prior, but this did not affect the results much, suggesting that the data were sufficiently informative to produce {alpha}i robust posterior distributions. All Bayesian analyses were performed using the program "BayesFst" (Balding DJ, personal communication); thus, the method is referred to as BayesFst. A specific parameter proposed in the program to account for prior information on an average correlation level between loci was also used, but results obtained with setting it to 0 or to 0.2 (for a weak overall correlations among SNPs within and between genes) were the same with our data. Additionally, the Bayesian approach presents the advantage of dealing with the multiple testing problem by accounting of how plausible the null hypothesis is through the choice of prior distributions. It will result precisely for each particular locus or SNP, in a P value that is defined as the probability that the null hypothesis is true, using probability as a direct measure of uncertainty (Shoemaker et al. 1999Go; O'Hagan and Luce 2003Go). Finally, we also performed the same analysis after having grouped "close" haplotypes belonging to the same lineages (i.e., distant from one another by a small number of mutations and no recombination) for each candidate gene. The objective here was first to increase the precision on each new haplotype frequency estimate (by reducing the haplotype number) and second to get a Bayesian estimate of differentiation for each gene that would be closer to a NST estimate due to the pooling of less distant haplotypes and thus that would better fit the original view of FST as an inbreeding coefficient along with what is proposed in the Bayesian method. An arbitrary threshold of around 10% differences (number of mutation differences between pairs of haplotypes relative to the total number of mutations) was used in a first step to pool haplotypes. This rule corresponded to a maximum of 6 mutations' differences (for PR-AGP4) and a minimum of one mutation's difference for GRP3, lp3-3, and dhn-2. Classical Weir and Cockerham's FST estimates were also computed at each gene with the newly defined haplotypes.


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
Nucleotide sequence data were obtained for 13 candidate genes spanning over 11 kb, with about half from coding regions (table 2). Among them, one gene fragment (Ino3 > 850 bp sequenced) presented only one SNP in 24 gametes and another (COMT, 1,400 bp sequenced for 20 gametes) was monomorphic. In the 11 remaining genes, 302 SNPs and 36 indels (from which 71 and 7 were singletons, respectively) were detected, representing on average one polymorphic site per 28 bp. Approximately 20% of the SNPs identified were nonsynonymous (table 5).


View this table:
[in this window]
[in a new window]

 
Table 5 Number of Identified Segregating Sites, Nucleotide and Haplotype Diversities, and SLDB at 11 Polymorphic Candidate Genes across the 10 Populations

 
Patterns of Nucleotide Polymorphism
The average nucleotide diversity, {theta}{pi}, across polymorphic fragments was 5.48 (x10–3) and varied from 1.51 (x10–3) for erd3 to 9.45 (x10–3) for rd21A (table 5). This mean value dropped to 4.63 (x10–3) when including the 2 monomorphic genes. Based on 95% CIs built using the variance formula provided by Tajima (1983) and assuming normal distributions, diversity estimates at all genes largely overlapped (data not shown), thus even {theta}{pi} values from genes at range extremes could not be considered as significantly different. Nucleotide diversity at silent sites was higher (between 2- and 8-fold) than diversity at nonsynonymous sites for 7 genes out of the 9 where the comparison was possible. This general trend is consistent with what is expected in functional regions because amino acid changes are likely to be constrained by background selection of variable intensity depending on the region (Li and Graur 1991Go). Remarkably, 2 genes (erd3 and dhn-1) showed similar values for both types of diversity, which was due to 2 nonsynonymous SNPs with relatively high MAF (~17–20%) in erd3 and a high proportion (11 out of 21) of nonsynonymous polymorphic sites in dhn-1. The average of {theta}S was slightly higher (6.24 x10–3) than {theta}{pi}: values ranged from 3.57 (x10–3) for erd3 to 11.11 (x10–3) for rd21A, with corresponding 95% CIs also overlapping. Significant spatial genetic structure associated with the current fragmented distribution of maritime pine has been reported for various marker systems (Petit et al. 1995Go; Mariette et al. 2002Go; Burban and Petit 2003Go; Bucci et al. 2007Go). As a consequence, classical neutrality tests based on allelic frequency spectrum were not applied to the overall sample of data because similar patterns could arise from both various demographic and natural selection effects and could lead to incorrect data interpretations (Hein et al. 2004Go). Instead, given the large variation observed for {theta}{pi} and {theta}s values among genes, possible nucleotide diversity departures from values expected under neutrality were tested with coalescent simulations assuming an island model in subdivided populations and no recombination. Five genes out of 10 presented values that were significantly higher than expected: Glucan, dhn-1, and lp3-1 for {theta}{pi} and PR-AGP4 and dhn-2 for {theta}S. Haplotype diversity estimates (Hd) among genes ranged from 0.498 for erd3 to 0.874 for lp3-1 with a mean value of 0.75 (table 5). The high values of both Hd and {theta}{pi} for lp3-1 can be explained by the presence of a nuSSR in the sequenced part of the 3' UTR, a case that is usually associated with a higher mutation rate (Brohede and Ellegren 1999Go). Excluding the microsatellite region, {theta}{pi} dropped to 6.19 (x10–3) for lp3-1 and was no longer an outlier. Overall, the number of contiguous sets of SNPs in complete LD was very low and their size very small (table 5). This is consistent with previous conifer studies (González-Martínez et al. 2006Go; Heuertz et al. 2006Go) and illustrates the variation and generally very rapid LD decay along single genes, (see also supplementary fig. S3A and B [Supplementary Material online] for LD estimates among polymorphic SNPs across genes in all populations). Comparing genes, however, lower values of Hd, and thus stronger haplotype structure, were observed for 3 of them (PR-AGP4, Glucan, and erd3), consistently with their higher mean length of SLDB, ranging from 46 bp for PR-AGP4 to 78 bp for glucan (table 5). The opposite trend was observed in 2 genes (GRP3 and lp3-1). These patterns could be due in part to the effects of genetic differentiation among populations, but all genes were not affected in a similar manner; thus, other forces might be acting.

Genetic Differentiation at Control Loci and Candidate Genes
Throughout the text, FST refers to the classical estimate of Weir and Cockerham (1984), considering either the haplotype (or allelic) level for each locus (gene or nuSSR) or biallelic SNP loci, whereas NST refers to the estimate of Nei and Kumar (2000) only at the haplotype level (see Materials and Methods).

For the control loci (8 nuSSRs), the mean FST value observed for the 10 populations was 0.15, with NZPR413 and NZPR1078 showing the largest and lowest estimates, respectively (supplementary table S2, Supplementary Material online). Compared with estimates observed across the full set of 24 populations, the FST range and average were slightly inflated across the 10 populations selected for candidate gene sequencing (supplementary table S2, Supplementary Material online). However, FST values among the 6 homogeneous gene pools were very similar, whether including all 24 populations or only the 10 selected populations (table 4 and supplementary table S3B [Supplementary Material online]), confirming that the genetic structure in this contrasted subset was representative of the whole species range.

For the candidate genes, the mean FST value for the 10 populations was similar to that for nuSSRs (~0.14, table 6), although a larger range was observed (from 0.002 for dhn-2 to 0.33 for lp3-3, table 6), in particular toward the lower values. A large variation was also found for FST values at polymorphic SNPs (with MAF >5%) along each gene, the highest range being observed for PR-AGP4 (from 0.01 to 0.64) and erd3 (from –0.07 to 0.59). Comparing different estimates of genetic differentiation at nuSSRs, the mean FST value was comparable with the mean RST value (defined in supplementary Materials and methods, Supplementary Material online) estimated among the 6 gene pools but lower among the 10 populations (mean RST of 0.22 compared with mean FST of 0.15, supplementary table S2 [Supplementary Material online]). At candidate genes, no strong difference was observed between FST and NST mean values across the 10 populations. However, the FST value was half the NST value for GRP3 (0.16 compared with 0.27 in table 6), and this difference was significant at P < 0.006 using an allele permutation test implemented in the SPAGeDi software v. 1.2 (Hardy and Vekemans 2002Go). This means that a phylogeographical pattern exists for this gene due to a correspondence between the phylogeny of haplotypes and their geographical distribution (see also supplementary fig. S4, Supplementary Material online).


View this table:
[in this window]
[in a new window]

 
Table 6 Molecular Differentiation Estimates for 11 Candidate Genes across the 10 Populations (excluding missing data)

 
Outlier Detection
All analyses were performed first considering either the 6 homogeneous gene pools defined from nuSSRs data or the 10 populations separately. The results obtained were very similar, in terms of proportions and nature of outliers, whether at the haplotype or SNPs levels. Despite a drop in sample sizes of 25–50% from the 6 gene pools to the 10 population sampling, we did not observe any apparent loss in power for outlier detection across the 3 methods, conversely to what we had assumed initially. In some cases, outliers were even masked by the clustering of populations. Therefore, only the 10 populations analysis is presented here, and results using the 6-groups clustering are given as supplementary tables S4 and S5 (Supplementary Material online).

Following the Fdist2 approach, one nuSSR, NZPR413, was a clear outlier above the 95% quantile upper bound (supplementary fig. S5B [Supplementary Material online], sample size of 48 gametes). The neutral envelopes were thus simulated again after excluding this marker (using the new FST mean value of 0.138). All remaining FST values for nuSSRs fell within the 5% and 95% envelope bounds (fig. 2) and showed a smaller range of mean within-population diversity values (HS) than that of candidate genes, especially because of the high HS values for lp3-1 and dhn-2 (fig. 2).


Figure 2
View larger version (20K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— Distribution of observed FST values for each gene, for SNPs within genes, and for nuSSRs, as a function of their mean heterozygosities (HS), across the 10 populations that constitute the sequencing panel (FST for simulations of 0.138). The 5% and 95% quantiles of the simulated neutral envelopes are represented by continuous and dotted lines (corresponding to a 2-tailed global test at 10%), respectively for the Fdist2 and FstSNP methods. Black lines represent the boundaries of envelopes for the first set of genes (with an average of 14 gametes for SNPs and 9 gametes for genes excluding missing data), and gray lines those for the second set of genes (with 6 gametes for both SNPs and genes on average). Outlier genes detected with Fdist2 are in bold, and outlier SNPs detected with FstSNP are indicated by arrows.

 
At the haplotype level, the 2 dehydrin genes dhn1 and dhn-2, presented, out of the 11 genes analyzed, lower FST estimates than the 5% lower bound (fig. 2 and table 7). When computing q values analogs to P values, as if in a multiple testing framework controlling for the FDR while randomly repeating the same experiment, dhn-1 was still significant at 9%. Using BayesFst, these negative outlier patterns were supported along with that of lp3-1 at 5% based on the posterior distributions of locus-effect parameters {alpha}i, and using either raw data or grouping close haplotypes (respectively "H" or "HG" analyses in columns 5 and 8 of table 7, see also {alpha}i posterior distributions for detected outliers in supplementary fig. S6 [Supplementary Material online]). Three additional outliers were also identified with higher differentiation values than expected either with the H or HG analyses: PR-AGP4 and erd3 at 4% and GRP3 at 9% (table 7, positive values of {alpha}i).


View this table:
[in this window]
[in a new window]

 
Table 7 Outliers Detected at the Haplotype Level across 10 Contrasted Populations, Using the Fdist2, FstSNP, or BayesFst Methods

 
Envelopes of the FST distribution were also simulated specifically for SNPs with the FstSNP method. For HS values ranging from 0.1 to 0.4 (maximum HS value of 0.5 for biallelic markers), wider neutral expected areas were produced (due essentially to higher upper bounds) than those defined with the Fdist2 method (fig. 2). The observed variance of simulated FST values is thus larger toward lower heterozygosity corresponding to alleles closer to fixation. Generally, across both FstSNP and BayesFst, outlier patterns at the SNP level were consistent with those observed at the haplotype level (PR-AGP4, erd3, dhn-1, and dhn-2, tables 7 and 8), but distinct outlier patterns or stronger outliers were also found at others (CCoAOMT and GRP3). This result would be expected in a situation where many SNPs are not in complete LD, some being independent even at short distances within genes (supplementary fig. S3A and B, Supplementary Material online). Out of 94 nonredundant SNPs with MAF above 5%, 15 were found as outliers using significance levels of 10%, but only 2 positive outliers were detected by both FstSNP and BayesFst methods, and none of the 8 outliers detected with FstSNP remained significant after correction for multiple testing using q values (table 7). It should be noted also that with the BayesFst method, only positive outlier SNPs were detected but none with negative {alpha}i estimates (even at 10%), comparing with the FstSNP method for which more outliers identified fell below the simulated bounds. For PR-AGP4, about half of the SNPs (12 out of 20) were detected as positive outliers (FST values higher than expected P values ranging from 4% to 9%), consolidating the pattern observed at the haplotype level. Interestingly, most were located in the first large intron or in the 3' UTR region (fig. 3), whereas the one nonsynonymous mutation was not detected as a significant outlier. Among those 12 outlier SNPs, all except SNP S525 were correlated (r2 varying from 0.5 to 0.9), although not in complete LD. For GRP3, which was an outlier at the gene level at 9% with BayesFst, only one synonymous SNP (out of 6) was a positive outlier with BayesFst (S164, table 8), whereas the nonsynonymous mutation with an FST of around 30% (S110, table 8) was not significant with either method (FstSNP or BayesFst). In contrast, the one nonsynonymous SNP for erd3 (S42, table 8) was the only outlier (out of 4). For negative outlier SNPs, based on P values, the dhn1 and dhn-2 genes detected at the haplotype level also had a few SNPs having FST values lower than expected with the FstSNP method (table 8), one nonsynonymous SNP from dhn-1 being the strongest (S725, table 8). Although not detected as an outlier at the haplotype level, CCoAOMT presented 2 strong negative SNP outliers, one nonsynonymous (I832, table 8), which were in LD (r2 = 0.8). No correlation was observed among SNPs from different genes. For all detected outliers with BayesFst, whether at the gene or SNP levels, the standard deviations of the {alpha}i posterior distributions were all reduced and comprised between 0.5 and 0.7, comparing with the prior distribution values of one.


View this table:
[in this window]
[in a new window]

 
Table 8 Outliers Detected at the SNP Level for SNPs Having a Heterozygosity Greater Than 0.1 across 10 Contrasted Populations, Using the Fdist2, FstSNP, or BayesFst Methods

 

Figure 3
View larger version (19K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 3.— Schematic representation of PR-AGP4 haplotype structure (singletons excluded) and occurrence of each principal haplotype in the different gene pools (C: Corsica, F: continental France, S: northwestern and central Spain, O: Oria, M: Tamrabta, T: Tabarka). Each base is represented by a different color A (black), T (dark gray), C (light gray), G (white), deletion (-). Exons are represented by boxes and noncoding regions (5' UTR, 3' UTR, and intronic regions) by a line. SNP positions detected as outliers at 10 % with FstSNP or BayesFst methods are indicated by arrows. A nonsynonymous SNP not detected as an outlier but showing a FST value of 0.38 among the 6 gene pools is indicated in brackets.

 
Contribution of Populations to Outlier Patterns
Another interest of the Bayesian method used here was the possibility to account for heterogeneity in migration rates by estimating and testing population effects (βj) to determine if they were significantly different from prior mean values. Significant population effects would be indicative of distinct migration rates from other populations, different effective population sizes, or particular mating patterns. Using data from the 10 populations separately, βj values were significantly higher than expected (at 5%) for Tamrabta (Morocco), Tabarka (Tunisia), and Pleucadec (northern population in France), whereas they were significantly lower and negative for Oria and Coca (Spain) (fig. 4). These results illustrate the overall contribution of populations to outlier patterns, with the Moroccan and Tunisian populations generally showing the strongest differentiation from the others and from each other at nuSSRs (table 3), and across most genes (supplementary table S6, Supplementary Material online). These contributions were observed especially for PR-AGP4, erd3, and GRP3, which exhibit the strongest pairwise FST values with other populations (supplementary table S6A, B, and D, Supplementary Material online), with values often above 0.4. It is remarkable than even for negative gene outliers with FST values close to zero overall, Pleucadec and Tabarka populations can show substantial pairwise differentiation with other populations (e.g., between Tabarka and other populations for dhn-1 [supplementary table S6H, Supplementary Material online]; between Pleucadec and 4 other populations for pp2c [supplementary table S6G, Supplementary Material online]; and between Tamrabta and Arenas, Pleucadec and Arenas or Tabarka for dhn-2 [supplementary table S6E, Supplementary Material online]). The variation in βj parameter estimates also confirmed the contrasted patterns of immigration rates in the different populations, indicating that Tabarka, Tamrabta, and Pleucadec might have been more isolated, a result consistent with their geographic locations. Conversely, the Coca and Oria Spanish populations had negative and significant βj values, which illustrate their contribution to the overall negative outlier patterns (fig. 4). This could be due either to higher immigration rates from other populations or to their having larger effective population sizes. It is worth noting that for PR-AGP4, erd3, and GRP3, which are positive gene outliers, Oria showed pairwise FST values close to zero with either Tamrabta or Tabarka, which are also among the positive outliers for the population effect (supplementary tables S6A, B, and D, Supplementary Material online). This could thus be consistent with more exchanges (past or present) from these populations and Oria, despite their higher isolation to other populations.


Figure 4
View larger version (6K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 4.— Mode values of the posterior distributions of βj for each population along a latitudinal gradient. Significance levels for outlier populations are indicated in brackets. The dotted line represents the value of the prior mean βj distribution set at –2 in the BayesFst method.

 

    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
Outlier Patterns Detected in Drought-Response Candidate Genes
This work reports nucleotide diversity and differentiation estimates for 11 polymorphic candidate genes for drought stress tolerance in contrasted, rangewide P. pinaster population samples. Using a multilocus scan of differentiation based on haploid sequence data, we compared 3 different methods that aimed to detect outliers from simulated neutral expectations: 1) the Beaumont and Nichols (1996)Go method (referred to as Fdist2 method), 2) an extension of this approach that we specifically developed for biallelic SNPs (the FstSNP method), and 3) a Bayesian method (Beaumont and Balding 2004Go), referred to as the BayesFst method. Outliers were identified at the haplotype level for 5 genes using a 5% threshold, which were robust across methods for 2 of them (dhn1 and dhn2). Two genes presented a higher differentiation (FST values) than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among homogeneous gene pools and populations. In contrast, 3 genes presented a lower genetic differentiation than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations and/or balancing selection within populations. Outlier patterns at the SNP level were consistent with those observed at the haplotype level for the 5 outliers identified, but others like CCoAOMT presented distinct patterns. Even if several nonsynonymous sites (for erd3, dhn1, or CcoAOMT) might be of potential interest by themselves, particular combinations of SNPs might better represent candidate genes functional importance. Overall, we detected a higher proportion of outliers at the haplotype level (45% i.e., 5 out of 11) compared with that at the SNP level (6% and 4% for positive and negative outliers, respectively, and 11% in total out of 94 informative SNPs at P < 5% combining results from the FstSNP and the BayesFst methods). This points out that particular combinations of sites in haplotypes can have a greater functional significance than single sites, but it is not possible to conclude on which test, either haplotype-based or SNP-based, is more powerful for outlier detection. The power of haplotype-based method could, for example, depends on the structure of LD existing among the SNPs. In addition, nonsynonymous mutations may not alone harbor higher functional importance than those in nontranslated regions because other SNPs located in promoter or intronic regions can also be affected by selection (Shabalina et al. 2001Go). In this study, the proportion of outlier SNPs among those that were nonsynonymous and those that were silent was similar (22% and 15%, respectively).

The proportion of outliers reported here at the haplotype level was greater than those reported in previous published surveys using FST-based outlier detection methods for studying adaptive divergence, in which no more than 6% of outliers were detected (Wilding et al. 2001Go; Campbell and Bernatchez 2004Go; Acheré et al. 2005Go; Bonin et al. 2006Go). The main difference with our study is that the cited surveys focused on anonymous markers (mostly Amplified fragment length Polymorphisms [AFLPs]), instead of candidate genes. In recent genome scans involving gene-associated markers, the frequency of outliers identified was indeed higher than at anonymous markers: 12% of 78 expressed sequence tag-based microsatellites in Vasemägi et al. (2005)Go and 21% of SNPs from expressional candidate genes for abiotic stress in Scotti-Saintagne et al. (2004)Go, compared with 9% of outliers identified on average for other marker types (genomic nuSSRs and AFLPs). Focusing directly on strong functional and expressional candidate genes in our study might have increased the likelihood of detecting outliers, providing further empirical support for the candidate gene approach.

Although the observation of both positive and negative gene outlier patterns seems more consistent with locus-specific effects due to different types of selection, rather than with demographic effects which would have affected the genome in a more homogenous manner, it is necessary to examine how these demographic factors have been considered in the different models used. Both population sizes and migration rates are accounted for in the metapopulation model underlying BayesFst, and both Fdist2 and FstSNP are conservative in the sense that they define rather large "neutral envelopes" likely to encompass much of the variation associated with demographic and sampling effects. Moreover, even for low Hs values, the lower expected distribution with Fdist2 was shown to be very robust to a mixed colonization model with different numbers of founders (Beaumont and Nichols 1996Go). For the 3 negative outliers identified in this study (dhn-1, dhn-2, and lp3-1), no clinal variation was observed overall and no differentiation was detected between populations pairs, except between Tamrabta/Pleucadec with Arenas and Pleucadec with Tabarka (supplementary tables S6E, F, and H, Supplementary Material online) for dhn-2 due to differences in particular haplotypes, between Tabarka and other populations for dhn-1 (but with much lower values [around 0.2] than for positive outliers) due to one missing haplotype in Tabarka, and between Oria and several populations for lp3-1. Otherwise, we observe the maintenance of most haplotypes for these 3 genes in all populations, even for those separated by large geographic distances (cf. most pairwise values below zero in supplementary tables S6 [Supplementary Material online], in contrast with much stronger pairwise FST values at nuSSRs [table 3]). Thus, from the general assumption that the effects at these 3 genes could be at least partly due to homogenizing selection, a few mechanisms can be invoked. First, balancing selection at the molecular level favoring the maintenance of alleles at intermediate frequencies could result from the strong heterogeneity of environmental conditions across years with recurrent events of more severe drought periods. Balanced polymorphisms at genes involved in adaptive traits may thus be a consequence of this large temporal heterogeneity that would alternatively favor one haplotype over another, one being more advantageous during events of severe drought for example. Such interannual variability in climate was proposed by Jump et al. (2006)Go to interpret the patterns of balanced polymorphisms at a putative locus linked to temperature effects in Fagus Sylvatica. This heterogeneity may also be combined with spatial heterogeneity (various levels or trophic and edaphic conditions) leading to disruptive selection within stands and increasing allelic diversity at genes involved in the trait variation (Rueffler et al. 2006Go). One way to test among alternative hypotheses would be to collect genotypic data and estimate FIS coefficients to see whether heterozygotes are or are not in excess in natural populations for these particular genes. It will be interesting also in future association studies to see whether effects on phenotypic variation are observed for specific alleles, particular combinations of alleles at one locus (in heterozygous state) or at several loci interacting and involved in the adaptive traits targeted.

Factors Affecting the Efficiency of Detecting Selection in the Different Methods Tested
All methods used were based on coalescent simulations, but each presents its own set of hypotheses and limitations and raises a number of important issues related to robustness to low sample sizes and departures from demographic and mutation model assumptions, to their power to detect a particular type of selection, and to the levels of experimental differentiation considered as neutral.

Beaumont and Nichols (1996)Go showed, by simulation, that the joint distributions of FST and heterozygosity (HS) were robust to variation in sample size when above 50 haploid individuals. However, even if FST distributions were broader for sample sizes below 20, they were still informative. In our study, the drop in sample sizes from 9 to 6 gametes in the Fdist2 method (corresponding to the 2 sets of genes studied) or from 14 to 6 gametes in the FstSNP method, affected the 90% upper boundary of the neutral simulated envelope by around 5 units of FST values (in %) on average for HS lower than 0.5 and a bit less for higher HS (fig. 2). Although this did not greatly affect our results (only erd3 might have been a stronger outlier with bigger sample size at the haplotype level with Fdist2), we can envisage that for genomic scans of larger regions or loci, small sample sizes could miss outliers; therefore, we recommend a minimum of 20 haploids for such studies.

Our analysis also shows that the effect of the mutation model in coalescent simulations was important, and not entirely accounted for by the Fdist2 method, justifying the use of simulations that explicitly model biallelic SNPs (FstSNP method). For the range of HS values observed at biallelic loci, the simulated neutral envelope was far broader (especially the upper bound for HS below 0.25) than when using an IAM or a SMM (fig. 2). This is consistent with the Beaumont and Nichols (1996)Go test of sensitivity to mutation rates, which already showed that for lower mutation rates, and HS values below 0.2, upper boundaries were inflated, but not as much as what we observed using the FstSNP simulations. Thus, a larger variance of FST values for biallelic markers with low diversity HS can result in reduced power for detecting positive outliers. Comparing positive SNP outliers detected with either BayesFst or FstSNP methods, P values lower than 5% were observed in more than half the cases with BayesFst, whereas only 2 outliers were significant at 10% with FstSNP (tables 7 and 8). This highlights the reasonable discrimination power of the Bayesian method, even for low sample sizes, for detecting directionally selected loci, as noted in Beaumont and Balding (2004)Go simulations. However, the same authors also showed, with a mean FST of 0.1, the low power of their method for detecting negative outliers in the biallelic case, except for the stronger selection coefficients tested (~10%). In our study with a mean FST of around 0.14, none of the 4 negative SNP outliers detected with FstSNP were also identified with BayesFst. At the haplotype level, however, the Bayesian method was shown to give more precise FST estimates as the number of alleles increased from 2 to 8 (Balding 2003Go), which might lead to greater discriminating power in our case for the haplotype or gene level versus the SNP level. This is illustrated in the example data set used by Beaumont and Balding (2004)Go, which led to identifying 5 multiallelic loci under balancing selection for a mean FST of 0.17 that were not detected by the Fdist2 method. In our study for genes with a number of haplotypes varying from 8 to 25 (table 6), 2 more positive outliers were detected with BayesFst that otherwise fell within the 10% boundaries of the Fdist2 envelope.

Another critical issue is that of multiple hypotheses testing, considering that in classical frequentist methods, a separate test is performed at each locus, so the number of false positives is bound to increase with the number of SNPs tested. Referring to the 2 frequentist methods Fdist2 and FstSNP, the proportion of outliers at the haplotype level drops to 9% with a type I error rate of 5% (18% with a risk of 10%), whereas at the SNP level, this proportion falls just above this risk. No test remained significant using FDR q values, except for dhn-1 at 9%. However, because SNPs included in our data set have not been chosen randomly but have been identified from carefully selected expressional and functional candidate genes, it seems more adequate to assess the results on a per gene basis. Doing that for candidate genes for which outlier SNPs were identified (PR-AGP4, erd3, CCoAOMT, and dhn1), the proportions of outlier SNPs are above the type I error rates (15%, 25%, 14%, and 10%, respectively, at a 5% error rate). Among all the SNPs detected as outliers, only those within PR-AGP4 were correlated with various strengths except one, as well as the 2 outliers for CcoAOMT (r2 ~ 0.08), which explains their correlated test outputs. However, no correlation was observed among the outlier SNPs from the different genes.

With respect to the Fdist2 and FstSNP methods, an advantage of the Bayesian analysis is that, through the a priori distributions, it avoids the classical problem of multiple testing encountered in frequentist-based methods (Shoemaker et al. 1999Go; O'Hagan and Luce 2003Go). Considering this difference in interpretation, and also that demography in BayesFst is accounted for in a more flexible way than in Fdist2, outliers identified in common with both methods reinforce the likelihood that these genes might have been affected by natural selection and that the multiple testing correction attempted in our case might have reduced statistical power too strongly.

Finally, the assumption of a constant FST due to homogeneous migration rates among populations might be the strongest assumption in the Beaumont and Nichols (1996)Go model because it was shown to be robust to a range of departure from the assumption of migrations-drift equilibrium. In reality, migration rates between natural populations of maritime pine are probably heterogeneous, considering the pairwise FST matrix between populations (table 3). Also, it is likely that populations isolated by geographic barriers such as Oria (González-Martínez et al. 2007Go) could have different or lower effective population sizes. The Bayesian approach is thus an interesting alternative because it accounts for heterogeneity in population size or migration rates and allows testing of population-specific effects in the regression step. This might explain differences observed in P values for positive SNP outliers detected with either BayesFst or FstSNP.

It is therefore difficult to make recommendations as to which method is more appropriate and efficient for detecting selection based on genetic differentiation, in particular because we do not know which are the "true" loci under selection. We do have, however, a preference for the Bayesian method of Beaumont and Balding (2004)Go due to its efficiency in our case despite small sample sizes, the flexibility of assumptions concerning demographic parameters and population effects, which might account to nonequilibrium situations, and the fact that it deals effectively with the problem of multiple testing. Given the probability meaning of P values, they should not be set at too low levels to avoid being overly conservative, especially in cases of small sample sizes. Its main limitation, illustrated in our case, might be its lack of power to detect negative outliers for biallelic SNPs, which calls for either much larger samples or the application of another method for comparison. The FstSNP method appears to be an interesting alternative accounting for SNPs frequency distribution and typically large FST variances.

Functional Interpretation of Candidate Genes Showing Outlier Patterns
Seeking the causes for outlier behavior will ultimately lead to functional genomics studies in your species of interest (Storz 2005Go), but information can also be obtained from homologous genes in other species. The question is whether the interpretation that we proposed in terms of past or still current selection pressures for the 5 outliers identified can be linked to their putative implications in drought responses. We try below to summarize available knowledge on these genes, acknowledging the fact that genomic studies of drought stress in plants reveal large numbers of potential candidates that ultimately constitute complex regulatory networks (Seki et al. 2001Go, 2007Go; Ramanjalu and Bartels 2002Go; Street et al. 2006Go).

Concerning the dehydrin genes potentially submitted to homogenizing selection (dhn-1 and dhn-2), which showed both an absence of lineage differentiation among populations, a high nucleotide diversity, and numerous nonsynonymous mutations for dhn-1, a large body of results over the past 10 years have shown their strong link with drought stress response providing also insights into their functional role (Dure 1993Go; Ingram and Bartels 1996Go; Bray 1997Go; Ramanjalu and Bartels 2002Go): dhn-1 and dhn2 both encode dehydrin proteins belonging to the late embryogenesis abundant (LEA) gene family, known to allow plant protective reactions against dehydration. Their highly hydrophilic nature plays a role in the maintenance of protein or membrane structure, sequestration of ions, binding of water, and thus helps to maintain the minimum cellular water requirements. In conifers, increased transcript accumulation for the Picea glauca dhn-1 homolog was observed in needle (and possibly roots) tissues after applying stress for 48 h (Richard et al. 2000Go), and in P. taeda, a dhn-2 homolog (based on BlastN searches) was overexpressed in needles under mild drought stress (Watkinson et al. 2003Go). Evidence of a Populus dhn-1 homolog overexpression under different abiotic stresses, including drought, has also been shown (Caruso et al. 2002Go). Less is known about the role of lp3-1, belonging to a gene family called ASR (ABA/water stress/ripening-induced) first described in tomato (Thompson and Corlett 1995Go), but this gene was shown to be overexpressed in stems under progressively applied water stress, in order to be closer to field conditions (Padmanabhan et al. 1997Go). This gene also showed a significant genetic association with an increase in proportion of latewood in P. taeda, although just caused by a few extreme clones (González-Martínez, Wheeler, et al. 2007Go). The consistent overexpression of these 3 genes in various experiments of drought stress or different species might be compatible with a homogenizing mode of selection action that could be linked to more or less frequent recurrent events of drought experienced in maritime pine natural populations.

The erd3 gene had the lowest diversity among all genes studied and a very high FST value for one nonsynonymous SNP that was fixed for one allele in half the populations. This pattern can be compared with what was observed by González-Martínez et al. (2006)Go in a large unstructured population of P. taeda, with the erd3 gene homolog showing also low diversity and a few nonsynonymous mutations, with an excess of rare mutations. Alternative explanations proposed in P. taeda were a selective sweep or a signature of past population expansion. In P. pinaster, there is no simple latitudinal gradient at the nonsynonymous outlier SNP or one that would simply be consistent with a higher level of drought stress on the basis of climatic variables: the same allele that is fixed in Arenas receiving a high level of rainfall is also fixed in Tabarka having much lower rainfall. No additional information was found in the literature on this gene's putative function. It was originally annotated in P. taeda on a cDNA clone from roots recovering from drought, by similarity with dehydration-responsive genes identified in Oryza sativa, and could be a methyltransferase with S-adenosyl Methionin (SAM)-binding domain (and led to the annotation of the erd3 complete coding sequence in Arabidopsis thaliana (TAIR:AT4G19120 in http://www.arabidopsis.org/).

For the other positive outlier PR-AGP4, most outlier SNPs are located in noncoding regions (intron and 3' UTR). This gene also presents a high diversity and strong haplotype structure across populations with 4 less frequent haplotypes being population private and many sites belonging to small blocks of LD (fig. 3, supplementary fig. S3 [Supplementary Material online]). If we consider separately populations from the western range (France, Spain, and Morocco) and from the eastern range (Corsica and Tunisia), SNP allelic frequencies (except S525) follow a latitudinal gradient from South to North. The latitude itself is linked to a decrease in temperature (correlations of –0.67, Pr < 5% with the annual mean rainfall ANMT, and of –0.71 with the mean temperature of the wettest month, MTWM, in supplementary table S7 [Supplementary Material online]) and to an increase in RDRYM (correlation of 0.5, not significant at 5%, in supplementary table S7 [Supplementary Material online]). The implication of this gene in drought stress response in maritime pine would be consistent with differential expression studies in both P. pinaster and P. taeda. Using osmotic stress in P. Pinaster, Dubos et al. (2003)Go revealed a strong downregulation of this gene in roots (greater than that observed in needles). In P. taeda, the homolog ptaAGP4 was identified as strongly expressed in differentiating xylem of bent trees compared with other tissues (Zhang et al. 2000Go). Moreover, from an expression study in the same species (Yang et al. 2005Go), ptaAGP4 was shown to have higher expression in one ecotype originating from a region with very low precipitation (Lost Pines in TX) compared with the ecotype from South Louisiana with wetter conditions. Interestingly, in controlled drought stress conditions, the gene was strongly repressed for both ecotypes (Yang et al. 2005Go), consistent with what has been observed with osmotic stress in French Atlantic (humid) P. pinaster populations. Although the functional role of this gene is not yet clear, drought stress has been shown to cause alterations in the chemical composition and physiological properties of the cell wall (e.g., wall extensibility) (Ingram and Bartels 1996Go). We may therefore assume a potential role for this gene in cell differentiation/elongation and growth, that would be repressed in hydroponic stress treatments, and could have evolved differentially in populations submitted to different environmental conditions. For both erd3 and PR-AGP4 candidate genes, functional genomics studies will be needed to unravel their mode of action. It might also be of interest to extend the sequencing to 5' UTR regulating regions that might contain functionally important sites for the gene expression and might have affected the diversity patterns observed. Indeed, polymorphisms localized in promoter region have been shown to be directly involved in trait variation (e.g., McGuire et al. 1994Go for disease traits in humans; Andersen et al. 2005 for the gene Dwarf8 in maize) or to have been the target of positive selection that resulted in a change of regulation (e.g., the tb1 domestication gene in maize, Wang et al. 1999Go).

Insights into Local Adaptation of Populations from Contrasted Environmental Conditions
The observation of 2 positive outliers (PR-AGP4 and erd3) among the 11 genes analyzed is consistent with the proposed interpretation that local adaptation, already demonstrated at the phenotypic level for traits like growth or water use efficiency (Alía et al. 1995Go, 1997Go; Garnier-Géré et al. unpublished results), could have led to divergent selection at genes potentially involved in adaptive traits, despite high gene flow. Local adaptation can result from many different environmental pressures (e.g., high temperature, soil fertility, and heterogeneity of rainfall), which might interact across seasons and lead not only to different responses in each population, but also to different genes being more or less strongly affected by selection for a similar response. The overall contribution of particular populations to positive outlier patterns was further illustrated by the variation in the βj population-specific parameter estimates from the Bayesian analysis (fig. 4). The Tunisian and Moroccan populations (both presenting the highest annual mean temperature, around 17°) have a preponderant role in these patterns, as well as a French northern population (Pleucadec). These populations could either be more isolated, thus experiencing less gene flow from others, or have smaller effective population sizes (Ne). Moroccan stands have been shown to exhibit low diversity using chloroplast microsatellites and allozymes (Wahid et al. 2004Go; Bucci et al. 2007Go) due to the typically small and fragmented populations sampled which are severely threatened by overexploitation and lack of natural regeneration (Wahid et al. 2004Go) and thus might be of relatively small Ne. However, the population studied here (Tamrabta) covers hundreds of hectares (ha) with over 200 trees per ha. Similarly, the Tabarka and Pleucadec populations can be considered as having reasonable population sizes due to large surrounding stands. Thus, the combination of low immigration from populations occurring in different environments, and relatively strong selection pressures at the extreme of the range are necessary to explain their higher differentiation with other populations. This could have generated a favorable situation for adaptive divergence to occur efficiently at particular genes such as PR-AGP4, erd3, and GRP3. Beaumont and Balding (2004)Go showed through simulations that appreciable discrimination of positively selected loci was obtained when the selection rate was 5-fold higher than the migration rate, in a situation where FST was lower than in our case, so this might constitute a conservative estimate for selection coefficients at positive outliers detected. Selection coefficients in the range of 0.01 to 0.04 would be sufficient to allow the detection of outliers with the methods employed here in case of genes with relatively high diversity, on the basis of simulations in a comparable situation by Hoffman et al. (2006Go). In contrast for Hs values below 0.4 and for an upper confidence boundary reaching 0.4 for FST, we may assume that selection effects must have been fairly strong to produce outliers, in particular for PR-AGP4 S525 and erd3 S42. For other genes with weaker selection coefficients, we could hypothesize that gene flow could have interacted to mask particular effects of local adaptation at the molecular level.

Conversely, Coca and Oria (in central and southern spain, respectively) behaved as negative population outliers from their βj parameter values (fig. 4), indicating that they would either experience more gene flow from other populations or have a larger population size. For Oria, this might be consistent with the fact that it would belong to former glacial refugial areas in southeastern Spain (suggested initially by Salvador et al. [2000Go]). Substantial Chloroplast simple-sequence repeat variation has been found in this population (12 haplotypes and genetic diversity values of 0.96; our unpublished results) despite being relatively well isolated from nearby populations (González-Martínez, et al. 2007Go). Coca negative population outlier pattern may originate from admixture between surrounding gene pools in this population of large effective population size.


    Perspectives and Conclusion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
Although confirmatory evidence may come from functional studies, other sources of evidence can strengthen assumptions that selection might have influenced or caused the outlier patterns observed (Storz 2005Go; Vasemägi and Primmer 2005Go). Additional evidence of clinal variation and possible links to the climatic conditions in which the populations have evolved for PR-AGP4 has been mentioned above, but it might not be easy to disentangle which environmentally mediated factors had the strongest effect on putative candidate genes or to discern whether these effects are confounded with demographic effects. Studying differentiation patterns at a finer spatial scale along an environmental gradient could avoid historical effects, but one needs also to carefully evaluate alternative neutral scenarios. More insight from the respective roles of demography and selection can come from tests performed within larger and unstructured populations, which might be more powerful than FST-based tests to detect recent selective events (Fay and Wu 2000Go; Biswas and Akey 2006).

Complex patterns of selection could also lead to intermediate levels of FST and go undetected as values would fall within the neutral expected envelopes. Indeed, it has been shown theoretically that in species with high gene flow, selection for adaptive divergence at polygenic traits might be more associated with covariances of allele frequencies across populations than with high FST at individual loci coding for a trait (Latta 2003Go; Le Corre and Kremer 2003Go). This would call for a shift toward multilocus methods to identify selection at the molecular level, which might be envisaged in the near future with sequence data from larger samples of genes and populations in nonmodel species.

Despite their robustness to a large range of demographic scenarios (Beaumont 2005; Nielsen 2005Go), the FST parameters used here are estimated from allele frequencies considered as equally distant (unordered alleles), following Weir and Cockerham (1984)Go. With the NST estimate of population differentiation on sequence data, information on the genealogical distances among haplotypes is integrated and can enable assessment of the phylogeographical structure. The NST estimate might be also more appropriate to capture Wright's original definition of FST as an inbreeding coefficient (the probability of alleles that are identical-by-descent being combined in zygotes) and thus provide a more accurate estimation of Nm despite a larger variance (Neigel 2002Go). Among the genes studied, GRP3 showed a significantly higher NST than FST value, which could be due to a strong signal of phylogeographic structure. The geographic distribution of haplotype frequencies in the different populations at this locus illustrates fairly well the pattern detected and is comparable with similar distributions observed previously for both mitochondrial DNA and chloroplastDNA markers (Burban and Petit 2003Go; Bucci et al. 2007Go; see also supplementary fig. S4 [Supplementary Material online]). How this type of pattern may be affected by selection is an open question and suggests an interest in developing methods based on NST for searching outliers when sequence data on candidate genes are available.

This exploratory study provided criteria from FST-based methods that allowed candidate genes potentially affected by selection to be identified. One way to validate these predicted functional roles is to perform association studies between their allelic variation and drought stress response in maritime pine natural or breeding populations. All markers, whether genes, single sites, or haplotypes, will have to be tested for association with phenotypic variation as it is not clear what will their respective impact be on the power to detect true associations (Buntjer et al. 2005Go). We plan to test associations between relevant SNPs and several traits related to water use efficiency (e.g., carbon isotope discrimination, growth, and biomass) in a common garden experiment. Comparing differentiation between populations at the phenotypic level (QST) and candidate gene level (FST) or mapping these genes in maritime pine pedigrees to look for colocalization with targeted traits' QTLs, would also provide useful evidence for their putative involvement in drought tolerance trait variation (Brendel et al. 2002Go). Recently published results in conifers (Brown et al. 2004Go; Neale and Savolainen 2004Go; Howe and Brunner 2005Go; González-Martínez, Wheeler, et al. 2007Go) with similar reproductive and ecological characteristics (extensive natural populations, little confounding substructure), genome structure similarities (Chagné et al. 2004), comparable levels of nucleotide diversity, and rapid decay of LD across genes, suggest the potential efficiency of candidate gene association studies in this group of species. This study also illustrates that conifer species (despite their large genome sizes) can be good models for studying the molecular basis of adaptive divergence in natural populations.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
Supplementary tables S1–S7 and figures S1–S6 are available at Molecular Biology Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 
We thank David Balding for providing the "BayesFst" program and further assistance on its application to our data, María Luisa Lafuente and Raúl Fernández for technical assistance, Ricardo Alia and Jean Brach for help with populations sampling and discussions on material phenotypic variation, Christian Burban for help in acquisition and interpreting climatic data on southern populations, 2 anonymous reviewers and Josquin Tibbits for his helpful comments and discussion on the manuscript. This work and E. Eveno scholarship were funded by the European TREESNIPs project no. 836501. The work of S. C. González-Martínez was supported by a ‘Ramón y Cajal’ fellowship RC02-2941. Part of the sequence data analyzed here were obtained at the Genotyping and Sequencing facility of Bordeaux (grants from the Conseil Régional d'Aquitaine no. 20030304002FA and 20040305003FA and from the European Union, FEDER no. 2003227).


    Footnotes
 
Marcy Uyenoyama, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Perspectives and Conclusion
 Supplementary Material
 Acknowledgements
 References
 

    Acheré V, Favre JM, Besnard G, Jeandroz S. Genomic organization of molecular differentiation in Norway spruce (Picea abies). Mol Ecol (2005) 14:3191–3201.[CrossRef][Medline]

    Aitken SN, Hannerz M. Genecology and gene resource management strategies for conifer cold hardiness. In: Conifer cold hardiness—Bigras FJ, Columbo SJ, eds. (2001) Dordrecht (The Netherlands): Kluwer Academic Publishers. 23–53.

    Alía R, Gil L, Pardos J. Performance of 43 Pinus pinaster Ait. provenances on 5 locations in central Spain. Silvae Genet (1995) 44:75–81.

    Alía R, Moro J, Denis JB. Performance of Pinus pinaster Ait. provenances in Spain: interpretation of the genotype-environment interaction. Can J For Res (1997) 27:1548–1559.[CrossRef]

    Andersen JR, Schrag T, Melchinger AE, Zein I, Lubberstedt T. Validation of Dwarf8 polymorphisms associated with flowering time in elite European inbred lines of maize (Zea mays L.). Theor Appl Genet (2005) 111:206–217.[CrossRef][Web of Science][Medline]

    Andofalto P. Adaptive hitchhiking effects on genome variability. Curr Opin Genet Dev (2001) 11:635–641.[CrossRef][Web of Science][Medline]

    Baines JF, Das A, Mousset S, Stephan W. The role of natural selection in genetic differentiation of worldwide populations of Drosophila ananassae. Genetics (2004) 168:1987–1998.[Abstract/Free Full Text]

    Balding DJ. Likelihood-based inference for genetic correlation coefficients. Theor Popul Biol (2003) 63:221–230.[CrossRef][Web of Science][Medline]

    Barton NH, Keightley PD. Understanding quantitative genetic variation. Nat Rev Genet (2002) 3:11–20.[Web of Science][Medline]

    Beaumont MA. Adaptaion and speciation: What can FST tell us? Trends Ecol. Evol. (2005) 20:435–440.

    Beaumont MA, Balding DJ. Identifying adaptive genetic divergence among populations from genome scans. Mol Ecol (2004) 13:969–980.[CrossRef][Medline]

    Beaumont MA, Nichols RA. Evaluating loci for use in the genetic analysis of population structure. Proc R Soc Lond. B (1996) 263:1619–1626.[CrossRef]

    Biswas S, Akey JM. Genomic insights into positive selection. Trends in Genetics. (2006) 22:437–446.

    Bonin A, Taberlet P, Miaud C, Pompanon F. Explorative genome scan to detect candidate loci for adaptation along a gradient of altitude in the common frog (Rana temporaria). Mol Biol Evol (2006) 23:773–783.[Abstract/Free Full Text]

    Borevitz JO, Chory J. Genomic tools for QTL analysis and gene discovery. Curr Opin Plant Biol (2004) 7:132–136.[CrossRef][Web of Science][Medline]

    Bray EA. Plant responses to water deficit. Trends Plant Sci (1997) 2:48–54.[CrossRef][Web of Science]

    Brendel O, Pot D, Plomion C, Rozenberg P, Guehl JM. Genetic parameters and QTL analysis of delta C-13 and ring width in maritime pine. Plant Cell Environ (2002) 25:945–953.[CrossRef]

    Brohede J, Ellegren H. Microsatellite evolution: polarity of substitutions within repeats and neutrality of flanking sequences. Proc R Soc Lond B Biol Sci (1999) 266:825–833.[Medline]

    Brown GR, Gill GP, Kuntz RJ, Langley CH, Neale DB. Nucleotide diversity and linkage disequilibrium in loblolly pine. Proc Natl Acad Sci USA (2004) 101:15255–15260.[Abstract/Free Full Text]

    Bucci G, González-Martínez SC, Le Provost G, Plomion C, Ribeiro MM, Sebastiani F, Alía R, Vendramin GG. Range-wide phylogeography and gene zones in Pinus pinaster Ait. revealed by chloroplast microsatellite markers. Mol Ecol (2007) 16:2137–2153.[CrossRef][Medline]

    Buntjer JB, Sorensen AP, Peleman JD. Haplotype diversity: the link between statistical and biological association. Trends Plant Sci (2005) 10(10):466–471.[CrossRef][Web of Science][Medline]

    Burban C, Petit RJ. Phylogeography of maritime pine inferred with organelle markers having contrasted inheritance. Mol Ecol (2003) 12:1487–1495.[CrossRef][Medline]

    Campbell D, Bernatchez L. Generic scan using AFLP markers as a means to assess the role of directional selection in the divergence of sympatric whitefish ecotypes. Mol Biol Evol (2004) 21:945–956.[Abstract/Free Full Text]

    Caruso A, Morabito D, Delmotte F, Kahlem G, Carpin S. Dehydrin induction during drought and osmotic stress in Populus. Plant Physiol Biochem (2002) 40:1033–1042.[CrossRef][Web of Science]

    Cavalli-Sforza LL. Population structure and human evolution. Proc R Soc Lond B (1966) 164:362–379.[Medline]

    Chagné D, Chaumeil P, Ramboer A, et al, (12 co-authors). Cross-species transferability and mapping of genomic and cDNA SSRs in pines. Theor Appl Genet (2004) 109:1204–1214.[CrossRef][Web of Science][Medline]

    Chagné D, Lalanne C, Madur D, et al, (12 co-authors). A high density linkage map of Pinus pinaster based on AFLPs. Ann For Sci (2002) 59:627–636.[CrossRef]

    Chang S, Puryear JD, Dias MADL, Funkhouser EA, Newton RJ, Cairney J. Gene expression under water deficit in loblolly pine (Pinus taeda): isolation and characterization of cDNA clones. Physiol Plant (1996) 97:139–148.[CrossRef]

    Charlesworth D, Charlesworth B, McVean GA. Genome sequences and evolutionary biology, a two-way interaction. Trends Ecol Evol (2001) 16(5):235–242.[CrossRef][Medline]

    Chaumeil P. Plasticité moléculaire de deux écotypes de pin maritime soumis à un stress osmotique PhD Thesis (2006) University of Bordeaux-I, Bordeaux, France.

    Christians JK, Keightley PD. Genetic architecture: dissecting the genetic basis of phenotypic variation. Curr Biol (2002) 12:415–416.[CrossRef][Web of Science][Medline]

    Costa P, Bahrman N, Frigerio JM, Kremer A, Plomion C. Water-deficit-responsive proteins in maritime pine. Plant Mol Biol (1998) 38:587–596.[CrossRef][Web of Science][Medline]

    Cruz RT, Jordan WR, Drew MC. Structural changes and associated reduction of hydraulic conductance in roots of Sorghum bicolour L. following exposure to water deficit. Plant Physiol (1992) 99:203–212.[Abstract/Free Full Text]

    Derory J, Mariette S, Gonzaléz-Martínez SC, Chagné D, Madur D, Gerber S, Brach J, Persyn F, Ribeiro MM, Plomion C. What can nuclear microsatellites tell us about maritime pine genetic resources conservation and provenance certification strategies? Ann For Sci (2002) 59:699–708.[CrossRef]

    Destremau DX, Jolly H, Thari T. Contribution à la connaissance des provenances de Pinus pinaster. Ann Rech For Maroc (1976) 16:101–153.

    Dubos C, Le Provost G, Pot D, Salin F, Lalane C, Madur D, Frigerio JM, Plomion C. Identification and characterization of water-stress-responsive genes in hydroponically grown maritime pine (Pinus pinaster) seedlings. Tree Physiol (2003) 23:169–179.[Abstract]

    Dubos C, Plomion C. Identification of water-deficit responsive genes in maritime pine (Pinus pinaster Ait.) roots. Plant Mol Biol (2003) 51:249–262.[CrossRef][Web of Science][Medline]

    Dure L III. Structural motifs in Lea proteins. In: Plant responses to cellular dehydration during environmental stress—close TJ, Bray EA, eds. (1993) Rockville (MD): American Society of Plant Physiologists. 91–103.

    Ehrenreich IM, Purugganan MD. The molecular genetic basis of plant adaptation. Am J Bot (2006) 93:953–962.[Abstract/Free Full Text]

    Endler JA. Natural selection in the wild (1986) Princeton (NJ): Princeton University Press.

    Excoffier L, Laval G, Schneider S. Arlequin ver. 3.0: an integrated software package for population genetics data analysis. Evol Bioinform Online (2005) 1:47–50.[Medline]

    Farquhar GD, Ehleringer JR, Hubick KT. Carbon isotope discrimination and photosynthesis. Annu Rev Plant Physiol Plant Mol Biol (1989) 40:503–537.[CrossRef][Web of Science]

    Fay JC, Wu C-I. A human population bottleneck is not incompatible with the discordance between patterns of mitochondrial vs. nuclear DNA variation. Mol Biol Evol (1999) 16:1003–1006.[Web of Science][Medline]

    Fay JC, Wu C-I. Hitchhiking under positive Darwinian selection. Genetics (2000) 155:1405–1413.[Abstract/Free Full Text]

    Filatov DA. ProSeq: a software for preparation and evolutionary analysis of DNA sequence data sets. Mol Ecol Notes (2002) 2:621–624.[CrossRef][Web of Science]

    Flint J, Mott R. Finding the molecular basis of quantitative traits: successes and pitfalls. Nat Rev Genet (2001) 2:437–445.[CrossRef][Web of Science][Medline]

    Ford MJ. Applications of selective neutrality tests to molecular ecology. Mol Ecol (2002) 11:1245–1262.[CrossRef][Medline]

    Galtier N, Depaulis F, Barton NH. Detecting bottlenecks and selective sweeps from DNA sequence polymorphism. Genetics (2000) 155(2):981–987.[Abstract/Free Full Text]

    García-Gil MR, Mikkonen M, Savolainen O. Nucleotide diversity at two phytochrome loci along a latitudinal cline in Pinus sylvestris. Mol Ecol (2003) 12:1195–1206.[CrossRef][Medline]

    Garnier-Géré PH, Ades PK. Environmental surrogates for predicting and conserving adaptive genetic variability in tree species. Conserv Biol (2001) 15:1632–1644.[CrossRef]

    González-Martínez SC, Alía R, Gil L. Population genetic structure in a Mediterranean pine (Pinus pinaster Ait.): a comparison of allozyme markers and quantitative traits. Heredity (2002) 89:199–206.[CrossRef][Web of Science][Medline]

    González-Martínez SC, Ersoz E, Brown GR, Wheeler NC, Neale DB. DNA sequence variation and selection of tag single-nucleotide polymorphisms at candidate genes for drought-stress response in Pinus taeda L. Genetics (2006) 172:1915–1926.[Abstract/Free Full Text]

    González-Martínez SC, Gómez A, Carrión JS, Agúndez D, Alía R, Gil L. Spatial genetic structure of an explicit glacial refugium of maritime pine (Pinus pinaster Aiton) in southeastern Spain. In: Phylogeography in southern European refugia: evolutionary perspectives on the origins of European biodiversity—Weiss S, Ferrand N, eds. (2007) Kluwer Academic Publishers London. 257–269.

    González-Martínez SC, Krutovsky KV, Neale DB. Forest-tree population and genomics and adaptive evolution. New Phytol (2006) 170:227–238.[CrossRef][Web of Science][Medline]

    González-Martínez SC, Wheeler NC, Rezos E, Nelson CD, Neale DB. Association genetics in Pinus taeda L.I. wood property traits. Genetics (2007) 175:399–409.[Abstract/Free Full Text]

    Guevara MA, Chagné D, Almeida MH, et al, (16 co-authors). Isolation and characterization of nuclear microsatellite loci in Pinus pinaster Ait. Mol Ecol Notes (2005) 5:57–59.[CrossRef][Web of Science]

    Haddrill PR, Thornton KR, Charlesworth B, Andolfatto P. Multilocus pattern of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. Genome Res (2005) 15:790–799.[Abstract/Free Full Text]

    Hardy OJ, Vekemans X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes (2002) 2:618–620.[CrossRef][Web of Science]

    Hein J, Schierup M, Wiuf C. Gene genealogies, variation and evolution: a primer in coalescent theory (2004) New York: Oxford University Press.

    Hellmann I, Ebersberger I, Ptak SE, Pääbo S, Przeworski M. A neutral explanation for the correlation of diversity with recombination rates in humans. Am J Hum Genet (2003) 72:1527–1535.[CrossRef][Web of Science][Medline]

    Heuertz M, De Paoli E, Källman T, Larsson H, Jurman I, Morgante M, Lascoux M, Gyllenstrand N. Multilocus patterns of nucleotide diversity, linkage disequilibrium and demographic history of norway spruce [Picea abies (L.) Karst]. Genetics (2006) 174:2095–2105.[Abstract/Free Full Text]

    Hoffman EA, Schueler FW, Jones AG, Blouin MS. An analysis of selection on a colour polymorphism in the northern leopard frog. Mol Ecol (2006) 15:2627–2641.[CrossRef][Medline]

    Howe GT, Brunner AM. An evolving approach for understanding plan adaptation. New Phytol (2005) 167:1–5.[CrossRef][Web of Science][Medline]

    Huey RB, Gilchrist GW, Carlson ML, Berrigan D, Serra L. Rapid evolution of a geographic cline in size in an introduced fly. Science (2000) 287:308–309.[Abstract/Free Full Text]

    Hurme P, Repo T, Savolainen O, Pääkkönen T. Climatic adaptation of bud set and frost hardiness in Scots pine (Pinus sylvestris). Can J For Res (1997) 27:716–723.[CrossRef]

    Ingram J, Bartels D. The molecular basis of dehydration tolerance in plants. Annu Rev Plant Physiol Plant Mol Biol (1996) 47:377–403.[CrossRef][Web of Science][Medline]

    Joshi J, Schmid B, Caldeira MC, et al, (18 co-authors). Local adaptation enhances performance of common plant species. Ecol Lett (2001) 4:536–544.[CrossRef][Web of Science]

    Joyce DC, Aspinall D, Edwards GR. Water deficit and the growth and anatomy of the radish fleshy axis. New Phytol (1983) 93:439–446.[CrossRef][Web of Science]

    Jump AS, Hunt JM, Martinez-Izquierdo JA, Penuelas J. Natural selection and climate change: temperature-linked spatial and temporal trends in gene frequency in Fagus sylvatica. Mol Ecol (2006) 15:3469–3480.[CrossRef][Medline]

    Kado T, Yoshimaru H, Tsumura Y, Tachida H. DNA variation in a conifer, Cryptomeria japonica (Cupressaceae sensu lato). Genetics (2003) 164:1547–1559.[Abstract/Free Full Text]

    Kawecki TJ, Ebert D. Conceptual issues in local adaptation. Ecol Lett (2004) 7:1225–1241.[CrossRef][Web of Science]

    Kittelson PM, Maron JL. Fine-scale genetically based differentiation of life-history traits in the perennial shrub lupinus arboreus. Evolution (2001) 55:2429–2438.[CrossRef][Web of Science][Medline]

    Krutovsky KV, Neale DB. Nucleotide diversity and linkage disequilibrium in cold-hardiness- and wood quality-related candidate genes in Douglas fir. Genetics (2005) 171:2029–2041.[Abstract/Free Full Text]

    Latta RG. Gene flow, adaptive population divergence and comparative population structure across loci. New Phytol (2003) 161:51–58.[Web of Science]

    Laval G, Excoffier L. SIMCOAL 2.0: a program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history. Bioinformatics (2004) 20:2485–2487.[Abstract/Free Full Text]

    Le Corre V, Kremer A. Comparison of genetic variability at neutral markers, QTLs and adaptive trait in a subdivided population: effects of selection intensity, variance of local optima and migration. Genetics (2003) 164:1205–1219.[Abstract/Free Full Text]

    Li WH, Graur D. Fundamentals of molecular evolution (1991) Sunderland (MA): Sinauer Associates.

    Linhart YB, Grant MC. Evolutionary significance of local genetic differentiation in plants. Annu Rev Ecol Syst (1996) 27:237–277.[CrossRef][Web of Science]

    Luikart G, England PR, Tallmon D, Jordan S, Taberlet P. The power and promise of population genomics: from genotyping to genome typing. Nat Rev Genet (2003) 4:981–994.[Web of Science][Medline]

    Mariette S, Chagné D, Decroocq S, Giovanni G, Lalanne C, Madur D, Plomion C. Microsatellite markers for Pinus pinaster Ait. Ann For Sci (2001) 58:203–206.[CrossRef]

    Mariette S, Le Corre V, Austerlitz F, Kremer A. Sampling within the genome for measuring within population diversity: trade-offs between markers. Mol Ecol (2002) 11:1145–1156.[CrossRef][Medline]

    Masle J, Gilmore SR, Farquhar GD. The ERECTA gene regulates plant transpiration efficiency in Arabidopsis. Nature (2005) 436:866–870.[CrossRef][Medline]

    McGuire W, Hill AV, Allsopp CE, Greenwood BM, Kwiatkowski D. Variation in the TNF-alpha promoter region associated with susceptibility to cerebral malaria. Nature (1994) 371:508–511.[CrossRef][Medline]

    McKay JK, Latta RG. Adaptive population divergence: markers, QTL and traits. Trends Ecol Evol (2002) 17:285–291.[CrossRef]

    Neale DB, Savolainen O. Association genetics of complex traits in conifers. Trends Plant Sci (2004) 97:325–330.

    Nei M. Molecular Evolutionary Genetics (1987) New York: Columbia University Press.

    Nei M, Kumar S. Molecular evolution and phylogenetics (2000) New York: Oxford University Press.

    Neigel JE. Is FST obsolete? Conserv Genet (2002) 3:167–173.[CrossRef]

    Nguyen-Queyrens A, Bouchet-Lannat F. Osmotic adjustment in three-year-old seedlings of five provenances of maritime pine (Pinus pinaster) in response to drought. Tree Physiol (2003) 23:397–404.[Abstract]

    Nielsen R. Statistical tests of selective neutrality in the age of genomics. Heredity (2001) 86:641–647.[CrossRef][Web of Science][Medline]

    Nielsen R. Molecular signatures of natural selection. Annu Rev Genet (2005) 39:197–218.[CrossRef][Web of Science][Medline]

    O'Hagan A, Luce BR. A primer on Bayesian statistics in health economics and outcomes research (2003) London: Medtap International Inc.

    Padmanabhan V, Dias DMAL, Newton RJ. Expression analysis of a gene family in loblolly pine (Pinus taeda L.) induced by water deficit stress. Plant Mol Biol (1997) 35:801–807.[CrossRef][Web of Science][Medline]

    Petit RJ, Bahrman N, Baradat PH. Comparison of genetic differentiation in maritime pine (Pinus pinaster Ait.) estimated using isozyme, total protein and terpenic loci. Heredity (1995) 75:382–389.[CrossRef][Web of Science]

    Petit RJ, Duminil J, Fineschi S, Hampe A, Salvini D, Vendramin GG. Comparative organization of chloroplast, mitochondrial and nuclear diversity in plant populations. Mol Ecol (2005) 14:689–701.[CrossRef][Medline]

    Plomion C, Bahrman N, Durel CE, O'Malley DM. Genomic analysis in Pinus pinaster (Maritime pine) using RAPD and protein markers. Heredity (1995) 74:661–668.[CrossRef][Web of Science]

    Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics (2000) 155:945–959.[Abstract/Free Full Text]

    Ramanjalu S, Bartels D. Drought- and dessication-induced modulation of gene expression in plants. Plant Cell Environ (2002) 25:141–151.[CrossRef][Medline]

    Rehfeldt GE, Ying CC, Spittlehouse DL, Hamilton DA Jr. Genetic responses to climate in Pinus contorta: niche breadth, climate change and reforestation. Ecol Monogr (1999) 69:375–407.[Web of Science]

    Richard S, Morency MJ, Drevet C, Jouanin L, Séguin A. Isolation and characterization of a dehydrin gene from white spruce induced upon wounding, drought and cold stresses. Plant Mol Biol (2000) 43:1–10.[CrossRef][Web of Science][Medline]

    Rodriguez PL. Protein phosphatase 2C (PP2C) function in higher plants. Plant Mol Biol (1998) 38:919–927.[CrossRef][Web of Science][Medline]

    Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics (2003) 19:2496–2497.[Abstract/Free Full Text]

    Rueffler C, Van Dooren TJM, Leimar O, Abrams PA. Disruptive selection then what? Trends Ecol Evol (2006) 21(5):238–245.[CrossRef][Medline]

    Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lander ES. Positive natural selection in the human lineage. Science (2006) 12:1614–1620.

    Salvador L, Alía R, Agúndez D, Gil L. Genetic variation and migration pathways of maritime pine (Pinus pinaster Ait) in the Iberian peninsula. Theor Appl Genet (2000) 100:89–95.[Medline]

    Savolainen O, Pyhäjärvi T. Genomic diversity in forest trees. Curr Opin Plant Biol (2007) 10:1–6.[CrossRef][Web of Science][Medline]

    Schaal BA, Olsen KM. Gene genealogies and population variation in plants. Proc Natl Acad Sci USA (2000) 97:7024–7029.[Abstract/Free Full Text]

    Schemske DW. Population structure and local selection in Impatiens pallida (Balsaminaceae), a selfing annual. Evolution (1984) 38:817–832.[CrossRef][Web of Science]

    Schlötterer C. Towards a molecular characterization of adaptation in local populations. Curr Opin Genet Dev (2002a) 12:683–687.[CrossRef][Web of Science][Medline]

    Schlötterer C. A microsatellite-based multilocus screen for the identification of local selective sweeps. Genetics (2002b) 160:753–763.[Abstract/Free Full Text]

    Scotti-Saintagne C, Mariette S, Porth I, Goicoechea PG, Barreneche T, Bodénès C, Burg K, Kremer A. Genome scanning for interspecific differentiation between two closely related oak species [Quercus robur L. and Q. petraea (Matt.) Liebl.]. Genetics (2004) 168:1615–1626.[Abstract/Free Full Text]

    Seki M, Narusaka M, Abe H, Kasuga M, Yamaguchi-Shinozaki K, Carninci P, Hayashizaki Y, Shinozaki K. Monitoring the expression pattern of 1300 Arabidopsis genes under drought and cold stresses by using a full-length cDNA microarray. Plant Cell (2001) 13:61–72.[Abstract/Free Full Text]

    Seki M, Umezawa T, Urano K, Shinozaki K. Regulatory metabolic networks in drought stress responses. Curr Opin Plant Biol (2007) 10:296–302.[CrossRef][Web of Science][Medline]

    Shabalina SA, Ogurtsov AY, Kondrashov VA, Kondrashov AS. Selective constraint in intergenic regions of human and mouse genomes. Trends Genet (2001) 17:373–376.[CrossRef][Web of Science][Medline]

    Shinozaki K, Yamaguchi-Shinozaki K. Molecular responses to dehydration and low temperature: differences and cross-talk between two stress signalling pathways. Curr Opin Plant Biol (2000) 3:217–233.[Web of Science][Medline]

    Shoemaker J, Painter I, Weir BS. Bayesian statistics in genetics. Trends Genet (1999) 15:354–358.[CrossRef][Web of Science][Medline]

    Slate J. Quantitative trait locus mapping in natural populations: progress, caveats and future directions. Mol Ecol (2005) 14:363–379.[CrossRef][Medline]

    Société Centrale pour l’Equipement du Territoire- Tunisie. Integrated management for water resources in the North of Tunisia. Final Report for the Tunisian Republic agriculture and hydraulic resources ministry (2005).

    Sokal RR, Rohlf FJ. Biometry (1995) 3rd ed. San Francisco (CA): Freeman.

    Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci USA (2003) 100:9440–9445.[Abstract/Free Full Text]

    Storz JF. Contrasting patterns of divergence in quantitative traits and neutral DNA markers: analysis of clinal variation. Mol Ecol (2002) 11:2537–2551.[CrossRef][Medline]

    Storz JF. Using genome scans of DNA polymorphism to infer adaptive population divergence. Mol Ecol (2005) 14:671–688.[CrossRef][Medline]

    Street NR, Skogstrom O, Tucker J, Rodriguez-Acosta M, Nilsson P, Jansson S, Taylor G. The genetics and genomics of the drought response in Populus. Plant J (2006) 48:321–341.[CrossRef][Web of Science][Medline]

    Tabor HK, Risch NJ, Myers RM. Opinion: candidate-gene approaches for studying complex genetic traits: practical considerations. Nat Rev Genet (2002) 3:391–397.[Web of Science][Medline]

    Tajima F. Evolutionary relationship of DNA sequences in finite populations. Genetics (1983) 105:437–460.[Abstract/Free Full Text]

    Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics (1989) 123:585–595.[Abstract/Free Full Text]

    Thomas CD. Recent evolutionary effects of climate change. In: Climate change and biodiversity—Lovejoy TE, Hannah L, eds. (2005) New Haven (Connecticut): Yale University Press. 75–88.

    Thompson AJ, Corlett JE. mRNA levels of four tomato (L. esculentum Mill L.) genes related to fluctuating soil water status. Plant Cell Environ (1995) 18:773–780.[CrossRef]

    Vasemägi A, Nilsson J, Primmer CR. Expressed sequence tag-linked microsatellites as a source of gene-associated polymorphisms for detecting signatures of divergent selection in atlantic salmon (Salmo salar L). Mol Biol Evol (2005) 22:1067–1076.[Abstract/Free Full Text]

    Vasemägi A, Primmer CR. Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies. Mol Ecol (2005) 14:3623–3642.[CrossRef][Medline]

    Verhoeven KJF, Simonsen KL, McIntyre LM. Implementing false discovery rate control: increasing your power. Oikos (2005) 108:643–647.[CrossRef][Web of Science]

    Wahid N, González-Martínez SC, El Hadrami I, Boulli A. Genetic structure and variability of natural populations of maritime pine (Pinus pinaster Aiton) in Morocco. Silvae Genet (2004) 53:93–99.

    Waldmann P, García-Gil MR, Sillanpää MJ. Comparing Bayesian estimates of genetic differentiation of molecular markers and quantitative traits: an application to Pinus sylvestris. Heredity (2005) 94:623–629.[CrossRef][Web of Science][Medline]

    Wang R-L, Stec A, Hey J, Lukens L, Doebley J. The limits of selection during maize domestication. Nature (1999) 398:236–239.[CrossRef][Medline]

    Watkinson JI, Sioson AA, Vasquez-Robinet C, et al, (14 co-authors). Photosynthetic acclimation is reflected in specific patterns of gene expression in drought-stressed loblolly pine. Plant Physiol (2003) 133:1702–1716.[Abstract/Free Full Text]

    Watterson GA. On the number of segregating sites in genetical models without recombination. Theor Popul Biol (1975) 7:256–276.[CrossRef][Web of Science][Medline]

    Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution (1984) 38:1358–1370.[CrossRef][Web of Science]

    Wilding CS, Butlin RK, Grahame J. Differential gene exchange between parapatric morphs of Littorina saxatilis detected using AFLP markers. J Evol Biol (2001) 14:611–619.[CrossRef][Web of Science]

    Wright SI, Gaut BS. Molecular population genetics and the search for adaptive evolution in plants. Mol Biol Evol (2005) 22:506–519.[Abstract/Free Full Text]

    Yang S-H, Wang H, Sathyan P, Stasolla C, Loopstra CA. Real-time RT-PCR analysis of loblolly pine (Pinus taeda) arabinogalactan-protein-like genes. Physiol Plant (2005) 124:91–106.[CrossRef]

    Zhang YI, Sederoff RR, Allona I. Differential expression of genes encoding cell wall proteins in vascular tissues from vertical and bent loblolly pine trees. Tree Physiol (2000) 20:457–466.[Abstract]

Accepted for publication December 4, 2007.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
A. J. Eckert, J. L. Wegrzyn, B. Pande, K. D. Jermstad, J. M. Lee, J. D. Liechty, B. R. Tearse, K. V. Krutovsky, and D. B. Neale
Multilocus Patterns of Nucleotide Diversity and Divergence Reveal Positive Selection at Candidate Genes Related to Cold Hardiness in Coastal Douglas Fir (Pseudotsuga menziesii var. menziesii)
Genetics, September 1, 2009; 183(1): 289 - 298.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
A. J. Eckert, A. D. Bower, J. L. Wegrzyn, B. Pande, K. D. Jermstad, K. V. Krutovsky, J. B. St. Clair, and D. B. Neale
Association Genetics of Coastal Douglas Fir (Pseudotsuga menziesii var. menziesii, Pinaceae). I. Cold-Hardiness Related Traits
Genetics, August 1, 2009; 182(4): 1289 - 1302.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A.E. Palme, T. Pyhajarvi, W. Wachowiak, and O. Savolainen
Selection on Nuclear Genes in a Pinus Phylogeny
Mol. Biol. Evol., April 1, 2009; 26(4): 893 - 905.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
25/2/417    most recent
msm272v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Eveno, E.
Right arrow Articles by Garnier-Géré, P. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Eveno, E.
Right arrow Articles by Garnier-Géré, P. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?