MBE Advance Access originally published online on December 20, 2006
Molecular Biology and Evolution 2007 24(3):769-783; doi:10.1093/molbev/msl204
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Multilocus Analysis of Nucleotide Variation and Speciation in Oryza officinalis and Its Close Relatives

* State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Xiangshan, Beijing, China
The Graduate School, Chinese Academy of Sciences, Beijing, China
E-mail: gesong{at}ibcas.ac.cn.
| Abstract |
|---|
|
|
|---|
Nucleotide variation in 10 unlinked nuclear genes was investigated in species-wide samples of Oryza officinalis and its close relatives (Oryza eichingeri and Oryza rhizomatis). Average estimates of nucleotide diversity were the lowest in O. rhizomatis (
sil = 0.0038) and the highest in O. eichingeri (
sil = 0.0057) that is disjunctly distributed in Africa and Sri Lanka. These wild rice species appeared to harbor relatively low levels of nucleotide variation relative to other plant species because the diversity level of O. eichingeri is only 2346% of those in Zea species and 35% of that in Arabidopsis thaliana. The lower nucleotide diversity in these Oryza species could be best explained by their smaller historic effective population sizes. The speciation model test indicated that O. officinalis and its close relatives might have undergone a process of population contraction since divergence from their ancestor. Incongruent topologies among 10 gene trees, particularly regarding the positions of O. eichingeri and O. rhizomatis accessions might be attributed to lineage sorting arising from ancient polymorphism and hybridization/introgression between the Sri Lankan O. eichingeri and O. rhizomatis. However, the null hypothesis of the isolation model was not rejected for any contrast between taxa, which suggested that no subsequent gene flow shaped the present patterns of nucleotide variation since their divergence and that introgression was not pervasive in this group of species. Our molecular dating provides an approximate divergence time of 0.37 Myr between 2 geographical races of O. eichingeri, much more recent compared with the times of other speciation events in this group (0.630.68 Myr). A long-distance dispersal from West Africa to Sri Lanka was more likely to play a role in the disjunct distribution of O. eichingeri.
Key Words: nucleotide variation divergence speciation Oryza officinalis
| Introduction |
|---|
|
|
|---|
Inference of recent evolutionary history of closely related species is one of the most intricate questions for evolutionary biologists. The level and pattern of nucleotide variation in DNA sequences provide important information on the evolutionary history of a species and divergent process of closely related species. In recent decades, molecular population genetics and genealogical approaches have been successfully used to reveal the patterns of genetic diversity within and between populations and to trace the histories of divergence and speciation in plants (Eyre-Walker et al. 1998
In the rice genus (Oryza L.), 10 genome groups (i.e., the A-, B-, C-, BC-, CD-, E-, F-, G-, HJ-, and HK-genomes) have been recognized (Ge et al. 1999
; Khush and Brar 2001
), including the A-genome group that the cultivated rice (Oryza sativa) belongs to. The C-genome group, a well-defined monophyletic clade (Ge et al. 1999
), includes 3 closely related diploid species, that is, Oryza officinalis Wall. ex Watt., Oryza eichingeri Peter, and Oryza rhizomatis Vaughan (Tateoka 1965
; Vaughan 1989
). Oryza officinalis is the most common species and distributed widely in southern China, South and Southeast Asia, and Papua New Guinea, whereas O. rhizomatis has only been reported from Sri Lanka. The third species, O. eichingeri, is distributed in Sri Lanka and West and East Africa and is the only wild Oryza species reported from both Asia and Africa. It is intriguing that the Sri Lankan O. eichinger is sympatric to O. rhizomatis with their population being overlapping in both northern and southern Sri Lanka, though their habitats are distinctly different (Bautista et al. 2006
). Phylogenetic and population genetic studies showed that these 3 C-genome species have diverged recently with low level of species differentiation (Ge et al. 1999
; Bao and Ge 2003
; Bao et al. 2006
; Bautista et al. 2006
). Therefore, O. officinalis and its close relatives provide an ideal system to explore demographic history and speciation processes in plants. In this study, we investigate the patterns of nucleotide variation in 10 unlinked nuclear loci in species-wide samples of the 3 C-genome species of Oryza. We aim to use multiple genealogies and population parameters to explore whether they have remained isolated since their divergence and to address the demographic and geographic aspects of their speciation history.
This study also seeks to qualify species-wide levels of nucleotide diversity of O. officinalis and its close relatives and compare the result with those from previous studies using different markers. Although a few studies have been undertaken on the genetic diversity of the C-genome species (Aggarwal et al. 1999
; Gao et al. 2001
; Gao 2005
; Gao and Zhang 2005
; Bautista et al. 2006
), inconsistent results have been obtained probably due to the different samples and molecular markers used. For example, using amplified fragment length polymorphism markers, Aggarwal et al. (1999)
and Bao et al. (2006)
inferred that O. rhizomatis harbored the lowest genetic variation among 3 C-genome species, whereas Bautista et al. (2006)
found that O. eichingeri maintained lower genetic diversity than the other 2 species. To date, a few studies on nucleotide variation have been undertaken on the cultivated rice (O. sativa) and its wild relative Oryza rufipogon (Olsen and Purugganan 2002
; Garris et al. 2003
; Yoshida and Miyashita 2005
; Olsen et al. 2006
). These investigations, however, mainly focused on a single species and were exclusively based on 1 or 2 genes or multiple linked genes. The present study is the first attempt to investigate the nucleotide polymorphism and divergence among the wild Oryza species using multilocus sequence data of unlinked genes. Such information will facilitate the effective use of the wild rice germplasm because the wild species in Oryza possess abundant genes valuable for rice breeding and improvement, such as resistance to diseases and insects and stress tolerances (Khush and Brar 2001
; Vaughan et al. 2003
).
| Materials and Methods |
|---|
|
|
|---|
Species Sampling
The geographic distribution of 3 C-genome species is shown in figure 1. The 3 species are largely allopatric across the pantropical Old World, but O. eichingeri is sympatric to O. rhizomatis in Sri Lanka. The identity and geographic origin of the individuals sampled for each species are presented in table 1 and figure 1. Twelve O. officinalis individuals were collected from 11 countries, covering the entire distribution range of the species. Four O. rhizomatis individuals were sampled to represent the species that is found only in Sri Lanka. Because our previous studies found high level of genetic divergence between the African and Sri Lankan populations of O. eichingeri (Bao and Ge 2003
|
|
Sampled Loci
DNA sequences were obtained for 10 nuclear loci that are located on 10 different chromosomes in rice (O. sativa) (table S1, Supplementary Material online). Adh1 gene encodes alcohol dehydrogenase I (alcohol nicotinamide adenine dinucleotide+: oxidoreductase, EC 1.1.1.1 [EC] ), an important protein in the process of anaerobic metabolism. It is a single copy in the Oryza species and located in the short arm of chromosome 11 in rice (Tarchini et al. 2000
subunit that functions in various systems of signal transduction in diverse tissues or cells in flowering plants (Seo et al. 1995
-1) is one of the recessively inherited resistance genes that provide race-specific resistance to bacterial blight (Blair et al. 2003
Polymerase Chain Reaction Amplification, Cloning, and Sequencing
All polymerase chain reaction (PCR) amplifications were performed in a total volume of 25 µl on a Tpersonal thermocycler (Biometra, Germany), using 1030 ng genomic DNA. The reaction mixture was supplemented with 0.2 µM of each primer, 200 µM of each deoxyribonucleotide triphosphate, 10 mM Tris-Cl (pH = 8.3), 50 mM KCl, 1.5 mM MgCl2, and 0.75 U exTaq DNA polymerase (TaKaRa). To reduce recombinant molecule during PCR (Judo et al. 1998
; Shammas et al. 2001
), long extension time during PCR reactions was used. For instance, a 3-min extension was used during each amplification cycling for
1.4-kb Adh1, a 2.5-min extension for
1.3-kb GPA1, and a 2-min extension for
1.0-kb Lhs1. Amplified products were ligated into pGEM T-easy vectors (Promega, Madison, WI) after being purified from agrose gel with either a Pharmacia purification kit (Amersham Pharmacia Biotech, Piscataway, NJ) or a Dingguo purification kit (Dingguo, Beijing, China). Independent plasmid DNAs were selected randomly and isolated by the method of alkaline lysis plasmid miniprep as described (Ausubel 1992
). Sequencing reactions were performed by a MegaBACE 1000 automated sequencer (Amersham Pharmacia Biotech) or an ABI 3730 automated sequencer (Applied Biosystems, Foster City, CA).
Although both outcrossing and inbreeding species have been recorded for the wild Oryza species (Vaughan 1989
; Dally and Second 1990
), mating system of the C-genome species is largely unclear. Therefore, individuals in these species can be either homozygous or heterozygous at nuclear loci. Thus all amplification products of 10 loci from each individual were cloned, and multiple clones were sequenced. To obtain both alleles from the heterozygous samples, we adopted the partial sequencing strategy (Tiffin and Gaut 2001
) to sequence 8 to 10 clones at each locus. The advantage of the multiclone sequencing is that pseudopolymorphisms induced by Taq polymerase or recombination during PCR amplification could be avoided in the final sequence data set, although this method is relatively laborious and high costing (Palumbi and Baker 1994
; Tiffin and Gaut 2001
; Clark et al. 2004
).
Because Taq errors occur at random, it is unlikely that polymorphisms shared among more than one clones (sequences) are artifactual (Palumbi and Baker 1994
; Eyre-Walker et al. 1998
; Hilton and Gaut 1998
). However, "singletons," that is, polymorphisms occurred in only 1 sequence relative to all the remainder sequences, can represent either true sequence variation or Taq polymerase artifact. Previous studies found that the percentage of singletons resulting from Taq polymerase error ranged from 29% (Hilton and Gaut 1998
) to 100% (Small et al. 1999
; White and Doebley 1999
), depending on the genes and taxa. Because we sequenced more than 8 clones for each individual, most alleles could be easily determined and some of the artificial singletons were removed from the original data set. To confirm the remaining singletons, we performed repeated PCR amplification, cloning, and sequencing and found that 63% (51 out of 81) of the singletons resulted from Taq polymerase error and the corrected sequences were used in the analyses. By means of multiclone sequencing and reamplifying and resequencing, interallelic PCR recombinants were also verified and removed. Therefore, accuracy and reliability of nucleotide polymorphisms in this study are sufficiently guaranteed for subsequent analyses.
All allele sequences have been deposited in GenBank, and their accession numbers are DQ223326DQ223418, DQ901744DQ901953, and DQ911245DQ911249.
Sequence Analysis
Sequence data were edited and assembled with the ContigExpress program from the Vector NTI Suite 6.0 (Informax Inc., North Bethesda, MD). Allele sequences for each locus were aligned using a combination of methods implemented in DAMBE version 4.1.19 (Xia and Xie 2001
) and ClustalX version 1.81 (Thompson et al. 1997
), with additional manual refinements. Levels of intraspecific genetic variation were calculated with estimates of average pairwise differences per basepair between sequences (
) (Nei and Li 1979
) and Watterson's estimates (
w) from S (Watterson 1975
) using both DnaSP version 4.10 (Rozas et al. 2003
) and SITES (Hey and Wakeley 1997
), where S is the number of segregating sites. With the assumption of the standard neutral model of a random-mating population of constant size, the statistic estimate of nucleotide variation
w in an autosomal gene is equal to 4Neµ, where Ne is the effective population size and µ, the mutation rate per generation per site. The minimum number of recombination events was assessed using the algorithm of Hudson and Kaplan (1985)
in the SITES program.
Under the null assumption that molecular variation is evolving neutrally, a number of statistical tests have been used to assess whether selective forces exert influences on patterns of genetic variation. In this study, deviation from standard neutral equilibrium model was tested based on both the frequency spectrum of polymorphisms or the haplotype distribution and the relationship between intraspecific and interspecific diversity. If the hypothesis of the neutrality is not rejected based on single statistical test, we still could not determine if the locus is evolving neutrally because failure of rejection could be simply due to the fact that the test is not sensitive enough to detect certain type of selective force (Wayne and Simonsen 1998
). Therefore, multiple statistic tests for individual locus (Tajima 1989
; Fu and Li 1993
), as well as multilocus tests (Hudson et al. 1987
), were performed to determine the departure from the neutrality hypothesis and make inferences on the species history using the program DnaSP (Rozas et al. 2003
).
Tajima's D (Tajima 1989
) was based on the discrepancy between the mean pairwise differences (
) and Watterson's estimator (
w), whereas D* and F* of Fu and Li (1993)
rely on the difference between the number of polymorphic sites in external branches (polymorphisms unique to an extant sequence) and number of polymorphic site in internal phylogenetic branches (polymorphisms shared by extant sequences). Because selective force is generally considered to affect a particular locus in evolutionary history, the multilocus HKA test across unlinked or loosely linked loci was performed using the program HKA to discriminate between selection forces and population demography during the speciation process. For the HKA tests, O. punctata sequences were used as outgroups. The SITES, HKA, and WH (mentioned below) software packages were distributed kindly by Jody Hey (http://lifesci.rutgers.edu/
heylab). Insertion/deletion polymorphisms were excluded from the analyses.
The genealogical trees of 10 nuclear loci were constructed using the parsimony and distance methods as implemented in PAUP* version 4.0b10 (Swofford 2002
). The Neighbor-Joining (NJ) method (Saitou and Nei 1987
) was performed with Kimura's 2-parameter distances (Kimura 1980
). Maximum parsimony (MP) analyses were performed using heuristic search with MULPARS, Tree Bisection-Reconnection branch swapping, and RANDOM stepwise addition with 1,000 replicates. Topological confidence was assessed by bootstrap analysis with 1,000 replicates. Furthermore, we estimated the divergence time of the C-genome species based on molecular clock hypothesis. To examine rate heterogeneity among lineages, we used the program MEGA version 3.0 (Kumar et al. 2004
) to assess the constancy of molecular evolution across individual lineages.
Ancestral Parameter Estimates
The simple speciation model was fitted based on the different classes of mutations from the multilocus sequence comparisons between 2 species using the program WH (Wakeley and Hey 1997
; Wang et al. 1997
; Kliman et al. 2000
). Three classes of nucleotide mutations for the WH program, including polymorphisms that are exclusive to 1 species, shared polymorphisms between 2 species, and fixed differences between 2 species, could be obtained from the SITES program. Under the assumption of 2 descendent populations (species) separated from an ancestral population (species) with constant population sizes and no gene flow between the populations (species) after their separation, the model presented estimates of population parameters in the ancestral and descendant populations (species) (
A,
1,
2) as well as the time since separation T (scaled in 2N1 generations). A rejection of the model may indicate that gene flow has occurred between a species pair after the time of divergence. Both a simple measure (the difference between the highest and lowest numbers of shared polymorphisms plus the difference between highest and lowest numbers of fixed differences among multiple loci) (Wang et al. 1997
; Machado et al. 2002
) and a
2 statistic (Kliman et al. 2000
) have been used to assess the overall fit to the simple speciation model.
| Results |
|---|
|
|
|---|
Nucleotide Variation
Forty-eight sequences were obtained for each of the 10 loci, with 2 sequences per individual. Total length of the aligned sequences for the 10 genes is 9,916 bp, including 3,321 bp of coding sequence and 6,585 bp of noncoding sequence (table S1, Supplementary Material online). The number of insertiondeletion polymorphisms ranged from 0 to 12 across loci, with a total of 37 indel polymorphisms for the 4 taxa. A detailed examination of these indel polymorphisms showed that all of them occurred in noncoding regions, with 26 (
70%) being 1-bp indels. The remaining indels included four 2-bp, five 3-bp, one 15-bp, and one 39-bp polymorphisms. All indels were not considered in subsequent analyses. The schematic diagrams and the nucleotide polymorphisms in the sequenced regions of 10 genes are provided in Supplementary Materials online (figs. S1 and S2, Supplementary Material online).
Standard statistics of sequence variation for each locus are summarized in table 2, including the estimates of nucleotide variation in different regions at individual loci. As expected, due to strong functional constraint, the levels of nucleotide variation at coding regions were lower than those at noncoding regions at all 10 loci except for 6 cases involving 3 loci (Lhs1, Ks1, and SSIII), where no silent substitution was observed. Levels of polymorphisms varied across loci, with CBP1, GPA1, and Ks1 being the least variable genes in O. eichingeri, O. officinalis, and O. rhizomatis, respectively. At the species level, the average estimates of variation over 10 loci were comparable for all the taxa although O. officinalis has much wider distribution than the other 3. The
sil ranged from 0.0033 (Sri Lankan O. eichingeri) to 0.0044 (O. officinalis) and
sil ranged from 0.0038 (O. rhizomatis) to 0.0042 (Sri Lankan O. eichingeri) (table 2). For O. eichingeri, the Sri Lankan race (
sil = 0.0033;
sil = 0.0042) possessed almost similar level of nucleotide diversity with the African race (
sil = 0.0040;
sil = 0.0039), but the value in the Sri Lankan race was much lower (
sil = 0.0013;
sil = 0.0011) if one introgressed individual was excluded (see Discussion). It is noted that the diversity values of O. eichingeri would be increased (
sil = 0.0057;
sil = 0.0052) if both races were combined into a single data set, indicative of the impact of population subdivision on genetic diversity.
|
Because the amount of recombination influences the coalescent simulations of isolation species model and the phylogenetic inferences (Wang et al. 1997
Tests of Neutrality
To test the standard neutral equilibrium model, we first performed the tests of Tajima's D (Tajima 1989
), and D* and F* of Fu and Li (1993)
to address whether the data show evidence that natural selection has shaped levels of variation for individual loci. The values of Tajima's D and D* and F* of Fu and Li varied vastly across 10 loci and most of them were not significant (table 2). It is noteworthy that both Tajima's D and D* and F* of Fu and Li indicated significant positive values for the locus CBP1 in O. officinalis and Lhs1 in O. rhizomatis, suggesting the presence of balancing selection at these loci. This explanation is consistent with the finding that the elevated diversity was observed at the loci, in particular for CBP1 in O. officinalis (table 2). For O. eichingeri-AFR, no locus was significantly different from 0. For O. eichingeri-LKA, however, 7 values were negative with 3 being significantly less than 0, and 2 (CBP1 and SSII1) were slightly but not significantly positive (one could not be calculated). D* and F* of Fu and Li gave similar patterns (table 2). To determine whether the average values of these tests within taxa significantly deviate from zero, we used a multilocus test based on coalescent simulations to compare the observed Tajima's D and D* and F* values of Fu and Li across all loci against the neutral expectation. Significantly negative mean values of Tajima's D and D* of Fu and Li were observed only for the Sri Lankan O. eichingeri (table 3), consistent with the tests at individual loci (table 2). The overall negative patterns of the tests from O. eichingeri-LKA indicated an excess of low-frequency polymorphisms in sequence data, with the simplest explanation being a recent demographic expansion (Tajima 1989
, and see below) because demographic forces affect all loci simultaneously. It is interesting that level of nucleotide variation at CBP1 is significantly lower in 2 races of O. eichingeri (
sil = 0.00.0007) than in the other species (
sil = 0.00340.0094), suggesting that natural selection might remove variation from this species at this locus.
|
We further performed the multilocus HKA test (Hudson et al. 1987
2 = 15.66, P = 0.616; O. officinalis/O. eichingeri-AFR,
2 = 23.49, P = 0.173; O. officinalis/O. rhizomatis,
2 = 14.52, P = 0.695; O. eichingeri-LKA/O. rhizomats,
2 = 11.61, P = 0.867; O. eichingeri-AFR/O. rhizomats,
2 = 11.87, P = 0.853; and O. eichingeri-LKA/O. eichingeri-AFR,
2 = 7.267, P = 0.988). Because the HKA test statistic for closely related species is not expected to follow the
2 distribution (Machado et al. 2002
|
Shared/Fixed Polymorphism and Divergence
As incipient species diverge from each other, shared polymorphisms are expected to lose whereas fixed differences gradually accumulate (Wakeley and Hey 1997
|
A larger number of fixed differences but no shared polymorphism were observed at Lhs1 locus between all contrasts except for the O. eichingeri-LKA/O. rhizomatis comparison (table 4). This result might imply that this locus has experienced directional selection, in agreement with the positive Tajima's D in 3 taxa except for eichingeri-LKA (table 2). Much more shared polymorphisms than fixed differences (19 vs. 2) were observed between the 2 races of O. eichingeri relative to more fixed differences than shared polymorphisms between the other contrasts. Moreover, the shared polymorphisms between them involved in 6 loci though there was no polymorphism observed at 2 loci (table 4). It is interesting that the number of shared polymorphisms was more than twice that of fixed differences (25 vs. 13) between O. eichingeri-LKA and O. rhizomatis, in contrast to the numbers (10 vs. 34) between O. eichingeri-AFR and O. rhizomatis (table 4). More shared polymorphism and less fixed differences between O. eichingeri-LKA and O. rhizomatis might indicate closer genetic affinity or hybridization/introgression between them. Because shared polymorphisms can also be generated by parallel mutations, we calculated the amount of shared polymorphisms under the assumption that mutations occur randomly and independently with equal probability at all sites to assess whether the shared polymorphisms could arise just by recurrent mutation (table 4). In all contrast pairs, the expected values of shared polymorphisms were very low (generally near zero) and comprised a small fraction of the observed number (table 4). Therefore, the probability of recurrent mutation was rather low in our data, indicating that a significant fraction of shared polymorphism could not be explained by parallel mutation (Clark 1997
The level of net divergence, the average pairwise divergence between species minus the average intraspecific pairwise variation (Nei 1987
), was also used to measure interspecific difference. Over all the 10 loci, levels of net pairwise divergence (D) among 4 taxa and between each of them and O. punctata, an outgroup used in this study, were calculated (table S2, Supplementary Material online). Estimates of net divergence were similar between each of the 4 taxa and O. punctata, with the average values over the 10 loci ranging from 0.0394 to 0.0413. The net divergence between the C-genome species and O. punctata was obviously higher than those of the 4 taxa pairs (0.00220.0072). Note that the average net divergence between African and Sri Lankan races of O. eichingeri was 3-fold lower than the estimations of the other pairs, indicating close genetic relationship between the 2 geographic races (table S2, Supplementary Material online).
Genealogical Analyses
Genealogical trees of the 10 loci were constructed for all samples using both NJ and MP methods. Several characteristics were observed from the genealogical trees as showed in the NJ trees (fig. S3, Supplementary Material online). First, sequences from the 4 taxa sufficiently diverged from the B-genome species at 8 out of 10 loci, implying that the C-genome species started to diverge relatively recently compared with its divergence from the B-genome species. The 2 exceptions involved the loci CBP1 and Waxy, in which zero-length branches were found involving the outgroup (O. punctata) samples and ingroup accessions (fig. S3b and S3j, Supplementary Material online), suggestive of the persistence of ancestral alleles together with their descendants and the derived lineages evolved from single ancestral alleles (Posada and Crandall 2001
). This deep coalescence of alleles at CBP1 and Waxy might reflect maintenance by balancing selection, in agreement with the results of neutral tests. Second, on most trees, O. officinalis sequences formed a monophyletic clade, whereas sequences of the remaining 3 taxa did not cluster by taxon despite a tendency for sequences to cluster by the taxonomic designation. This pattern suggests that coalescence for alleles at most loci occurs after divergence of O. officinalis and the common ancestor of eichingeri and O. rhizomatis. Third, alleles from eichingeri and O. rhizomatis were intermixed entirely at 9 loci, whereas accessions from 2 O. eichingeri races were basically separated into 2 groups, corresponding to the African and Sri Lankan races. It should be noted that alleles from the African O. eichingeri in Cote d'Ivoire (eic-CIV) were clustered with alleles of the Sri Lankan O. eichingeri at 8 loci (fig. S3, Supplementary Material online), implying their closer relationship. In brief, phylogenetic analyses indicate that for a particular locus some alleles in 1 taxon are more closely related to those sampled from another taxon, which occurs at different hierarchical levels. Such genealogical patterns of lack of concordance among the 10 gene trees at different taxonomical levels might result from lineage sorting and gene flow because the 4 taxa were closely related and diverged very recently (see below) and gene flow has been documented previously, at least for that between the Sri Lankan eichingeri and O. rhizomatis (Bautista et al. 2006
).
To explore the phylogenetic relationship among species, we reconstructed the phylogeny among the 4 taxa based on a combined data set of the 10 genes. Because there were 2 alleles for a heterozygote, we chose 1 allele randomly in each locus for the heterozygous individuals. Both NJ and MP analyses demonstrated essentially the same topology except for slightly different bootstrap supports for some clades (data not shown). The combined phylogeny (fig. 3) revealed that accessions from the same species formed well-supported, monophyletic clades, including a highly supported monophyly of 2 races of O. eichingeri. It is worthwhile mentioning that 2 accessions had particular positions in figure 3. One accession of the Sri Lankan O. eichingeri (eic-LKA4) was not clustered with the other O. eichingeri accessions and instead formed a clade with the O. rhizomatis accessions. Taking into consideration at individual loci, eic-LKA4 was heterozygous at 8 loci, and importantly, 1 of the alleles at all the 8 loci clustered with the O. rhizomatis alleles (fig. S3, Supplementary Material online). This phylogenetic pattern could be best explained by hybridization/introgression between the Sri Lankan O. eichingeri and O. rhizomatis (see Discussion). Within the O. eichingeri clade, an African accession (eic-CIV) formed a clade with the Sri Lankan accession before clustering with the other African accessions. Similarly, alleles of this accession were clustered with alleles of Sri Lankan O. eichingeri at 8 loci (fig. S3, Supplementary Material online), suggestive of a long-distance dispersal of the African O. eichingeri into Sri Lanka (see Discussion).
|
Testing Speciation Model and Estimating Divergence Time
Upon the assumption of constant population size in history and no gene flow between 2 descendent species since separation from an ancestral species, the simple speciation model (WH) was used to estimate the relative sizes of the ancestral and descendant populations (Wakeley and Hey 1997
2 (Kliman et al. 2000
A) were approximately 2-fold to 10-fold larger than those of either descendents (
1,
2) (table 5), indicating that the population sizes of descendent taxa might have experienced a contracting process since the time of their separation. This implies a speciation scenario in which a larger ancestral population becomes subdivided and the resulting daughters each occupies part of the ancestral species range. The exception is the contrast between the 2 races of O. eichingeri (O. eichingeri-LKA/O. eichingeri-AFR), in which the ancestral population sizes (
A = 27.874) was estimated slightly smaller than those of the 2 descendents (
1 = 36.575,
2 = 33.569). This result did not support a speciation event where a large ancestral population becomes subdivided, as would be expected by a vicariance event, but rather suggests that the formation of 2 races (O. eichingeri-LKA and O. eichingeri-AFR) might involve long-distance dispersal (see Discussion) and subsequently slight population expansion, consistent with the average negative Tajima's D value and D* value of Fu and Li (table 3).
|
To understand speciation history of O. officinalis and its close relatives, we used the molecular clock approach to estimate the time of divergence. Three loci (CBP1, Lhs1, and Waxy) were excluded from the divergence analyses because of their heterogeneous mutation rates detected by the program MEGA (data not shown) and significant deviation from neutrality in the data sets (table 2). For the remaining 7 loci, the relative-rate test (Tajima 1993
| Discussion |
|---|
|
|
|---|
Low Level of Nucleotide Variation in the C-Genome Species in Oryza
Although nucleotide variation varied 30-fold among the 10 loci, the 4 taxa in the present study possessed similar levels of nucleotide diversity (
sil = 0.00380.0042;
sil = 0.00330.0044). When the 2 races of O. eichingeri were considered together, diversity value for this species would be elevated slightly (
sil = 0.0057;
sil = 0.0052) because of population subdivision. A literature survey on the multiple gene studies of plant species has shown a wide range of nucleotide variation across species, even among closely related species (table 6). However, the Oryza species in the present study maintain apparently lower nucleotide diversity compared with the estimates of other angiosperm species based on estimates of multiple loci. As shown in table 6, the nucleotide diversity from a majority of plant species is 26 times that of the C-genome species, except for the cultivated sorghum (Sorghum bicolor) that is largely self-pollinating and has a smaller effective population size (Hamblin et al. 2004
sil = 0.0057) was only 2346% of those observed in Zea species (
sil = 0.01230.0247) and 35% of that in Arabidopsis thaliana (
sil = 0.0161), a highly self-fertilizing species (table 6). Low level of genetic variation in O. officinalis and its closely related species was previously reported using molecular markers, such as allozyme, restriction fragment length polymorphisms, and simple sequence repeats (Gao et al. 2001
|
As pointed out by Wright and Gaut (2005)
sil = 0.0095;
sil = 0.0072) in a related A-genome species (O. rufipogon) based on sequences of same set of loci (Zhu et al., 2007, table 6). It is well established that nucleotide diversity can be affected by a consequence of selection (either long-term balancing selection or a recent selective sweep). However, it is unlikely that the low diversity in the species under study was attributable to selection because statistic tests did not find overall significant deviation from neutrality except for a few of loci where both balancing selection (elevated the diversity) and selective sweep (reduced the diversity) were present (table 2).
The effects of demography on diversity and speciation have been well appreciated in plants and animals (see Machado et al. 2002
; Llopart et al. 2005
; Wright and Gaut 2005
for reviews). Ramos-Onsins et al. (2004)
conducted a comprehensive investigation on the divergence and speciation of the closely related outcrossing Arabidopsis halleri and A. lyrata based on sequence data of 38 nuclear loci and found that levels of nucleotide variation in different species reflected the differences among species in effective population size. In the present study, the speciation model test indicates that the estimated effective population sizes of all 4 taxa (
1,
2) are smaller than those of their ancestors (
A) at all comparison pairs except for the contrast between the 2 races of O. eichingeri (table 5). This result suggests that O. officinalis and its close relatives might have undergone a process of population contraction since divergence from their ancestor. This phenomenon has been evidenced from other organisms such as Drosophila, Arabidopsis, and crop species because of either a reduction of effective population size or a founder effect and bottleneck during domestication (Kliman et al. 2000
; Machado et al. 2002
; Hamblin et al. 2004
; Ramos-Onsins et al. 2004
; Städler et al. 2005
; Wright and Gaut 2005
). Consequently, the low level of nucleotide diversity in the C-genome species is most likely explained by the demographic factor, that is, a smaller historic effective population size. Recent population reduction and extinction because of habitat fragmentation and deterioration may also have led to genetic reduction for these species. For example, several studies (Gao et al. 2001
; Gao 2005
) showed that the natural populations of O. officinalis have been isolated due to habitat deterioration and human destruction, which in turn caused the spatial distribution of this species to be fragmented. Under the circumstances of the fragmented habitats and isolated populations associated with low level of migration, individual local populations are apt to extinction from stochastic processes (Amos and Harwood 1998
).
Evolutionary History and Introgression between Species
Traditionally, O. officinalis and its close relatives have been delineated on the basis of morphological characters, distinct habitats, and different geographical distributions (Tateoka 1965
; Vaughan 1989
, 1990
). Oryza officinalis and O. rhizomatis can be differentiated from O. eichingeri in that the former 2 have rhizomes (Vaughan 1990
). Compared with O. rhizomatis, O. officinalis has smaller spikelets, shorter palea tip, and more approximately equal branches from the lowest panicle node. Morphologically, O. officinalis is more similar to O. rhizomatis than to O. eichingeri (Vaughan 1990
), which was supported by phylogenetic analysis based on multiple gene sequences (Bao and Ge 2003
). In contrast, recent AFLP (Bautista et al. 2006
) and SSR (Bao et al. 2006
) analyses suggested that O. rhizomatis was more genetically similar to O. eichingeri than to O. officinalis.
In molecular phylogenetics of closely related taxa, it has been increasingly appreciated that the time back to the common ancestor of 2 DNA sequences may be longer than the time back to the common ancestor of 2 taxa (Nei 1987
; Pamilo and Nei 1988
). This phenomenon that gene divergence precedes species divergence originates from ancestral polymorphisms and will cause a high probability that gene trees disagree with species trees (Wu 1991
; Wendel and Doyle 1998
). Therefore, it is not unexpected that the phylogenetic trees based on 10 nuclear genes showed different topological relationships among species in the present study (fig. S3, Supplementary Material online). Such lineage sorting because of ancient polymorphism is more likely a source of incongruence among gene trees at lower taxonomic ranks (Wendel and Doyle 1998
). As pointed out by Klein et al. (1998)
, at most loci, differential fixation of ancestral polymorphism influences phylogenies in which divergences occur within a time interval of less than 12 Myr. Our approximate estimation of divergence times for 3 species indicates that the 2 speciation events within the C-genome group happened at such a short time interval (
0.630.68 Myr) that the polymorphisms in the ancestral population of all 3 species could persist easily from the first divergence to the second. Such molecular phenomenon associated with speciation radiations is the main reason causing incongruent topologies by different genes in this study (fig. S3, Supplementary Material online; Bao and Ge 2003
) and has been reported in other species such as human and its relatives (Enard and Pääbo 2004
), Drosophila and field cricket species complex (Wang et al. 1997
; Kliman et al. 2000
; Broughton and Harrison 2003
; Hey and Nielsen 2004
), and plant crops (Small and Wendel 2000
; Tiffin and Gaut 2001
; Clark et al. 2004
).
Therefore, a combined analysis based on multiple loci is generally needed to overcome the noises of ancient polymorphisms to accurately reconstruct a phylogeny of closely related species (Pamilo and Nei 1988
; Wu 1991
; de Queiroz et al. 1995
). In this study, the combined tree indicates clearly that O. rhizomatis and O. eichingeri form a monophyletic clade, which is sister to the group containing all O. officinalis accessions (fig. 3). This phylogenetic relationship is also supported by the analyses based on shared polymorphisms and net pairwise divergence (table 4 and table S2, Supplementary Material online), consistent with recent population-based studies using AFLP and SSR markers (Bao et al. 2006
; Bautista et al. 2006
).
In the process of speciation, shared polymorphisms in newly formed species may result both from recent divergence from a common ancestor and from gene flow or introgression between species (Machado et al. 2002
; Broughton and Harrison 2003
; Ramos-Onsins et al. 2004
). The existence of gene flow between species would alter the pattern of both within- and between-species variation (Ramos-Onsins et al. 2004
). In the present study, the null hypothesis of the isolation model was not rejected for any of 6 contrasts (table 5), suggesting that no subsequent gene flow shaped the present patterns of nucleotide variation since their divergence. In another word, gene flow or introgression among species have been limited, although the divergence of O. officinalis and its close relatives was relatively recent and extensive polymorphism has been maintained in their common ancestor. However, a hybridization event between the Sri Lankan O. eichingeri and O. rhizomatis is plausible, for interspecific cross between them has been reported by Bautista et al. (2006)
. In this study, 1 of 2 alleles from 1 Sri Lankan O. eichingeri accession (eic-LKA4) clustered with the O. rhizomatis alleles at all 8 heterozygous loci (fig. S3, Supplementary Material online). This observation strongly supports the introgression or hybridization between these 2 species because most introgressed alleles tend to co-occur in the same individuals, whereas lineage sorting makes alleles in question more randomly distributed among individuals (Wendel and Doyle 1998
). Nevertheless, there is little evidence of substantial introgression between the 2 species even though the Sri Lankan O. eichingeri and O. rhizomatis hybridize where they come into contact (Bautista et al. 2006
). To evaluate whether or not the introgressed individual (eic-LKA4) has affected the test of isolation speciation model, we recalculated the population parameters by excluding the accession eic-LKA4. Similar estimations of the population parameters were obtained and the null hypothesis of the isolation model was not rejected (data not shown), suggesting that introgression is not pervasive between these 2 species.
Genetic Differentiation between Geographic Races and Long-Distance Dispersal of O. eichingeri
Oryza eichingeri is a particularly interesting species, and there has been a considerable debate regarding its taxonomic treatment because of its remarkably disjunct distribution in Africa and Sri Lanka (fig. 1) (Nayar 1973
; Biswal and Sharma 1987
; Vaughan 1989
; Vaughan et al. 2003
). In the present study, based on sequences of 10 nuclear loci, a divergence between the 2 races of O. eichingeri was observed. To further explore the genetic differentiation of the 2 races, we performed a test of geographic subdivision (Hudson et al. 1992
) and found a significant differentiation between the 2 races for nine loci out of 10, with the Fst values ranging from 0.143 to 0.714 (average Fst = 0.442, P < 0.001). Sufficient differentiation between 2 O. eichingeri races has also been detected using molecular markers (Shcherban et al. 2001
; Federici et al. 2002
; Bao and Ge 2003
; Bao et al. 2006
). As demonstrated previously, population subdivision can elevate the level of genetic variation in species if there is no presence of gene flow (Nei and Takahata 1993
; Cherry 2004
). Because no gene flow between the geographical races was detected by speciation model test, significant genetic differentiation between the 2 races might have contributed to the highest diversity of O. eichingeri among the diploid C-genome species (table 6). High degree of intraspecific variation in O. eichingeri has also been detected at the genome level (Dally and Second 1990
).
Based on 1 sample collected from Sri Lanka, Sharma and Shastry (1965)
named a new species, Oryza collina. This treatment was subsequently followed by some authors (Nayar 1973
; Wang et al. 1992
), but retracted by some others (Biswal and Sharma 1987
; Vaughan 1990
) who considered that the Sri Lankan form was within the variation range of O. eichingeri. This study indicates clearly that 2 geographical races of O. eichingeri shares a more recent common ancestor compared with either of them to the other C-genome species based on both the shared polymorphisms and the multiple gene phylogenies (table 4 and fig. S3, Supplementary Material online). Our molecular dating provides an approximate divergence time of 0.37 Myr between the 2 races, much more recent compared with the times of other speciation events in this group (0.630.68 Myr). These results do not support treating the Sri Lankan O. eichingeri as an independent species.
Oryza eichingeri is the only wild Oryza species reported from both Asia and Africa and thus has attracted interests regarding its geographic pattern (Vaughan et al. 2003
, 2005
; Bautista et al. 2006
). This distribution pattern could result either from a vicariance where a large ancestral population becomes subdivided or from long-distance dispersal between 2 continents. The present study supports the long-distance dispersal hypothesis. First, our isolation model test revealed a smaller ancestral population size relative to those of 2 geographic races of O. eichingeri (table 5), which makes the vicariance scenario unlikely. In addition, the consistency of the negative values of the average Tajima's D and D* and F* of Fu and Li across loci (tables 2 and 3) does not corroborate the vicariance hypothesis. The long dispersal hypothesis is further supported by the observation that O. eichingeri has a similar morphology and is found in similar habitats in Africa and Sri Lanka (Vaughan et al. 2003
).
Vaughan et al. (2005)
suggested that the African O. eichingeri could be dispersed to Sri Lanka by the birds that migrated across the Indian Ocean from Africa. Bautista et al. (2006)
comparatively studied genetic diversity of the Oryza species with the A- and C-genomes in southern South Asia and speculated that the Sri Lankan O. eichingeri might have been introduced from Africa a very long time ago. The present phylogenetic analyses found a high level of divergence between the western and eastern African accessions and that alleles from the Cote d'Ivoire accession (eic_CIV) were clustered with alleles of the Sri Lankan accessions rather than with those from the Ugandan accessions on the individual trees of 8 loci and the combined tree (fig. 3 and fig. S3, Supplementary Material online). These results suggest a closer relationship between West African and the Sri Lankan O. eichingeri and imply that the Sri Lankan O. eichingeri might be originated from West Africa.
Population genetics predicts that the derived populations would harbor much reduced genetic diversity relative to the ancient population (population bottleneck). In this study, no bottleneck effect was detected for the Sri Lankan O. eichingeri because the 2 geographic races had comparable levels of average nucleotide diversity (
sil = 0.0042,
sil = 0.0033 for O. eichingeri-LKA;
sil = 0.0039,
sil = 0.0040 for O. eichingeri-AFR; table 2). This result seems odd to the long-distance dispersal from Africa to Sri Lanka unless the genetic diversity in the African O. eichingeri has been significantly decreased for some reasons and/or there have been frequent dispersal events from Africa to Sri Lanka. It should be noted, however, that the diversity level of O. eichingeri-LKA was substantially reduced (
sil = 0.0013;
sil = 0.0011) when the introgressed O. eichingeri accession (eic-LKA4) was excluded, whereas the diversity of O. eichingeri-LKA remained high if any one of the other accessions was excluded (
sil = 0.00470.0048;
sil = 0.00390.0043). This suggests that much less nucleotide diversity would be expected in the Sri Lankan O. eichingeri if the introgression between species was precluded, consistent with a long-distance dispersal from Africa to Sri Lanka. However, it is still premature to make a conclusion about the geographic history of this species before a phylogeographic study with extensive sampling is made.
| Supplementary Material |
|---|
|
|
|---|
Supplementary Tables S1 and S2 and Figures S1S3 are available at Molecular biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank Qi-Hui Zhu and Zhi-Yong Zhang for helpful suggestions on the manuscript, Jody Hey for guidance on data analysis, and Xian-Zhao Kan and the members of Ge's group for technical assistances. We are also grateful to the International Rice Research Institute (Los Banos, Philippines) for providing leaf and seed samples and Gerard Second for providing the IP7 sample. This work was supported by the National Natural Science Foundation of China (30430030 and 30121003) and the Chinese Academy of Sciences Innovation Grant.
| Footnotes |
|---|
Pekka Pamilo, Associate Editor
| References |
|---|
|
|
|---|
Aggarwal RK, Brar DS, Nandi S, Huang N, Khush GS. (1999) Phylogenetic relationships among Oryza species revealed by AFLP markers. Theor Appl Genet 98:13201328.[CrossRef][Web of Science]
Akey JM, Eberle MA, Rieder MJ, Carlson CS, Shriver MD, Nickerson DA, Kruglyak L. (2004) Population history and natural selection shape patterns of genetic variation in 132 genes. PLoS Biol 2:e286.[CrossRef][Medline]
Amos W and Harwood J. (1998) Factors affecting levels of genetic diversity in natural populations. Phil Trans R Soc Lond B 353:177186.
Ausubel FM. (1992) Short protocols in molecular biology. John Wiley & Sons, New York.
Bao Y and Ge S. (2003) Phylogenetic relationships among diploid species of Oryza officinalis complex revealed by multiple gene sequences. Acta Phytotaxon Sin 41:497508.
Bao Y and Ge S. (2004) Origin and phylogeny of Oryza species with the CD genome based on multiple-gene sequence data. Plant Syst Evol 249:5566.[CrossRef]
Bao Y, Zhou HF, Hong DY, Ge S. (2006) Genetic diversity and evolutionary relationships of Oryza species with the B- and C-genomes as revealed by SSR markers. J Plant Biol 49:339347.
Bautista N, Vaughan D, Jayasuriya A, Liyanage A, Kaga A, Tomooka N. (2006) Genetic diversity in AA and CC genome Oryza species in southern South Asia. Genet Resour Crop Evol 53:631640.[CrossRef]
Biswal J and Sharma SD. (1987) Taxonomy and phylogeny of Oryza collina. Oryza 24:2429.
Blair MW, Garris AJ, Iyer AS, Chapman B, Kresovich S, McCouch SR. (2003) High resolution genetic mapping and candidate gene identification at the xa5 locus for bacterial blight resistance in rice (Oryza sativa L.). Theor Appl Genet 107:6273.[CrossRef][Web of Science][Medline]
Broughton RE and Harrison RG. (2003) Nuclear gene genealogies reveal historical, demographic and selective factors associated with speciation in field crickets. Genetics 163:13891401.
Cherry JL. (2004) Selection, subdivision and extinction and recolonization. Genetics 166:11051114.
Clark AG. (1997) Neutral behavior of shared polymorphism. Proc Natl Acad Sci USA 94:77307734.
Clark RM, Linton E, Messing J, Doebley JF. (2004) Pattern of diversity in the genomic region near the maize domestication gene tb1. Proc Natl Acad Sci USA 101:700707.
Dally AM and Second G. (1990) Chloroplast DNA diversity in wild and cultivated species of rice (Genus Oryza, section Oryza). Cladistic-mutation and genetic-distance analysis. Theor Appl Genet 80:209222.[Web of Science]
de Queiroz A, Donoghue MJ, Kim J. (1995) Separate versus combined analysis of phylogenetic evidence. Annu Rev Ecol Syst 26:657681.[Web of Science]
Dian W, Jiang H, Chen Q, Liu F, Wu P. (2003) Cloning and characterization of the granule-bound starch synthase II gene in rice: gene expression is regulated by the nitrogen level, sugar and circadian rhythm. Planta 218:261268.[CrossRef][Web of Science][Medline]
Enard W and Pääbo S. (2004) Comparative primate genomics. Annu Rev Genomics Hum Genet 5:351378.[CrossRef][Web of Science][Medline]
Eyre-Walker A, Gaut RL, Hilton H, Feldman DL, Gaut BS. (1998) Investigation of the bottleneck leading to the domestication of maize. Proc Natl Acad Sci USA 95:44414446.
Federici MT, Shcherban AB, Capdevielle F, Francis M, Vaughan D. (2002) Analysis of genetic diversity in the Oryza officinalis complex. J Biotechnol 5:173181.
Fu YX and Li WH. (1993) Statistical tests of neutrality of mutations. Genetics 133:693709.[Abstract]
Gao LZ. (2005) Microsatellite variation within and among populations of Oryza officinalis (Poaceae), an endangered wild rice from China. Mol Ecol 14:42874297.[Medline]
Gao LZ, Ge S, Hong DY. (2000) Allozyme variation and population genetic structure of common wild rice, Oryza rufipogon Griff. in China. Theor Appl Genet 101:494502.[CrossRef][Web of Science]
Gao LZ, Ge S, Hong DY. (2001) High levels of genetic differentiation of Oryza officinalis Wall et Watt. from China. J Hered 92:511516.
Gao LZ and Zhang CH. (2005) Comparisons of microsatellite variability and population genetic structure of two endangered wild rice species, Oryza rufipogon and O. officinalis, and their conservation implications. Biodivers Conserv 14:16631679.[CrossRef]
Garris AJ, McCouch SR, Kresovich S. (2003) Population structure and its effect on haplotype diversity and linkage disequilibrium surrounding the xa5 locus of rice (Oryza sativa L.). Genetics 165:759769.
Gaut BS, Morton BR, McCaig BC, Clegg MT. (1996) Substitution rate comparisons between grasses and palms: synonymous rate differences at the nuclear gene Adh parallel rate differences at the plastid gene rbcL. Proc Natl Acad Sci USA 93:1027410279.
Ge S, Sang T, Lu BR, Hong DY. (1999) Phylogeny of rice genomes with emphasis on origins of allotetraploid species. Proc Natl Acad Sci USA 96:1440014405.
Hamblin MT, Mitchell SE, White GM, Gallego J, Kukatla R, Wing RA, Paterson AH, Kresovich S. (2004) Comparative population genetics of the Panicoid grasses: sequence polymorphism, linkage disequilibrium and selection in a diverse sample of Sorghum bicolor. Genetics 167:471483.
Hey J and Nielsen R. (2004) Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics 167:747760.
Hey J and Wakeley J. (1997) A coalescent estimator of the population recombination rate. Genetics 145:833846.[Abstract]
Hilton H and Gaut BS. (1998) Speciation and domestication in maize and its wild relatives: evidence from the Globulin-1 gene. Genetics 150:863872.
Hilton H, Kliman RM, Hey J. (1994) Using hitchhiking genes to study adaptation and divergence during speciation within the Drosophila melanogaster species complex. Evolution 48:19001913.[CrossRef][Web of Science]
Hudson RR and Kaplan NL. (1985) Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147164.
Hudson RR, Kreitman M, Aguade M. (1987) A test of neutral molecular evolution based on nucleotide data. Genetics 116:153159.
Hudson RR, Slatkin M, Maddison WP. (1992) Estimation of levels of gene flow from DNA sequence data. Genetics 132:583589.[Abstract]
Jeon J, Jang S, Lee S, et al. (11 co-authors). (2000) Leafy hull sterile1 is a homeotic mutation in a rice MADS box gene affecting rice flower development. Plant Cell 12:871884.
Judo MSB, Wedel A, Wilson C. (1998) Stimulation and suppression of PCR-mediated recombination. Nucleic Acids Res 26:18191825.
Khush GS and Brar DS. (2001) Rice genetics from Mendel to functional genomics. In Khush GS, Brar DS, Hardy B (Eds.). Rice genetics IV. Proceedings of the Fourth International Rice Genetics Symposium(IRRI, Los Banos (Phillipines)) pp. 325.
Kimura M. (1980) A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 16:111134.[CrossRef][Web of Science][Medline]
Klein J, Sato A, Nagl S, O'hUigin C. (1998) Molecular trans-species polymorphism. Annu Rev Ecol Syst 29:121.[CrossRef][Web of Science]
Kliman RM, Andolfatto P, Coyne JA, Depaulis F, Kreitman M, Berry AJ, McCarter J, Wakeley J, Hey J. (2000) The population genetics of the origin and divergence of the Drosophila simulans complex species. Genetics 156:19131931.
Kumar S, Tamura K, Nei M. (2004) MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief Bioinformatics 5:150163.
Liu A and Burke JM. (2006) Patterns of nucleotide diversity in wild and cultivated sunflower. Genetics 173:321330.
Llopart A, Lachaise D, Coyne JA. (2005) Multilocus analysis of introgression between two sympatric sister species of Drosophila: Drosophila yakuba and D. santomea. Genetics 171:197210.
Machado CA, Kliman RM, Markert JA, Hey J. (2002) Inferring the history of speciation from multilocus DNA sequence data: the case of Drosophila pseudoobscura and close relatives. Mol Biol Evol 19:472488.
Morrell PL, Lundy KE, Clegg MT. (2003) Distinct geographic patterns of genetic diversity are maintained in wild barley (Hordeum vulgare ssp. spontaneum) despite migration. Proc Natl Acad Sci USA 100:1081210817.
Morrell PL, Toleno DM, Lundy KE, Clegg MT. (2005) Low levels of linkage disequilibrium in wild barley (Hordeum vulgare ssp. spontaneum) despite high rates of self-fertilization. Proc Natl Acad Sci USA 102:24422447.
Nayar NM. (1973) Origin and cytogenetics of rice. Adv Genet 17:153292.[CrossRef][Web of Science]
Nei M. (1987) Molecular evolutionary genetics. (Columbia University Press, New York).
Nei M and Li W-H. (1979) Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc Natl Acad Sci USA 76:52695273.
Nei M and Takahata N. (1993) Effective population size, genetic diversity, and coalescent times in subdivided populations. J Mol Evol 37:240244.[Web of Science][Medline]
Nordborg M and Innan H. (2002) Molecular population genetics. Curr Opin Plant Biol 5:6973.[CrossRef][Web of Science][Medline]
Olsen KM, Caicedo AL, Polato N, McClung A, McCouch S, Purugganan MD. (2006) Selection under domestication: evidence for a sweep in the rice Waxy genomic region. Genetics 173:975983.
Olsen KM and Purugganan MD. (2002) Molecular evidence on the origin and evolution of glutinous rice. Genetics 162:941950.
Palumbi S and Baker C. (1994) Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales. Mol Biol Evol 11:426435.[Abstract]
Pamilo P and Nei M. (1988) Relationships between gene trees and species trees. Mol Biol Evol 5:568583.[Abstract]
Posada D and Crandall KA. (2001) Intraspecific gene genealogies: trees grafting into networks. Trends Ecol Evol 16:3745.[CrossRef][Medline]
Pot D, McMillan L, Echt C, Le Provost G, Garnier-Gere P, Cato S, Plomion C. (2005) Nucleotide variation in genes involved in wood formation in two pine species. New Phytol 167:101112.[CrossRef][Web of Science][Medline]
Ramos-Onsins SE, Stranger BE, Mitchell-Olds T, Aguade M. (2004) Multilocus analysis of variation and speciation in the closely related species Arabidopsis halleri and A. lyrata. Genetics 166:373388.
Rozas J, Sánchez-Delbarrio JC, Messeguer X, Rozas R. (2003) DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19:24962497.
Saitou N and Nei M. (1987) The neighbor-joining method: a new method for reconstruction phylogenetic trees. Mol Biol Evol 4:406425.[Abstract]
Sakamoto T, Miura K, Itoh H, et al. (15 co-authors). (2004) An overview of gibberellin metabolism enzyme genes and their related mutants in rice. Plant Physiol 134:16421653.
Savolainen O, Langley CH, Lazzaro BP, Freville H. (2000) Contrasting patterns of nucleotide polymorphism at the alcohol dehydrogenase locus in the outcrossing Arabidopsis lyrata and the selfing Arabidopsis thaliana. Mol Biol Evol 17:645655.
Seo HS, Kim HY, Jeong JY, Lee SY, Cho MJ, Bahk JD. (1995) Molecular cloning and characterization of RGA1 encoding a G protein subunit from rice (Oryza sativa L. IR-36). Plant Mol Biol 27:11191131.[CrossRef][Web of Science][Medline]
Shammas FV, Heikkila R, Osland A. (2001) Fluorescence-based method for measuring and determining the mechanisms of recombination in quantitative PCR. Clin Chim Acta 304:1928.[CrossRef][Web of Science][Medline]
Sharma SD and Shastry SVS. (1965) Taxonomic studies in the genus Oryza. VI. A modified classification of genus. Indian J Genet 25:173178.
Shcherban AB, Vaughan DA, Tomooka N, Kaga A. (2001) Diversity in the integrase coding domain of a gypsy-like retrotransposon among wild relatives of rice in the Oryza officinalis complex. Genetica 110:4353.[Web of Science]
Small RL, Ryburn JA, Wendel JF. (1999) Low levels of nucleotide diversity at homoeologous Adh loci in allotetraploid cotton (Gossypium L.). Mol Biol Evol 16:491501.[Abstract]
Small RL and Wendel JF. (2000) Phylogeny, duplication, and intraspecific variation of Adh sequences in New World diploid cottons (Gossypium L, Malvaceae). Mol Phylogenet Evol 16:7384.[CrossRef][Web of Science][Medline]
Städler T, Roselius K, Stephan W. (2005) Genealogical footprints of speciation processes in wild tomatoes: demography and evidence for historical gene flow. Evolution 59:12681279.[CrossRef][Web of Science][Medline]
Swofford DL. (2002) PAUP*: phylogenetic analysis using parsimony (*and other methods), Version 4.0. (Sinauer Associates, Sunderland (MA)).
Tajima F. (1989) Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585595.
Tajima F. (1993) Simple methods for testing the molecular evolutionary clock hypothesis. Genetics 135:599607.[Abstract]
Tarchini R, Biddle P, Wineland R, Tingey S, Rafalski A. (2000) The complete sequence of 340 kb of DNA around the rice Adh1-Adh2 region reveals interrupted colinearity with maize chromosome 4. Plant Cell 12:381391.
Tateoka T. (1965) A taxonomy study of Oryza eichingeri and O. punctata. Bot Mag Tokyo 78:156163.
Tenaillon MI, U'Ren J, Tenaillon O, Gaut BS. (2004) Selection versus demography: a multilocus investigation of the domestication process in maize. Mol Biol Evol 21:12141225.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. (1997) The Clustal X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25:48764882.
Tiffin P and Gaut BS. (2001) Sequence diversity in the tetraploid Zea perennis and the closely related diploid Z. diploperennis: insights from four nuclear loci. Genetics 158:401412.
Vaughan D, Kadowaki K, Kaga A, Tomooka N. (2005) On the phylogeny and biogeography of the genus Oryza. Breeding Sci 55:113122.[CrossRef]
Vaughan D, Morishima H, Kadowaki K. (2003) Diversity in the Oryza genus. Curr Opin Plant Biol 6:139146.[CrossRef][Web of Science][Medline]
Vaughan DA. (1989) The genus Oryza L: current status of taxonomy. (International Rice Research Institute, Los Banos (Phillipines)).
Vaughan DA. (1990) A new rhizomatious Oryza species (Poaceae) from Sri Lanka. Bot J Linn Soc 103:159163.[CrossRef]
Wakeley J and Hey J. (1997) Estimating ancestral population parameters. Genetics 145:847855.[Abstract]
Wang RL, Wakeley J, Hey J. (1997) Gene flow and natural selection in the origin of Drosophila pseudoobscura and close relatives. Genetics 147:10911106.[Abstract]
Wang ZY, Second G, Tanksley SD. (1992) Polymorphism and phylogenetic relationships among species in the genus Oryza as determined by analysis of nuclear RFLPs. Theor Appl Genet 83:565581.[Web of Science]
Washio K and Ishikawa K. (1994) Organ-specific and hormone-dependent expression of genes for serine carboxypeptidases during development and following germination of rice grains. Plant Physiol 105:12751280.[Abstract]
Watterson GA. (1975) On the number of segregating sites in genetical models without recombinations. Theor Popul Biol 7:256276.[CrossRef][Web of Science][Medline]
Wayne ML and Simonsen KL. (1998) Statistical tests of neutrality in the age of weak selection. Trends Ecol Evol 13:236240.[CrossRef]
Wendel JF and Doyle JJ. (1998) Phylogenetic incongruence: window into genome history and molecular evolution. In Soltis DE, Soltis PS, Doyle JJ (Eds.). Molecular systematics of plants, II: DNA sequencing(Kluwer Press, Boston (MA)) pp. 265296.
White SE and Doebley JF. (1999) The molecular evolution of terminal ear1, a regulatory gene in the genus Zea. Genetics 153:14551462.
Won YJ and Hey J. (2005) Divergence population genetics of chimpanzees. Mol Biol Evol 22:297307.
Wright SI and Gaut BS. (2005) Molecular population genetics and the search for adaptive evolution in plants. Mol Biol Evol 22:506519.
Wu CI. (1991) Inferences of species phylogeny in relation to segregation of ancient polymorphisms. Genetics 127:429435.[Abstract]
Xia X and Xie Z. (2001) DAMBE: software package for data analysis in molecular biology and evolution. J Hered 92:371373.
Yoshida K and Miyashita N. (2005) Nucleotide polymorphism in the Adh2 region of the wild rice Oryza rufipogon. Theor Appl Genet 111:12151228.[CrossRef][Web of Science][Medline]
Yu N, Jensen-Seaman MI, Chemnick L, Ryder O, Li W-H. (2004) Nucleotide diversity in gorillas. Genetics 166:13751383.
Zhu Q, Zheng X-M, Luo J-C, Gaut BS, Ge S. Forthcoming. (2007) Multilocus analysis of nucleotide variation of Oryza sativa and its wild relatives: severe bottleneck during domestication of rice. Mol Biol Evol.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Ross-Ibarra, M. Tenaillon, and B. S. Gaut Historical Divergence and Gene Flow in the Genus Zea Genetics, April 1, 2009; 181(4): 1399 - 1413. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S.S. Ammiraju, F. Lu, A. Sanyal, Y. Yu, X. Song, N. Jiang, A. C. Pontaroli, T. Rambo, J. Currie, K. Collura, et al. Dynamic Evolution of Oryza Genomes Is Revealed by Comparative Genomic Analysis of a Genus-Wide Vertical Data Set PLANT CELL, December 1, 2008; 20(12): 3191 - 3209. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Slotte, H. Huang, M. Lascoux, and A. Ceplitis Polyploid Speciation Did Not Confer Instant Reproductive Isolation in Capsella (Brassicaceae) Mol. Biol. Evol., July 1, 2008; 25(7): 1472 - 1481. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Stadler, U. Arunyawat, and W. Stephan Population Genetics of Speciation in Two Closely Related Wild Tomatoes (Solanum Section Lycopersicon) Genetics, January 1, 2008; 178(1): 339 - 350. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


, Oryza officinalis;
and
, Oryza rhizomatis. (b)
and
, Oryza eichingeri-LKA;
and
, O. eichingeri-AFR. Symbols above the line indicate that the observed values are greater than the simulated. Similarly, symbols below the line indicate that the observed values are less than the simulated.


