

MBE Advance Access originally published online on September 22, 2006
Molecular Biology and Evolution 2007 24(1):90-101; doi:10.1093/molbev/msl131

© The Author 2006. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Fossil Calibration of Molecular Divergence Infers a Moderate Mutation Rate and Recent Radiations for Pinus

Ann Willyard*, John Syring†, David S. Gernandt‡, Aaron Liston* and Richard Cronn§

* Department of Botany and Plant Pathology, Oregon State University
† Department of Biological and Physical Sciences, Montana State University
‡ Centro de Investigaciones Biológicas, Universidad Autónoma del Estado de Hidalgo, Pachuca, Hidalgo, Mexico
§ Pacific Northwest Research Station, USDA Forest Service, Corvallis, Oregon

E-mail: rcronn@fs.fed.us.


    Abstract
Silent mutation rate estimates for Pinus vary 50-fold, ranging from angiosperm-like to among the slowest reported for plants. These differences either reflect extraordinary genomic processes or inconsistent fossil calibration, and they have important consequences for population and biogeographical inferences. Here we estimate mutation rates from 4 Pinus species that represent the major lineages using 11 nuclear and 4 chloroplast loci. Calibration was tested at the divergence of Pinus subgenera with the oldest leaf fossil from subg. Strobus (Eocene; 45 MYA) or a recently published subg. Strobus wood fossil (Cretaceous; 85 MYA). These calibrations place the origin of Pinus 190–102 MYA and give absolute silent rate estimates of 0.70–1.31 × 10⁻⁹ and 0.22–0.42 × 10⁻⁹·site⁻¹·year⁻¹ for the nuclear and chloroplast genomes, respectively. These rates are approximately 4- to 20-fold slower than angiosperms, but unlike many previous estimates, they are more consistent with the high per-generation deleterious mutation rates observed in pines. Chronograms from nuclear and chloroplast genomes show that the divergence of subgenera accounts for about half of the time since Pinus diverged from Picea, with subsequent radiations occurring more recently. By extending the sampling to encompass the phylogenetic diversity of Pinus, we predict that most extant subsections diverged during the Miocene. Moreover, subsect. Australes, Ponderosae, and Contortae, containing over 50 extant species, radiated within a 5 Myr time span starting as recently as 18 MYA. An Eocene divergence of pine subgenera (using leaf fossils) does not conflict with fossil-based estimates of the Pinus–Picea split, but a Cretaceous divergence using wood fossils accommodates Oligocene fossils that may represent modern subsections. Because homoplasy and polarity of character states have not been tested for fossil pine assignments, the choice of fossil and calibration node represents a significant source of uncertainty. Based on several lines of evidence (including agreement with ages inferred using calibrations outside of Pinus), we conclude that the 85 MYA calibration at the divergence of pine subgenera provides a reasonable lower bound and that further refinements in age and mutation rate estimates will require a synthetic examination of pine fossil history.

Key Words: molecular evolution • Pinus • silent substitution rates • chronogram • fossils


    Introduction
Morphological and molecular analyses of the pine genus (Pinus; Pinaceae) reveal conflicting estimates of age and evolutionary rates that are difficult to reconcile. Pinus contains 2 monophyletic subgenera, Pinus (diploxylon or "hard pines") and Strobus (haploxylon or "soft pines"), diagnosable by 2 versus 1 fibrovascular bundle per leaf (Gernandt et al. 2005). Origin of the genus is thought to date to the Early Cretaceous (Millar 1998), whereas estimates for the divergence of the subgenera range from the Late Cretaceous (Millar 1998) to the Mid-Eocene (Miller 1976). Limited morphological differentiation in the approximately 110 species has been attributed to an exceptionally slow rate of change, but morphological homoplasy (Gernandt et al. 2005) and retention of ancestral molecular polymorphism (Syring et al. 2007) are common. Several studies have reported slow molecular divergence rates for pines (Krupkin et al. 1996; Dvornyk et al. 2002; Geada López et al. 2002; Brown et al. 2004; Sokol and Williams 2005; Ma et al. 2006), but these rates appear inconsistent with estimates of per-generation deleterious mutation rates, which are known to be at least 10-fold higher in pines than in self-compatible annual flowering plants (Kärkkäinen et al. 1996; Klekowski 1998). In contrast, mutation rate estimates comparable to angiosperms have been reported for pines based on antigenic distances (Prager et al. 1976) and for a retrotransposon (Kossack and Kinlaw 1999).

Unlike many plant groups, Pinus is prominent in paleofloras (Miller 1976; Millar 1998; Price et al. 1998), has a long history of taxonomic inquiry (Price et al. 1998; Gernandt et al. 2005), and has extensive genomic resources available (Brown et al. 2001; Temesgen et al. 2001; Chagné et al. 2003; Komulainen et al. 2003; Krutovsky et al. 2004). These enhance its usefulness for studying the interrelationship between morphological and molecular evolution. However, the paleontological data cannot be utilized to full advantage until critical tests of fossil–phylogenetic associations are conducted (e.g., Magallón and Sanderson 2005). The earliest fossil attributable to Pinus, Pinus belgica (Alvin 1960), is a Cretaceous ovulate cone apparently originating from the Wealden Formation in Belgium 145–125 MYA. Attempts to incorporate this and other Pinus fossils into molecular phylogenetic analyses have taken strikingly different approaches. For calibration, P. belgica has been placed at the divergence between Pinus and other modern Pinaceae genera (Wang et al. 2000; García-Gil et al. 2003) or at the divergence of subgenera (Sokol and Williams 2005; Ma et al. 2006). Because Alvin (1960) further ascribed P. belgica to subsect. Pinus, its age has been commonly applied to the divergence between representatives from the sections of subg. Pinus (Krupkin et al. 1996; Dvornyk et al. 2002; Geada López et al. 2002; Brown et al. 2004; Eckert and Hall 2006). Alternative calibrations used 195 MYA based on a presumed Jurassic origin of the genus that lacks explicit fossil evidence (Kutil and Williams 2001) or 45 MYA as the divergence of subgenera based on the earliest fossils representing both subgenera (Kossack and Kinlaw 1999). As a consequence of inconsistent calibration, synonymous mutation rate estimates vary 50-fold, ranging from angiosperm-like (2.8 × 10⁻⁹ substitutions·site⁻¹·year⁻¹ for "gypsy"-like retrotransposons; Kossack and Kinlaw 1999) to far slower than angiosperms (0.05 × 10⁻⁹ for Adh; Dvornyk et al. 2002). Similarly, divergence rates calculated for pine chloroplast DNA (cpDNA; 0.06 × 10⁻⁹; Krupkin et al. 1996) are among the slowest reported for any plant.

Here we evaluate hypotheses concerning the age of Pinus and explore the impact of different calibrations on estimated absolute mutation rates across 11 nuclear and 4 chloroplast loci. A phylogenetic framework was created with an exemplar from each of the 4 monophyletic sections (a "quartet" of species). Using a critical evaluation of the fossil record, we test 2 calibrations at the crown, inferred as the divergence between subg. Strobus and subg. Pinus, and represented by competing hypotheses for the oldest fossils from subg. Strobus (putative lower age of 85 MYA and an upper bound of high certainty at 45 MYA). We also use our multilocus data set to recalculate ages and rates with the calibration scenarios used in recent studies (Krupkin et al. 1996; Dvornyk et al. 2002; Geada López et al. 2002; Brown et al. 2004; Eckert and Hall 2006; Ma et al. 2006) to test whether highly heterogeneous rates are an intrinsic feature of the loci and taxa sampled in those studies or if the variation can be attributed to calibration.


    Materials and Methods
Plant Materials
Each subgenus comprises 2 monophyletic groups: sections Trifoliae and Pinus from subg. Pinus and sections Parrya and Quinquefoliae from subg. Strobus. For this study, 1 species was chosen from each section: Pinus taeda L. (Mississippi, United States) for section Trifoliae; Pinus thunbergii Parl. (Cheonnam, South Korea) or Pinus merkusii Jungh. & deVriese s.l. (Thailand) for section Pinus; Pinus monticola D. Don (Oregon, United States) for section Quinquefoliae; and Pinus nelsonii Shaw (Nuevo León, Mexico) for section Parrya. Haploid genomic DNA from seed megagametophyte tissue was isolated for amplifying nuclear DNA (nDNA) (FastDNA, Qbiogene, Irvine, CA), whereas leaf DNA was the source for cpDNA (Gernandt et al. 2005).

Loci Evaluated
Nuclear loci included in this study map to 7 of 12 linkage groups in Pinus (see table S1, Supplementary Material online). Represented are LG2, aquaporin; LG3, chlorophyll-binding protein type II precursor (LHC-CAB) and late embryogenesis abundant–like protein (LEA-like); LG5, arabinogalactan-like protein (AGP6) and ferritin; LG6, phenylalanine ammonia lyase (PAL1); LG7, 4-coumarate:CoA ligase (4CL); LG9, cinnamyl alcohol dehydrogenase (CAD) and open stomata (OST1); LG10, chloroplast-localized Cu–Zn superoxide dismutase (SODchl); and the unmapped early response to dehydration (ERD3). Loci from LG3, LG5, and LG9 are sufficiently distant such that they are effectively unlinked (Krutovsky et al. 2004). Chloroplast loci included in this study are matK, rbcL, the rpl20/rps18 intergenic spacer, and the trnV intron. See table S1, Supplementary Material online for primer and amplicon descriptions.

Amplification, Sequencing, and Analysis
Polymerase chain reaction (PCR) products were directly sequenced using BigDye v. 3.1 (Applied Biosystems, Foster City, CA) and visualized on an Applied Biosystems 3730 Genetic Analyzer. DNA alignments were made using ClustalW (Thompson et al. 1994) as implemented in BioEdit v. 7.0.1 (Hall 1999) or an iterative process of BlastN analysis (http://www.ncbi.nih.gov), followed by hand alignment, to align cDNA sequences from GenBank (http://www.ncbi.nlm.nih.gov/Genbank) to genomic sequences. Alignments were made at 3 levels: 1) species within a subgenus, that is, P. taeda to P. thunbergii and P. monticola to P. nelsonii; 2) the quartet of Pinus species, excluding unalignable regions (Syring et al. 2005); and 3) the quartet of Pinus species with an outgroup sequence. For 10 loci (4CL, AGP6, aquaporin, CAD, LHC-CAB, PAL1, matK, rbcL, rpl20/S18, and trnV; see table S1, Supplementary Material online for details), outgroup sequences from Picea were obtained using PCR (as described for ingroup sequences) or by searching GenBank for putative genomic or expressed sequence orthologs. Outgroup sequences were not isolated for ERD3 and SODchl because of concerns about the orthology of some ingroup amplicons (see Results). Amplification of the LEA-like locus failed in Picea, so a GenBank sequence from Pseudotsuga was used as the outgroup. The closest GenBank matches available for ferritin and OST1 (Arabidopsis thaliana) were used to provide a root for these 2 loci. New nucleotide sequences were submitted to GenBank (see table S2, Supplementary Material online for details); alignments are available as Supplementary Material online.

Pairwise substitution rates for silent (dS; synonymous plus noncoding) and nonsynonymous (dN) sites were calculated using DnaSP (Rozas et al. 2003) using the approximation method of Nei and Gojobori (1986) with a Jukes–Cantor correction for multiple substitutions. DnaSP (Rozas et al. 2003) was also used to calculate GC content. Models of sequence evolution (number of substitution categories, base frequencies, shape parameter, and proportion of invariant sites) were tested independently for all sites and for the silent sites (approximated by including noncoding and third-codon positions) of each locus using Modeltest 3.7 (Posada and Crandall 1998). Models selected by the hierarchical likelihood ratio tests in Modeltest 3.7 were used to obtain maximum likelihood (ML) estimates of phylogeny with PAUP* ver. 4.0b10 (Swofford 2002). To test for rate equality (i.e., clock-like evolutionary history) among lineages, we used the likelihood ratio test (Muse and Weir 1992) to compare clock-enforced and clock-relaxed likelihood scores that were obtained using the selected evolutionary models described above (1-tailed probability, α = 0.05).
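As a rough illustration of two calculations named above, the following Python sketch applies the Jukes–Cantor correction to an observed proportion of differing sites and evaluates a likelihood ratio test of a clock-enforced against a clock-relaxed tree. It is not the DnaSP or PAUP* implementation. The function names, the example likelihood scores, and the use of n − 2 degrees of freedom for n sequences are assumptions made for illustration only.

```python
import math

def jukes_cantor(p):
    """Jukes-Cantor corrected distance for an observed proportion p of differing sites."""
    if not 0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75)")
    return -0.75 * math.log(1 - 4 * p / 3)

def chi2_sf(x, df):
    """Upper-tail chi-square probability for a positive integer df (stdlib only)."""
    z = x / 2
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)              # closed form for df = 2
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))   # closed form for df = 1
    # Step up two degrees of freedom at a time using the incomplete-gamma recurrence.
    while a < df / 2 - 1e-9:
        q += z ** a * math.exp(-z) / math.gamma(a + 1)
        a += 1
    return q

def clock_lrt(lnl_clock, lnl_relaxed, n_sequences):
    """Return (test statistic, P value) for clock-enforced vs. clock-relaxed scores.

    Assumption: df = n_sequences - 2, the usual difference in free branch-length
    parameters between an unconstrained tree and a clock tree.
    """
    stat = max(2 * (lnl_relaxed - lnl_clock), 0.0)
    return stat, chi2_sf(stat, n_sequences - 2)

# Hypothetical log-likelihood scores for a quartet plus one outgroup (5 sequences).
stat, p_value = clock_lrt(-1002.0, -1000.0, 5)
print(f"LRT statistic = {stat:.2f}, P = {p_value:.3f}")
print(f"JC distance for p = 0.10: {jukes_cantor(0.10):.4f}")
```

With these made-up scores the clock is not rejected at α = 0.05; real scores would come from the ML analyses described above.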

A partition homogeneity test (Cunningham 1997; 100 replicates using a heuristic search) was used to test for significant conflict between loci (P = 0.01) for all sites as well as silent sites from 3 different data sets (see table S3, Supplementary Material online for details). The 3 data sets are 1) 9 nuclear loci with different outgroup species (Picea, Pseudotsuga, or Arabidopsis; 4CL, AGP6, aquaporin, CAD, ferritin, LEA-like, LHC-CAB, OST1, PAL1; 8,346 bp; 6,098 silent bp); 2) 5 nuclear loci with Picea as the outgroup (4CL, AGP6, aquaporin, CAD, LHC-CAB; 3,973 bp; 2,348 silent bp); and 3) 4 chloroplast loci with Picea as the outgroup (matK, rbcL, rpl20/S18, trnV; 3,903 bp; 1,914 silent bp). Because data sets did not exhibit significant conflict, they were concatenated, and models of sequence evolution were selected for concatenated data sets as described above. To investigate the impact of increased taxon density on rate and divergence estimates, we examined data from Syring et al. (2005) that include 4 loci evaluated in this study (4CL, AGP6, LEA-like, LHC-CAB; 5,338 bp; 3,766 silent bp) for 12 pine species serving as exemplars for subsections with Picea as the outgroup. Evolutionary model selection, likelihood ratio tests, and partition homogeneity tests were performed on each locus and on the concatenated silent sites as described above.

Fossil Calibration
Because all pine fossil reports that were considered as calibration sources lack radiometric dating, ages were adjusted to the midpoint of the currently assigned range for their geological Epoch or Stage (Gradstein and Ogg 2004). Pinus pollen, pollen cones, and ovulate cone casts lack diagnostic characters for subgeneric identification (Miller 1976; Phipps et al. 1995), so only anatomical reports from ovulate cones, leaves, and wood were considered for calibration sources. The earliest Pinus fossil, P. belgica (145–125 MYA; Alvin 1960), shows affinities to subg. Pinus (Miller 1976) and has been considered part of subsect. Pinus (Millar 1998). However, the collection locality for P. belgica (Wealden Formation, Belgium) was inferred from the lignitic state and adhering particles (Alvin 1960), leaving its geographic and stratigraphic origins uncertain. There is an approximate 35-Myr gap until the next Pinus fossils (100–75 MYA ovulate cones from North America, Japan, and Europe), and these fossils also have affinity to subg. Pinus (Fliche 1896; Alvin 1960; Robison 1977; Blackwell 1984; Miller and Malinky 1986; Stockey and Nishida 1986; Saiki 1996). Because these early fossils show symplesiomorphic features and lack synapomorphies to support infrageneric nodes, they cannot be used as reliable calibration sources (Magallón and Sanderson 2001).

The node corresponding to the divergence of the subgenera is well supported with both morphological and molecular synapomorphies (Gernandt et al. 2005). Because all of the oldest Pinus fossils have affinity to subg. Pinus, the first fossil representing subg. Strobus supports the minimum age of divergence of the subgenera. The appearance of subg. Strobus dates to the Late Cretaceous based on permineralized wood anatomy (Santonian, 83.5–85.8 MYA, midpoint approximately 85 MYA, Pinuxylon sp.; Meijer 2000), or to the Mid-Eocene based on either leaf anatomy (37.2–48.6 MYA, midpoint approximately 45 MYA, Pinus similkameenensis; Miller 1973), or ovulate cones (ca. 43 MYA; Axelrod 1986). It is important to note that Cretaceous fossils have been attributed to subg. Strobus (Jeffrey 1908; Stopes and Kershaw 1910; Penny 1947), but these fossils are not useful for calibration because they have been reassigned to subg. Pinus or to other genera (see Discussion). Therefore, we calibrated the divergence of the subgenera with a putative lower age of 85 MYA based on wood anatomy and an upper age of 45 MYA based on leaves and ovulate cones.

Rates and Ages
Absolute rates of silent-site changes in the concatenated data sets were estimated using 2 approaches. First, we computed silent-site divergence (dS) for comparisons between subgeneric representatives using 9 nuclear and 4 chloroplast loci. Lower and upper rate estimates (i.e., 2 calibration ages) were calculated using the formula µ = dS/(2T), where µ is the silent divergence·site⁻¹·year⁻¹, dS is the mean of silent substitutions·site⁻¹ (weighted by number of sites in each locus), and T is time in years. Second, we calculated ML branch lengths (using the selected model of sequence substitution described above; see table S3, Supplementary Material online for details) from clock-enforced phylograms using silent sites from pine quartet alignments of nDNA and cpDNA. Silent-site data sets were used for age inference to maintain consistency with silent rate estimation and to minimize the potential for selection to distort the evolutionary history or rate at a locus. Ages were also inferred using "all" nucleotide sites to provide corroboration of silent-site estimates. To account for departures from clock-like behavior in the 12-species data set, we used a non–clock-enforced tree based on silent sites to assign dates and to estimate a range of local rates using penalized likelihood (PL), with a smoothing factor estimated using cross-validation (r8s ver. 1.70; Sanderson 2002).
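The first approach can be sketched in a few lines of Python. The factor of 2 in µ = dS/(2T) reflects that both lineages accumulate substitutions independently after they diverge. The per-locus dS values and site counts below are hypothetical placeholders, not the values from table 1, and the function names are illustrative.

```python
def weighted_mean_ds(loci):
    """Mean dS weighted by silent-site count; loci is a list of (dS, n_silent_sites)."""
    total_sites = sum(n for _, n in loci)
    return sum(ds * n for ds, n in loci) / total_sites

def silent_rate(ds, t_years):
    """Absolute silent rate mu = dS / (2T), in substitutions per site per year."""
    return ds / (2 * t_years)

# Hypothetical loci: (pairwise dS between subgenera, number of silent sites).
loci = [(0.10, 500), (0.14, 300), (0.12, 200)]
mean_ds = weighted_mean_ds(loci)

# The two calibration ages tested for the divergence of the subgenera.
for age_mya in (85, 45):
    mu = silent_rate(mean_ds, age_mya * 1e6)
    print(f"T = {age_mya} MYA: mu = {mu:.2e} per site per year")
```

Because µ is inversely proportional to T, moving the calibration from 85 to 45 MYA raises every rate estimate by the same factor of 85/45, or about 1.9.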


    Results
Sequence Variation across Loci, Genomes, and Lineages
Alignments of 11 nuclear and 4 chloroplast loci from 4 Pinus species included 11,481 bp and 10,643 bp from subg. Pinus and Strobus, respectively (table 1). Mean lengths of nuclear sequences were shorter than chloroplast sequences (690 bp vs. 972 bp in subg. Pinus), but the larger sample of nuclear loci yielded a substantially larger nDNA data set (e.g., 7,594 bp nDNA vs. 3,887 bp cpDNA in subg. Pinus). In total, 65% of nDNA and 77% of cpDNA was exonic, whereas the noncoding portion included introns (4CL, aquaporin, CAD, ERD3, ferritin, LEA-like, OST1, SODchl, and trnV), 5' (OST1) and 3' (ferritin) untranslated regions, and an intergenic spacer (rpl20/S18). Noncoding regions provided 51% of nDNA silent sites (1,959 bp vs. 1,882 bp at third-codon positions) and 40% of cpDNA silent sites (876 bp vs. 1,289 bp at third-codon positions). Nine of the mapped nuclear loci lacked heterozygosity when amplified from haploid megagametophyte tissue, providing evidence for target specificity and orthology. Two loci—SODchl and ERD3—showed evidence of paralogy and were excluded from rate analyses. These loci were unexpectedly heterogeneous in subg. Pinus, indicative of gene duplication or nonspecific priming. The SODchl sequence from P. monticola also lacked 4 expected introns, raising the possibility that it is a reverse transcribed pseudogene. GC% showed considerable variation across loci. For nDNA, the average GC content was 47.3% (range = 32.9–66.7%), with coding slightly higher (51.3%; range = 40.5–67.0%) and noncoding considerably lower (35.8%; range = 23.6–44.5%). For cpDNA, the average GC content was 37.6% (range = 31.4–45.4%), and the GC content in coding (37.8%; range = 32.5–45.4%) and noncoding (36.2%; range = 34.4–39.2%) regions were very similar.



 
Table 1 Length (L), Noncoding Length (NC), Silent (dS; synonymous plus noncoding), and Nonsynonymous (dN) Substitutions per Site across 11 Nuclear and 4 Chloroplast Genes within Subg. Pinus, Subg. Strobus, and across Subgenera

 
Sequence divergence in nDNA and cpDNA showed 2 important trends. First, although silent substitutions per site (dS) averaged approximately 3-fold higher in nDNA than cpDNA, mean nonsynonymous substitutions per site (dN) are almost identical among 8 nuclear (0.020) and 4 chloroplast (0.021) loci (table 1). Second, divergence at nDNA was significantly greater between representatives of subg. Strobus (mean dS = 0.063) than representatives of subg. Pinus (mean dS = 0.042; F = 15.19, P = 0.001). This trend was apparent in cpDNA, but the difference was not significant (dS Strobus = 0.015, dS Pinus = 0.010; F = 0.982, P = 0.359).

Fossil-Calibrated Mutation Rates and Divergence Dates
Single-locus ML phylogenies (results not shown) using "all" characters in the quartet alignments showed the expected species relationships in all cases except OST1 (section Quinquefoliae is sister to section Parrya and subg. Pinus) and ferritin (ingroup nodes were unresolved). Rooting of these 2 loci with Arabidopsis is likely responsible for the topological differences, and we excluded these loci from our chronograms. Rate equivalence (as inferred by the likelihood ratio test) was statistically supported for all individual loci except LHC-CAB, ferritin, and PAL1 using all characters and for all individual loci except PAL1 using "silent" sites (see table S3, Supplementary Material online for details). A concatenated alignment of the 5 clocklike loci that share Picea as outgroup (4CL, AGP6, aquaporin, CAD, and LHC-CAB) exhibited rate equality using either silent sites or all sites (see table S3, Supplementary Material online for details). Partition homogeneity tests among these loci (data not shown) indicated that these data sets do not reflect conflicting topologies, supporting concatenation. Rate equivalence was supported for all sites as well as silent sites for each cpDNA locus and for a concatenated cpDNA alignment (see table S3, Supplementary Material online for details). Rates were inferred from silent sites for each genome; ages were calculated using all-site and silent-site data sets.

Based on a calibration with 85 or 45 MYA, absolute silent mutation rates (µ) for 9 nuclear loci average 0.70 or 1.31 × 10⁻⁹ silent substitutions·site⁻¹·year⁻¹ for nDNA, and 0.22 or 0.42 × 10⁻⁹ for cpDNA (table 2). To address the impact of unequal base frequencies and among-site rate heterogeneity on rate estimates, the absolute silent mutation rate (µ) was also calculated from ML branch lengths for each calibration test (table 3). ML-based rates are marginally higher than dS-based rates for nDNA (µ = 0.75 or 1.41 × 10⁻⁹), whereas cpDNA rates are almost identical (µ = 0.21 or 0.40 × 10⁻⁹).



 
Table 2 Estimated Absolute Mutation Rates (µ; substitutions·site⁻¹·year⁻¹) Based on Silent (dS) and Nonsynonymous (dN) Substitutions in Comparisons between subg. Pinus and subg. Strobus. Rates Are Averages of 9 Nuclear Loci or 4 Chloroplast Loci, Using Divergence Dates of 85 or 45 MYA. Values in Parentheses Are 1 SD

 


 
Table 3 Estimated Divergence Dates for Pinus Nodes Shown in Figure 1 Based on Silent Sites (or all sites) from nDNA or Chloroplast DNA. Calibration Is at Node C at 85 or 45 MYA. Ages and Silent Mutation Rates (µ) Are Estimated Using ML Branch Lengths for Data Showing Rate Equality or Using PL for the Nonclocklike 12-Taxon Data set

 
Chronograms based on silent sites from nDNA and cpDNA reveal identical topologies, but disparities in the branch lengths (especially at deeper nodes) yield different estimated divergence dates for Pinus lineages (fig. 1 and table 3). Most notably, estimates from the 5-locus nDNA data set predict older divergence events between Pinus and Picea (190–102 MYA) than does cpDNA (164–136 MYA). Both genomes show sections of subg. Strobus diverging before those of subg. Pinus, although nDNA estimates are more ancient (48–25 for subg. Strobus vs. 30–16 MYA for subg. Pinus) than those from cpDNA (37–19 MYA for subg. Strobus vs. 25–13 MYA for subg. Pinus). Divergence dates inferred from the same data sets using all nucleotides were nearly identical at sectional divergences but somewhat younger for the Pinus–Picea node (table 3). Divergence dates were also estimated using silent sites in the clock-enforced data set that included 4 additional nuclear loci (ferritin, LEA-like, OST1, and PAL1; 6,098 bp). The use of different outgroups for 3 of these loci precluded us from estimating the divergence of Pinus from Picea (node D), but estimated sectional divergences within subg. Pinus (31–16 MYA; node A) and subg. Strobus (48–25 MYA; node B) were not different from estimates based on 5 clock-like loci (table 3).


 
FIG. 1.— ML chronograms of major lineages of Pinus using silent sites from 5 nuclear loci (a, b; 2,348 bp) or 4 chloroplast loci (c, d; 1,914 bp). Calibration is at node C with either 45 MYA, based on leaves (a, c), or 85 MYA, based on wood (b, d). Branch lengths are shown above each branch, and estimated ages are shown below each node. See Supplementary Material online for tree statistics.

 
Cross-validation yielded an optimum smoothing factor of 63 for PL in the 12-taxon data set. In general, results from PL indicate more recent divergence events than predicted from the analysis of only 4 exemplar species (fig. 2 and table 3). For example, using an 85 MYA calibration, subg. Strobus sections are predicted to have diverged 37 MYA or 10 Myr more recently than predicted from 4-taxon comparisons (fig. 1). Similarly, subg. Pinus sections are predicted to have diverged 28 MYA, which is 2 Myr more recent than indicated by 4-taxon estimates (fig. 1). An important finding highlighted by this additional taxon sampling is that modern pine subsections radiated in a narrow time span in the relatively recent past. For example, using the 85 MYA calibration, the 3 subsections of Trifoliae (fig. 2b, nodes E–G), with approximately 51 extant species, radiated within a 5 Myr time span starting 18 MYA.


 
FIG. 2.— PL chronograms of 12 taxa using silent sites from 4 nuclear loci (3,766 bp), based on calibration at node C with a) 45 MYA or b) 85 MYA. Estimated ages are shown below each node. See Supplementary Material online for tree statistics. Nodes E–I represent subsections referred to in the text: E, Australes; F, Ponderosae; G, Contortae; H, Strobus; and I, Balfourianae/Cembroides. All nodes have greater than 70% bootstrap support except for nodes E and F, which collapse in the strict consensus tree.

 

    Discussion
Mutation rate estimates are used widely for hypothesis testing in molecular, population, and evolutionary genetic studies (Muse 2000), and they are increasingly used to estimate genetic parameters of conifers (Dvornyk et al. 2002; García-Gil et al. 2003; Brown et al. 2004; Ma et al. 2006). Numerous estimates of mutation rates have been made for angiosperms (e.g., Wolfe et al. 1987, 1989; Gaut et al. 1996; Koch et al. 2000; Clark et al. 2005), but comparable conifer estimates are limited to studies of single-locus variation across multiple species (Geada López et al. 2002), multilocus variation in a single species (García-Gil et al. 2003; Brown et al. 2004), or multilocus variation in a few closely related species (Ma et al. 2006). The perspective added by our synthesis of multiple nuclear and chloroplast loci based on exemplar taxa and 2 fossil calibration points indicates that divergence times separating pine lineages have been frequently overestimated, with a concomitant underestimation of absolute mutation rates (Krupkin et al. 1996; Dvornyk et al. 2002; Geada López et al. 2002; Brown et al. 2004; Sokol and Williams 2005; Ma et al. 2006). This new perspective supports a relatively recent radiation of extant pine sections in the Early Miocene, a hypothesis that has been proposed (e.g., Miller 1973; Strauss and Doerksen 1990) (but often ignored) to explain the modest morphological and genetic divergence between pine species.

Sources of Error
Confidence intervals (CI) for molecular clock estimations must consider the simultaneous uncertainties of rate and time that are confounded within divergences, as well as the age and placement of calibration points. Because fossils document minimum divergence times, the elapsed time from taxon origin to the earliest fossil cannot be known. Additional uncertainty is introduced by variation in deposition, discovery, and the quality of our understanding of the age of the source formation as well as the taxonomic affinity of the fossil. There are promising approaches for estimating CI for divergence times (e.g., Kumar et al. 2005; Yang and Rannala 2006), but these will require multiple robust fossil calibrations for Pinus. In addition, molecular divergence rates may vary across the genome and across lineages. We addressed this complex problem using several approaches to provide a perspective on the relative impact of various factors on the error ranges of our rate estimates.

First, both of the fossils selected for calibration lack radiometric dating. Because midpoints of the ranges for geological Epochs or Stages were used to create the chronograms (figs. 1 and 2 and table 3), we recalculated divergence dates using the upper and lower bounds of these periods. The stratigraphic age ranges for wood from the Santonian Pinuxylon sp. (Meijer 2000) and leaves from the Middle Eocene P. similkameenensis (Phipps et al. 1995) are currently considered to be 83.5–85.8 MYA, and 37.2–48.6 MYA, respectively (Gradstein and Ogg 2004). The inferred age ranges for node D (cf. table 3), using these stratigraphic ranges to calibrate at node C, are 187–192 MYA and 83–109 MYA for calibrations based on wood and leaves, respectively. Inferred age ranges for node A are 29.6–30.4 and 13–17 MYA for calibrations based on wood and leaves, respectively, whereas node B ages are 47–48 and 21–27 MYA, respectively. Clearly, the use of midpoint ages introduces uncertainty in our estimates, particularly for Eocene fossils because that Epoch is much longer than the Santonian (11.4 vs. 2.3 Myr). Nonetheless, this uncertainty is small relative to the differences in alternative calibration points suggested by fossil wood (85 MYA) versus fossil leaves (45 MYA) for the origin of Pinus subgenera.
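On a clock-enforced tree, every node age is a fixed multiple of the calibration age, so the recalculation described above amounts to a proportional rescaling. The Python sketch below takes the node D age reported for the 85 MYA calibration (190 MYA) and rescales it to each stratigraphic bound. The function name is illustrative, and the assumption of strict proportionality applies only to the clock-enforced chronograms, not to the PL analysis.

```python
def rescale_age(node_age, calib_used, calib_new):
    """Rescale a node age when the calibration age changes (strict-clock assumption)."""
    return node_age * calib_new / calib_used

NODE_D_AT_85 = 190.0  # MYA, Pinus-Picea split under the 85 MYA calibration (from the text)

# Stratigraphic bounds (MYA) for the two candidate calibration fossils, as quoted above.
bounds = {"wood (Santonian)": (83.5, 85.8), "leaves (Middle Eocene)": (37.2, 48.6)}

for fossil, (young, old) in bounds.items():
    low = rescale_age(NODE_D_AT_85, 85.0, young)
    high = rescale_age(NODE_D_AT_85, 85.0, old)
    print(f"{fossil}: node D = {low:.0f}-{high:.0f} MYA")
```

Under this assumption the sketch gives ranges that round to 187–192 MYA for wood and 83–109 MYA for leaves, in line with the values reported for node D.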

Second, the use of the mean substitution rates from multiple loci reduces stochastic deviation resulting from rate variation among genomic regions, and limiting the sample to silent sites reduces the potential for selection to influence rates. To assess the amplitude of this variation among our 9 nuclear loci, we also calculated absolute substitution rates using the mean dS plus and minus 1 standard deviation (SD). This results in µ = 0.43–0.96 × 10⁻⁹ for a calibration based on wood and 0.81–1.81 × 10⁻⁹ for leaves (cf. table 2). We also projected the "combined" effects of locus-specific rate variation (mean dS plus 1 SD and the upper bound for the stratigraphy; dS minus 1 SD and the lower bound) for each calibration. This results in "inclusive" ranges of µ = 0.42–0.98 × 10⁻⁹ for a calibration based on wood and 0.75–2.2 × 10⁻⁹ for leaves (cf. table 2). Among the 9 loci that were used for rate calculation, distortion may have been introduced by our use of different outgroups for ferritin, LEA-like, and OST1 (see table S2, Supplementary Material online for details). Further, PAL1 silent sites showed a significant departure from rate equivalence even with Picea as the outgroup. However, ages and rates inferred from the 5- and 9-locus data sets (table 3) are largely consistent. Further, data accumulated for 50 Pinus loci (Cronn R, unpublished data) show a median silent rate that is very similar to the silent rate described in these results. This implies that despite locus-specific rate variation, our 5-locus data set is useful as a first approximation of µ in Pinus.
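As a rough illustration of how the calibration age enters these rates, the sketch below assumes the standard pairwise relation µ = dS / (2T), where T is the age of the split. The mean dS used here is not taken from table 2; it is back-calculated from the quoted rate of 0.70 × 10⁻⁹ at 85 MYA and should be treated as an assumption, as should the function names.

```python
def silent_rate(d_s, age_mya):
    """Absolute rate (substitutions/site/year) from pairwise silent divergence.

    Assumes mu = dS / (2T): two lineages each accumulate changes for T years.
    """
    return d_s / (2.0 * age_mya * 1e6)


def implied_divergence(mu, age_mya):
    """Invert the same relation: pairwise dS implied by a rate and a split age."""
    return 2.0 * mu * age_mya * 1e6


# ASSUMPTION: mean dS back-calculated from mu = 0.70e-9 at the 85 MYA calibration.
mean_ds = implied_divergence(0.70e-9, 85.0)

# The same divergence calibrated with the younger (45 MYA) leaf fossil.
mu_leaves = silent_rate(mean_ds, 45.0)

# Effect of the Santonian stratigraphic bounds on the wood-based rate.
mu_wood_slowest = silent_rate(mean_ds, 85.8)  # oldest bound gives the slowest rate
mu_wood_fastest = silent_rate(mean_ds, 83.5)  # youngest bound gives the fastest rate

if __name__ == "__main__":
    print(f"implied mean dS: {mean_ds:.3f}")
    print(f"rate at 45 MYA: {mu_leaves:.2e}")
    print(f"wood range: {mu_wood_slowest:.2e} to {mu_wood_fastest:.2e}")
```

The rate is inversely proportional to the calibration age, so moving from 85 to 45 MYA raises the rate by a factor of 85/45, and an older stratigraphic bound always yields a slower rate for the same dS.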

Finally, we derived CIs for age estimates using the method of Haubold and Wiehe (2001), which uses nonoverlapping pairs of phylogenetic distances to infer the unknown mutation rate for the other pair. Using this method at node A (fig. 1b), with subg. Strobus as the reference and ages calculated from the 85 MYA crown calibration, yields 95% CIs of 14–67 MYA for the divergence of subg. Pinus. Applying this technique to node B (with subg. Pinus as the reference) yields intervals of 22–106 MYA for subg. Strobus. The magnitude of these intervals far exceeds error ranges introduced by stratigraphic ranges, intralocus rate variation, or even fossil choice between permineralized wood (85 MYA) or leaves (45 MYA). These large uncertainties clearly highlight the challenge inherent in the simultaneous uncertainties of rate and time over evolutionary time scales.

Although the magnitude of these sources of error highlights the importance of considering molecular rate variation and stratigraphic uncertainty, past calibration scenarios applied to Pinus show that incorrect fossil assignment can be a far more dramatic source of error. For example, the common practice of using P. belgica to calibrate the divergence of sections within subg. Pinus (node A, fig. 3e) pushes the divergence of pine subgenera (node C) to 339 MYA and the divergence of Pinus–Picea (node D) to 758 MYA in the Precambrian. The attendant silent mutation rate in the nuclear genome is exceptionally slow (µ = 0.19 × 10⁻⁹). This unrealistic (but commonly cited) calibration is clearly responsible for many of the exceptionally low µ values reported for pine nDNA and cpDNA (Krupkin et al. 1996; Dvornyk et al. 2002; Geada López et al. 2002; Brown et al. 2004; Ma et al. 2006). This calibration can also cause substantial distortion in biogeographical interpretations. For example, Eckert and Hall (2006) recently hypothesized dispersal and vicariance events using P. belgica to calibrate the divergence of sections within subg. Pinus (node A, fig. 3f) in a cpDNA data set. In doing so, they fail to consider the unrealistic estimate that this calibration indicates for the divergence of Pinus and Picea (i.e., 720 MYA) based on our clock-like cpDNA branch lengths (fig. 3f). In the same manner, associating P. belgica with the divergence of pine subgenera also conflicts with the fossil record. Such a calibration (e.g., 136 MYA, Sokol and Williams 2005; 130 MYA, Ma et al. 2006) indicates a Pinus–Picea split of approximately 300 MYA (fig. 3d) and produces absolute silent mutation rates of µ = 0.47–0.49 × 10⁻⁹. These projected ages for Pinus (Carboniferous or Early Permian, respectively) predate P. belgica by more than 150 Myr. In summary, the errors resulting from incorrect fossil/node association far exceed the cumulative error estimated with CIs that account for the confounding effect of simultaneous rate and time variation. In contrast, using P. belgica to represent the divergence of Pinus and Picea in our clock-like data set (node D, fig. 3c; Wang et al. 2000; García-Gil et al. 2003) places the divergence of Pinus subgenera at 63 MYA. This estimate lies between the earliest putative subg. Strobus wood and leaf fossils.


FIG. 3.— Application of some recently published fossil calibration scenarios (see Discussion), using clock-enforced branch lengths (shown in fig. 1) from our nDNA or cpDNA data sets. ML-projected ages are shown below each node; ML-calculated silent rates are given for each scenario. Node letters correspond to figure 1.

 
Implications of a Moderate Tempo for Pine Mutation Rates
Several authors have argued that a high mutation rate is necessary to account for the extremely high level of inbreeding depression found in conifer species capable of partial self-fertilization (Lande et al. 1994; summarized in Scofield and Schultz 2006). Because plants do not segregate a germ line, somatic mutations may accumulate in meristems and be incorporated into gametes. Hence, longevity and large stature contribute to a large number of mitoses, both of which may serve to elevate the per-generation mutation rate (Scofield and Schultz 2006). Evidence for a high per-generation mutation rate in Pinus compared with other plant groups is provided by observations of the frequency of chlorophyll-deficient mutants. Assuming an equal number of loci capable of mutating to chlorophyll deficiency and equivalent lengths across loci, the per-generation rate of deleterious mutations for Pinus sylvestris (U = 1–3 × 10⁻²; Kärkkäinen et al. 1996) is comparable to the mutation rate for Rhizophora mangle, another long-lived woody perennial (U = 1.5 × 10⁻²; Klekowski and Godfrey 1989). These mutation rates are approximately 100-fold higher than the average for 10 annual flowering plant species (U = 1–3 × 10⁻⁴; Klekowski 1992).

Mutation rates in Pinus can be evaluated by comparison with absolute rates derived from fossil calibrations in other plant groups. Our 85 MYA calibration provides an estimate for nDNA (µ = 0.70 × 10⁻⁹ synonymous substitutions·site⁻¹·year⁻¹; table 2) that is 14-fold higher than previous Pinus estimates (Dvornyk et al. 2002), but approximately 4-fold slower than the rate for palms (µ = 2.61 × 10⁻⁹; Gaut et al. 1996) and 7- to 40-fold slower than the range reported for herbaceous angiosperms (µ = 5–33 × 10⁻⁹; Gaut et al. 1996; Koch et al. 2000; Clark et al. 2005). Similarly, our estimated rates for chloroplast silent sites (µ = 0.22 × 10⁻⁹ synonymous substitutions·site⁻¹·year⁻¹; table 2) are approximately 6-fold slower than those reported for angiosperm cpDNA (µ = 1.1–1.6 × 10⁻⁹; Wolfe et al. 1987). Expressed on a per-year basis, these comparisons suggest that substitution rates in Pinus are still far slower than those of most angiosperms. In this context, it is important to note that molecular substitution rates can also show a "generation-time effect" (e.g., Gaut et al. 1992, 1996; Kay et al. 2006). Based on our estimate of the average rate (µ = 0.70 × 10⁻⁹ synonymous substitutions·site⁻¹·year⁻¹) and correcting for longevity (assuming a 25-year generation time; Brown et al. 2004), the per-generation nuclear substitution rate for Pinus averages 1.75 × 10⁻⁸ substitutions·site⁻¹·generation⁻¹. This rate is nearly equivalent to the 1.5 × 10⁻⁸ substitutions·site⁻¹·year⁻¹ rate inferred for short-lived Brassicaceae (Koch et al. 2000). The near-equivalence between these values closes the gap between gymnosperm and angiosperm rates relative to prior rate estimates (e.g., Dvornyk et al. 2002; Brown et al. 2004), but the per-generation rates still fall short of the 100-fold difference that might be expected based on chlorophyll-deficiency mutations.
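The generation-time correction above is simple arithmetic, sketched here using only values quoted in the paragraph. The helper names are illustrative, and treating the Brassicaceae per-year rate as a per-generation rate assumes an annual life cycle (one generation per year).

```python
MU_PINUS_PER_YEAR = 0.70e-9     # nuclear silent rate from the 85 MYA calibration
GENERATION_TIME_YEARS = 25      # generation time assumed in the text
MU_PALMS_PER_YEAR = 2.61e-9     # palm rate quoted in the text
MU_BRASSICACEAE = 1.5e-8        # per year; ASSUMED equal to per generation for annuals


def per_generation_rate(mu_per_year, generation_time_years):
    """Convert a per-year substitution rate to a per-generation rate."""
    return mu_per_year * generation_time_years


def fold_difference(faster, slower):
    """How many times larger the faster rate is than the slower one."""
    return faster / slower


mu_pinus_per_generation = per_generation_rate(MU_PINUS_PER_YEAR, GENERATION_TIME_YEARS)
palm_fold = fold_difference(MU_PALMS_PER_YEAR, MU_PINUS_PER_YEAR)
brassicaceae_fold = fold_difference(mu_pinus_per_generation, MU_BRASSICACEAE)

if __name__ == "__main__":
    print(f"Pinus per generation: {mu_pinus_per_generation:.3g}")
    print(f"palms vs. Pinus (per year): {palm_fold:.1f}-fold")
    print(f"Pinus vs. Brassicaceae (per generation): {brassicaceae_fold:.2f}-fold")
```

On a per-generation basis the two rates differ by a factor close to 1, whereas the per-year comparison with palms differs by a factor close to 4, consistent with the comparisons drawn in the text.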

Clearly, these calculations hinge on an appropriate association between a calibration time and a phylogenetic node; the choice of a node with which to associate a fossil is the largest determinant of the rate. The availability of Pinus fossils with synapomorphies supporting the divergence of Pinus subgenera (a well-supported node in a molecular phylogeny) is more robust than options available for many plant groups, and we suggest that this will be an important factor for future, more refined, rate comparisons. This study illustrates the variability of divergence rates among loci because dS for 9 nuclear loci ranges from 0.063 to 0.205 (table 1). Even though fossil dates were used as fixed calibration points in our calculations, each fossil represents a "minimum" age for a lineage. It follows then that the rates calculated from mean dS (µ = 0.70–1.31 × 10⁻⁹ substitutions·site⁻¹·year⁻¹; table 2) represent "maximum" absolute rates, although the same caveat applies to angiosperm rates used for comparison.

Our revised estimates of mutation rates in pines have important implications for studies of population and molecular genetic parameters. For example, an effective population size (Ne) of 5.6 × 10⁵ was calculated for P. taeda using 19 loci and a substitution rate (µ = 1.17 × 10⁻¹⁰; estimated by calibrating divergence between members of section Trifoliae with the age of P. belgica; Brown et al. 2004). If we apply our estimate of µ = 0.70 × 10⁻⁹ (calibrated at 85 MYA between the subgenera; table 2), we find that the predicted Ne for P. taeda is far lower at 9.4 × 10⁴. This value of Ne contrasts sharply with census population estimates for P. taeda (which may exceed 10¹⁰; Brown et al. 2004), and it suggests that this species has experienced dramatic population growth following a relatively recent genetic bottleneck.
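The revised Ne can be reproduced with a short calculation. This sketch assumes the usual neutral relation θ = 4Neµ for diploid nuclear loci, so that for a fixed level of nucleotide diversity Ne scales inversely with µ; the text does not state how the value was recomputed, so this relation and the function name are assumptions.

```python
def rescale_effective_size(ne_old, mu_old, mu_new):
    """Rescale Ne for a revised mutation rate, holding diversity (theta) fixed.

    ASSUMPTION: theta = 4 * Ne * mu, hence Ne_new = Ne_old * mu_old / mu_new.
    """
    return ne_old * mu_old / mu_new


NE_PUBLISHED = 5.6e5    # Ne for P. taeda quoted in the text
MU_PUBLISHED = 1.17e-10 # rate calibrated with P. belgica, quoted in the text
MU_REVISED = 0.70e-9    # rate from the 85 MYA subgeneric calibration

ne_revised = rescale_effective_size(NE_PUBLISHED, MU_PUBLISHED, MU_REVISED)

if __name__ == "__main__":
    print(f"revised Ne: {ne_revised:.2e}")
```

A roughly 6-fold increase in µ lowers the inferred Ne by the same factor, which widens the gap between the effective size and the census size quoted above.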

Pinus Fossil Status
Despite the diversity and abundance of Pinus fossils (see Millar 1998), several obstacles need to be overcome before integrating additional fossils into a multipoint calibration. To date, pine fossil reports have been based on unattached organs (ovulate or pollen cones, leaves, or wood fragments). Two of these organs have yet to be found on a contiguous fossil, although species have been named based on the hypothesized common origin of separate organs. Retention of ancestral character states and homoplasy among extant Pinus species are sufficiently frequent that characters from a single organ are inadequate for discriminating among extant pine subsections or sections (Gernandt et al. 2005). Because Pinus subgenera are diagnosable by leaf vasculature (Gernandt et al. 2005) and by a combination of wood features (Van der Burgh 1973), this key divergence event provides the most reliable calibration point for Pinus.

Historically, fossil descriptions used a typological approach for assigning affiliations with extant taxa. This process is complicated by historical revisions of Pinus sectional affiliations, and it lacks the necessary phylogenetic framework for making assignments. As previously noted, the number of fibrovascular bundles per leaf is the only nonhomoplasious character known to date that diagnoses Pinus subgenera (Gernandt et al. 2005). However, the presence of a single fibrovascular bundle at the base of a leaf fossil may not be diagnostic for the subgenera because bundles branch distally into the 2 bundles characteristic of subg. Pinus (Stockey and Nishida 1986). Based on this information, Pinus sp. leaf fossils from Staten Island, New York (Jeffrey 1908) are now thought to represent subg. Pinus rather than subg. Strobus. Similarly, Stockey and Nishida (1986) suggest that Cretaceous P. yezoensis leaf fossils from Japan (Stopes and Kershaw 1910), also once believed to represent subg. Strobus, are more representative of Cedrus.

Although needles are informative within Pinus, ovulate cones are considered the most dependable evidence for assigning fossils to the genus. Miller (1976) outlined 4 characteristics that together define Pinus: inflated scale apex, bract and scale traces united at origin, all resin canals abaxial to vascular tissue in scale base, and scale strands curved on adaxial side. Critically, members of the extinct cone genus Pityostrobus can share at least 2 of these features, complicating the identification of Cretaceous pine ovulate cones. Indeed, using Miller's criteria, some early "pine" fossils have been reassigned to Pityostrobus, Picea, or Cedrus. For example, Miller and Malinky (1986) referred cone scales from the Magothy Formation of Delaware, originally described as representing subg. Strobus (Penny 1947), to Pityostrobus. Because many fossil cones are found in marine deposits, abrasion during transport can remove important cone scale features (Smith and Stockey 2002), misleading attempts at classification (Wolfe and Schorn 1989).

Evidence for an Eocene Divergence of Subgenera
Some authors have suggested a relatively recent origin for subg. Strobus in the Eocene (Miller 1973; Phipps et al. 1995; Kossack and Kinlaw 1999), as reflected by our 45 MYA crown calibration. This hypothesis is based on numerous ovulate cone fossils assigned to subg. Strobus that appear in the fossil record starting approximately 40 MYA (Millar 1998), closely following the oldest known leaves attributable to subg. Strobus (45 MYA). Unattached leaves (with subg. Strobus features), deposited near ovulate cones (with subg. Pinus features), in the Princeton Chert in British Columbia were originally described as 2 species: P. similkameenensis and P. arnoldii (Miller 1973). More recently, Phipps et al. (1995) reinterpreted these fossils as a single species (P. similkameenensis) and proposed that the mosaic of characters in this hypothetical species may represent a lineage that predates the divergence of subg. Strobus. These authors did note that alternative explanations (e.g., organs derived from a lineage with no extant descendant or from sympatric pine species) cannot be ruled out.

A 45-MYA calibration also shows surprising agreement with a recent cpDNA-based estimate of seed plant divergences using entirely different calibration sources (Magallón and Sanderson 2005). In that analysis, the divergence of Gnetophytes and Pinaceae was constrained with a fossil date of 216 MYA; estimated divergence dates for Pinus subgenera (ca. 50 MYA) and synonymous cpDNA substitution rates (µ = 0.26 × 10⁻⁹) nearly match our results. Because fossils represent minimum ages, calibration at the divergence of Pinus subgenera with 45-MYA leaf fossils appears to provide a reasonable upper bound for this event.

Evidence for a Cretaceous Divergence of Subgenera
The description of a Pinuxylon sp. from the Aachen Formation (Meijer 2000) may push the age of subg. Strobus more than 40 Myr before the oldest leaf and cone fossils, but this is not a simple determination. The cone genus Pityostrobus was abundant and diverse during the Cretaceous, but associated wood remains undescribed and its exact relationship to Pinus is unresolved (Smith and Stockey 2002). Further, the 2 characters used to affiliate the Aachen Pinuxylon wood with subg. Strobus (cross-field pitting and ray tracheid dentations) are not unequivocal synapomorphies (data not shown), even in extant pine species, indicating the possibility of retention of ancestral polymorphism or parallel evolution. Both of these tracheid features are quantitative traits for which intraspecies variation has been noted (Shaw 1914; Hudson 1960; Van der Burgh 1973) and for which the ancestral state is unclear (Hart 1987). Perhaps most challenging is that diagnosis of subg. Strobus fossil wood is made by determining that a specimen has "weak-to-absent" dentations. In this context, the description of the Aachen Pinuxylon sp. as possessing "ray tracheids faintly dentate but owing to rather poor state of preservation only locally visible" (Meijer 2000) may best be considered tantalizing evidence for the existence of subg. Strobus in the Cretaceous, but evidence requiring corroboration. The evidence for subg. Strobus–like wood in the Cretaceous is bolstered by fossil wood of about the same age that shows similarities to modern-day members of subg. Pinus (Blackwell 1984), hence providing wood fossils flanking the crown node. Although the possibility that the subg. Pinus–like features may simply reflect the ancestral state in Pinaceae cannot be excluded, these fossils do lend credence to the hypothesis that the Aachen Pinuxylon represents Pinus rather than another member of the Pinaceae.

The 45-MYA calibration (even when viewed as a minimum age) suggests a more recent Pinus–Picea split (102–87 MYA; fig. 1) than is indicated by the existence of P. belgica at approximately 135 MYA (Miller 1976). However, there are unresolved issues concerning the age of the genus. First, the approximately 35-Myr gap between P. belgica and the next Pinus fossil, combined with uncertainties about the source of P. belgica, deserves note. Second, the only other evidence for an "Early" Cretaceous origin for Pinus is pollen from Alaska (Langenheim et al. 1960). The age of the Kuk River fossil formation that is the source for these reported 145–100 MYA Pinus microfossils is now estimated to be the Mid- to Late Albian (104–97 MYA; Koteja and Poinar 2001). In addition, Erwin and Schorn (2006) recommend caution when relying on pollen without associated megafossils due to the unknown status of pollen associated with seed cone genera such as Pityostrobus. If one were to discount the inference of an "Early" Cretaceous age for P. belgica and the previous estimate for the age of the Alaskan pollen, the oldest evidence for Pinus would be ovulate cones from the Albian–Cenomanian (ca. 103 MYA; Fliche 1896). This does not appear to fit with the 190–164 MYA Pinus–Picea split implied by a Cretaceous divergence of the subgenera (fig. 1). On the other hand, the earlier age estimate for our node D (fig. 1) that results from the 85 MYA calibration appears consistent with an older divergence of Pinus and Picea because there is support for an early independent divergence of these 2 genera from the Pityostrobus grade (Smith and Stockey 2002). The 85 MYA calibration also agrees with a study of Pinaceae genera, which calibrated the Pinus–Picea split at 140 MYA, yielding an estimated age of approximately 84 MYA for the divergence of Pinus subgenera (Wang et al. 2000).

Perhaps the strongest evidence that subg. Strobus diverged in the Cretaceous is the set of numerous Oligocene (ca. 26 MYA) fossils that potentially represent recent lineages. Oligocene ovulate cone fossils have been affiliated with 6 different Pinus subsections: Pinaster and Pinus in section Pinus (Mai 1986; Erwin and Schorn 2006); Ponderosae in section Trifoliae (Wolfe and Schorn 1989); Balfourianae and Cembroides in section Parrya (Wolfe and Schorn 1989); and Strobus in section Quinquefoliae (Mai 1986). Existence of these subsections approximately 26 MYA is in better agreement with our predicted age of these groups using an 85 MYA calibration (fig. 2b, nodes E–I). This assessment carries 2 important caveats. First, most subsectional assignments have been made by typological matching that considered extant taxa from only 1 continent (but see Erwin and Schorn 2006). Second, a cladistic analysis of the characters used to affiliate fossils to subsections has yet to be conducted, so their value at this taxonomic level remains speculative. Based on the framework provided by our clock-like data, many of these Oligocene fossils must reflect symplesiomorphic character states for the simple reason that they imply too ancient a divergence for Pinus lineages. Even our 85 MYA chronogram does not project subsections diverging early enough to support the affiliation of Late Cretaceous leaf fossils (ca. 85–78 MYA) with 3 different subsections: Pinaster (Stockey and Nishida 1986), Pinus (Robison 1977), and Ponderosae (Stockey and Nishida 1986). Integrative fossil validation, such as the method used to discard "outlier" fossils in a time-calibrated phylogeny of turtles (Near et al. 2005), can be used to take advantage of the numerous pine fossils, but only as putative synapomorphies supporting fossil associations are identified.

Concluding Remarks
Our interpretations of mutation rates and divergence ages are relatively insensitive to the choice of genome (nDNA vs. cpDNA), the selection of silent sites versus all sites, the number of taxa, or clock constraints (4 taxa constrained vs. 12 taxa unconstrained). Instead, the choice of fossil and node for calibration has a pronounced impact. This suggests that refinements of estimated mutation rates will benefit more from a reevaluation of the Pinus fossil record based on a cladistic analysis of morphological characters in extant taxa than from increased genomic sampling. Because Pinus has a rich fossil record, a critical evaluation of the morphological synapomorphies that support additional phylogenetic nodes should be made a priority. The dramatic distortions of ages and rates (fig. 3) resulting from incorrect placement of fossils should serve as a cautionary note to studies of fossil-poor families. As in angiosperms (Sanderson and Doyle 2001), a relatively recent divergence of the crown group in Pinus may have been obscured by the antiquity of the stem lineage. Others have suggested that a recent origin for most extant pine taxa, especially in subg. Pinus, is the simplest explanation for low divergence rates (e.g., Strauss and Doerksen 1990; Govindaraju et al. 1992). Regardless of which absolute calibration is chosen, the relative divergence times inferred from multiple nuclear and chloroplast loci provide an important perspective for studying pine species relationships. For example, "relative" divergences between sections in subg. Pinus are only one-sixth of the total divergence within the genus.

We conclude that a 45-MYA subgeneric divergence may be too young but yields an upper limit for Pinus evolutionary rates. Because a subgeneric calibration at 85 MYA based on permineralized wood yields realistic age projections for both older and younger nodes, it provides a useful lower rate limit. Together, these rate estimates (table 2) reveal a moderate tempo for pine divergence and provide a framework that can be used to compare future conifer gene- and taxon-specific rates.


    Supplementary Material
 
Supplementary tables S1, S2, and S3 and the alignment file are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 
We thank J. Berdeen, D. Johnson, W. Kwan-Soo, and M. McGregor for providing plant collections; G. Brown, S. Gonzalez-Martinez, K. Krutovsky, D. Neale, and C. Plomion for sharing linkage maps and primer sequences; D. Erwin, S. Manchester, G. Poinar, and J. van der Burgh for advice on fossils; S. Muse, D. Remington, R. Small, J. Wendel, and 2 anonymous reviewers for helpful comments; and K. Farrell, C. Streng, and O. Zerón Flores for laboratory assistance. Funding for this study was provided by National Science Foundation grant DEB 0317103 to A.L. and R.C., Secretaria de Educación Pública grant to D.S.G., and the USDA Forest Service Pacific Northwest Research Station. The research was conducted at Oregon State University and Universidad Autónoma del Estado de Hidalgo.


    Footnotes
 
Spencer Muse, Associate Editor


    References
 

    Alvin K. Further conifers of the Pinaceae from the Wealden Formation of Belgium. Inst R Sci Nat Belg Mém (1960) 146:1–39.

    Axelrod D. Cenozoic history of some western American pines. Ann Mo Bot Gard (1986) 73:565–641.

    Blackwell W. Fossil ponderosa-like pine wood from the Upper Cretaceous of northeast Mississippi. Ann Bot (Lond) (1984) 53:133–136.

    Brown G, Gill G, Kuntz R, Langley C, Neale D. Nucleotide diversity and linkage disequilibrium in loblolly pine. Proc Natl Acad Sci USA (2004) 101:15255–15260.

    Brown G, Kadel E, Bassoni D, Kiehne K, Temesgen B, van Buijtenen J, Sewell M, Marshall K, Neale D. Anchored reference loci in loblolly pine (Pinus taeda L.) for integrating pine genomics. Genetics (2001) 159:799–809.

    Chagné D, Brown G, Lalanne C, Madur D, Pot D, Neale D,