Skip Navigation


MBE Advance Access originally published online on October 5, 2007
Molecular Biology and Evolution 2007 24(12):2816-2826; doi:10.1093/molbev/msm216
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/12/2816    most recent
msm216v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Streisfeld, M. A.
Right arrow Articles by Rausher, M. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Streisfeld, M. A.
Right arrow Articles by Rausher, M. D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Relaxed Constraint and Evolutionary Rate Variation between Basic Helix-Loop-Helix Floral Anthocyanin Regulators in Ipomoea

Matthew A. Streisfeld and Mark D. Rausher

Department of Biology, Duke University, Durham, NC

E-mail: mstreisf{at}duke.edu.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Regulatory genes are believed to play a large role in morphological diversification and are often characterized by elevated rates of evolution. Whether this rapid evolution is primarily due to adaptive differentiation or relaxed selective constraint remains an open question. We attempted to distinguish between these alternative outcomes in 2 transcription factors known to regulate the expression of anthocyanin pigmentation genes in flowers. We cloned the full-length coding region from 2 basic helix-loop-helix transcription factors from several species of Ipomoea with diverse flower colors and determined the selective forces operating on them. In both genes, rapidly evolving sites and indel mutations are clustered in nonbinding domains, but the extent of rate acceleration in these domains is reduced relative to most previously characterized plant transcription factors. Moreover, codon models of substitution rates and models evaluating the magnitude of change to physical amino acid properties demonstrate little evidence for adaptive evolution and suggest that elevated nonsynonymous substitution rates in these domains represent relaxed selective constraint. Although both genes show qualitatively similar patterns, their rates of evolution differ significantly due to an increased rate of nonsynonymous substitutions in the nonbinding domains in one copy, suggesting substantial differences in functional constraint on each gene. In general, these results provide additional evidence demonstrating that decreased constraint as opposed to positive selection is largely responsible for the frequently observed pattern of rapid evolution in particular domains of plant transcription factors. More specifically, they suggest that most of the amino acid substitutions are neutral and do not implicate a role for natural selection on these regulatory genes in the diversification of flower color in Ipomoea.

Key Words: Ipomoea • positive selection • rate variation • anthocyanin • basic helix-loop-helix • transcription factor


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Rates of morphological evolution are often greatly decoupled from rates of protein sequence evolution. A popular explanation to account for this pattern is that differences in gene regulation rather than diversification of structural genes contribute most to morphological evolution (King and Wilson 1975Go; Doebley 1993Go; Doebley and Lukens 1998Go; Carroll 2005Go). Developmental genetic analyses routinely have shown that altering the timing and/or location of gene expression can have profound effects on morphological diversity (e.g., Weigel and Meyerowitz 1993Go). However, it is still largely unclear whether mutations in structural or regulatory genes contribute most to phenotypic evolution.

One observation supporting the role of regulatory genes in plant diversification is that plant transcription factors often display elevated nonsynonymous to synonymous substitution rate ratios ({omega} or dN/dS) compared with structural genes (Purugganan and Wessler 1994Go; Purugganan et al. 1995Go; Rausher et al. 1999Go; Langercrantz and Axelsson 2000Go; Barrier et al. 2001Go; Lukens and Doebley 2001Go; Remington and Purugganan 2002Go). These higher overall rates across the genes are primarily due to particular domains with {omega} ratios that often approach 1, whereas other regions are highly conserved (Purugganan and Wessler 1994Go; Purugganan et al. 1995Go; Purugganan 1998; Rausher et al. 1999Go; Langercrantz and Axelsson 2000Go; Chang et al. 2005Go). If repeated positive selection accounts for these domain-specific elevated {omega}’s, then those substitutions may have contributed to phenotypic diversification among species. Conversely, if these substitutions are largely nonadaptive, it is unlikely that rapid evolution of these regulatory genes has played a role in morphological divergence.

Despite this evidence for rapid evolution in plant transcription factors, there have been few attempts to determine whether these elevated rates of nonsynonymous substitution are due to the effects of positive selection or whether they represent a relaxation of selective constraint. In the few cases where this has been explicitly tested, analyses repeatedly have found that particular domains in these plant transcription factors evolve at elevated rates, but rarely because repeated positive selection has driven diversification (Remington and Purugganan 2002Go; Fan et al. 2004Go; Chang et al. 2005Go; but see Hernández-Hernández et al. 2007Go). Our primary objective in this study was to extend these analyses by attempting to distinguish between relaxed selective constraint and repeated positive selection as explanations for high rates of nonsynonymous substitutions in 2 transcription factors that control the expression of anthocyanin pathway genes in the genus Ipomoea (morning glories).

The anthocyanin biosynthetic pathway has been a model for the study of gene regulation for some time, and the structural genes as well as their regulatory elements are well characterized and conserved across the angiosperms (Koes et al. 1994Go; Mol et al. 1998Go; Rausher 2006Go). According to the most recent models of transcriptional regulation of floral anthocyanin production, at least 3 gene families are believed to be involved: 1) members of the R2R3-MYB family, 2) two members of the R-like, basic helix-loop-helix (bHLH) family that are known to interact directly with each other and with their partner MYB protein, and 3) members of the WD-40 family that are believed to be important in posttranslational control of the MYB and bHLH proteins (Mol et al. 1998Go; Kroon 2004Go). The complex formed by the MYB and bHLH proteins has been shown to bind directly to the promoter regions of target structural anthocyanin genes to coordinately control transcription (Mol et al. 1998Go). Previous estimates of {omega} ratios in one bHLH regulator suggest that particular functional domains within these genes have greatly elevated rates of amino acid substitution (Purugganan and Wessler 1994Go; Rausher et al. 1999Go; Fan et al. 2004Go), again showing the familiar pattern that plant transcription factors often appear to evolve rapidly relative to the genes they regulate. However, these analyses did not rigorously test for the effects of positive selection and so cannot explicitly establish whether accelerated substitution rates were due to adaptive evolution or relaxed constraint.

The bHLH gene family is characterized by a highly conserved helix-loop-helix motif that is preceded by a series of basic amino acids. This motif is present in over 200 copies among vertebrates (Massari and Murre 2000Go), and there have been 133 bHLH factors identified in the Arabidopsis thaliana genome (Heim et al. 2003Go). Members of the R-like subfamily have been described as transcriptional regulators of anthocyanin synthesis in several model species. In particular, at least 2 gene copies that function in anthocyanin regulation have been identified in Zea mays, Petunia hybrida, Perilla frutescens, and Antirrhinum majus (Kroon 2004Go). In Ipomoea nil and Ipomoea purpurea, orthologs of the above 2 genes have been characterized and named bHLH1 and bHLH2 (Morita et al. 2006Go; Park et al. 2007Go). A third copy (bHLH3) in I. nil also has been described and appears to be derived from a recent duplication event in the ancestor of bHLH1 (Morita et al. 2006Go).

Molecular genetic studies have identified 4 functional domains within the proteins: interaction, acidic, bHLH, and C-terminal (fig. 1). The highly conserved interaction domain consists of approximately 190 amino acids at the N-terminus of the protein and has been shown to bind directly with partner R2R3-MYB factors (Goff et al. 1992). The acidic domain is composed of a region of approximately 230 amino acids that contains poorly conserved stretches of negatively charged acidic amino acids required to activate transcription of the target structural genes (Gong et al. 1999Go). The bHLH domain contains 59 highly conserved amino acids presumed to be involved in DNA binding, whereas the final approximately 150 amino acids on the C-terminus of the protein are required for subunit dimerization with other bHLH proteins (Kroon 2004Go). Rates of nonsynonymous substitution have been shown to be greatest in the acidic and C-terminal domains (Purugganan and Wessler 1994Go). This offers additional support for the observation that plant transcription factors often show domain-specific elevated dN/dS ratios, and it implies that differences in functional constraint exist among the domains. Therefore, a second goal of the current study is to determine whether the bHLH factors involved in regulation of anthocyanin genes in the genus Ipomoea also display domain-specific elevated dN/dS ratios.


Figure 1
View larger version (5K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— A schematic representation of the 2 bHLH genes sequenced in this study. The numbers represent the amino acid position of each functional domain in the Ipomoea purpurea sequence. The domains shaded in gray denote the binding domains, whereas those in white represent the nonbinding domains.

 
A final objective of this investigation is to examine whether differences in the extent of pleiotropy manifested by the 2 bHLH paralogs has resulted in variable levels of evolutionary constraint. Mutant analyses of bHLH2 in Ipomoea species have demonstrated that the gene controls multiple additional functions besides the regulation of floral anthocyanins. Mutants deficient for the bHLH2 gene in I. purpurea and Ipomoea tricolor have reductions in flower and seed coat pigmentation, yielding pale flowers and ivory-colored seeds (Park et al. 2004Go, 2007Go). In addition, other seed epidermal traits that are known to be important in plant defense are negatively affected in the I. purpurea mutants. These include smaller and fewer unbranched trichomes and reduced accumulation of the proanthocyanidin and phytomelanin secondary metabolites in the ivory seeds (Park et al. 2007Go). In contrast, regulation of anthocyanin synthesis is the only function currently assigned to the orthologs of the bHLH1 gene in the flowering plant species that have been characterized (Goff et al. 1992Go; Quattrocchio et al. 1998Go; Spelt et al. 2000Go). These results suggest that there may be substantial differences in the extent of constraint on the 2 loci, and this has implications for their relative rates of evolution. Because of the multiple functions assigned to bHLH2, it is reasonable to predict that bHLH2 evolves more slowly than bHLH1, whose only known function is in the regulation of anthocyanin synthesis.

In summary, we address here 3 questions concerning the rates and patterns of molecular evolution of 2 bHLH transcription factors in Ipomoea. First, we ask how well these bHLH transcription factors conform to the patterns of regulatory gene evolution seen in other plant transcription factors. Specifically, is there evidence for rapid evolution of the coding region as a whole, and if so, can this be attributed to particular domains with elevated {omega} ratios? Second, as mentioned above, orthologs of these bHLH genes appear to be essential for the transcriptional regulation of floral anthocyanins in several model species. Because floral pigmentation is an ecologically important character that is often linked to pollinator attraction, rapid evolution of these bHLH regulatory genes due to repeated positive selection may be indicative of adaptive evolution. Therefore, we select a set of species in the genus Ipomoea that shows substantial flower color diversification and ask whether there is evidence for positive selection in either of the bHLH transcriptional regulators. Among several of the species included in this investigation, flower color has been shown to contribute to adaptive differentiation (Machado and Sazima 1987Go; Rausher and Fry 1993Go). Finally, we test whether the gene copies evolve at different rates, as would be predicted by what is known about relative differences in functional constraint on the individual genes.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Species
The genus Ipomoea (Convolvulaceae) contains approximately 1,000 species that are distributed throughout the tropics and subtropics. In this study, we selected 12 species from within Ipomoea as ingroup species as well as the more distantly related Operculina pteripes as an outgroup. The phylogenetic relationships among these species are well established (Miller et al. 1999Go; Manos et al. 2001Go), and they display substantial variation of floral colors (table 1). Plant material used for isolation of nucleic acids was obtained from our greenhouse collections. Voucher information for several of these species is available in Miller et al. (1999)Go or can be obtained directly from the authors.


View this table:
[in this window]
[in a new window]

 
Table 1 A List of the Species Sequenced in This Study, with Their Associated Flower Colors and GenBank Accession Numbers

 
Sequencing of bHLH Genes
We sequenced Ipomoea bHLH genes using polymerase chain reaction (PCR) amplification of first-strand complementary DNA (cDNA), derived from total RNA extracted from the distal half of floral bud tips, approximately 1 day before the flower opened. RNA was extracted using the Spectrum Plant Total RNA extraction kit (Sigma-Aldrich, St. Louis, MO) and first-strand cDNA was generated using M-MLV reverse transcriptase (Invitrogen, Carlsbad, CA), both according to the manufacturers’ specifications. Orthologs of 2 previously identified bHLH genes in I. nil (InbHLH1 and InbHLH2; Morita et al. 2006Go) were PCR amplified in this study from all 13 species, with the exception of I. purpurea, I. nil, and I. tricolor (bHLH2 only), whose sequences were available on public databases. A third copy of this gene in I. nil, InbHLH3, was not sequenced in this study as it did not appear to be expressed in corolla tissues (Morita et al. 2006Go). PCR primers were designed based on copy-specific conserved regions throughout the coding sequence by manually aligning the publicly available sequences. In most cases, 2 to 3 overlapping PCR products were obtained that covered the entire coding sequence. After PCR amplification, products were gel extracted (Qiagen, Valencia, CA) and cloned into the pCR 2.1 TOPO vector (Invitrogen). A minimum of 3 clones were sequenced for each product using an ABI 3730 capillary sequencer and the Big-Dye protocol (Applied Biosystems, Foster City, CA). Both strands of plasmid DNA were sequenced using the M13 forward and reverse primers. All PCR primers used are listed in Supplementary Material online.

Phylogenetic Reconstructions
Overlapping sequencing products were combined to form single contiguous reads for each species. Nucleotide sequences were translated to the predicted amino acid sequence and aligned separately for each copy using ClustalW, followed by manual adjustment of both the amino acid and nucleotide sequences. Only short stretches of amino acid residues were conserved well enough between the paralogs to perform unambiguous alignments. As a result, all subsequent analyses evaluated each gene copy separately. Phylogenetic reconstructions were first performed for each bHLH gene to determine whether the recovered gene trees reflected the established species relationships (Miller et al. 1999Go). In addition, internal transcribed spacer (ITS) sequences for these species were provided by R. Miller (Southeastern Louisiana University), aligned, and used for separate phylogenetic reconstructions. Phylogenetic trees were constructed using MrBayes v. 3.1 (Huelsenbeck and Ronquist 2001; Ronquist and Huelsenbeck 2003Go). We analyzed 4 independent runs, each using the general time reversible (GTR) model plus gamma-distributed rates (GTR + {gamma}), as determined by Modeltest v3.7 (Posada and Crandall 1998Go). The data were partitioned by codon position (for the bHLH genes), and we ran 5 million generation Markov Chain Monte Carlo simulations with 4 separate chains, with the first 250,000 generations discarded as burn-in. Trees were summarized for each independent run and compared to check for concordant topologies. The consensus tree of all compatible groupings among all runs was used in all analyses.

Estimating Rates of Amino Acid Substitution
We evaluated codon- and lineage-based maximum likelihood models to characterize the selective processes that may have previously acted on each bHLH gene (Nielsen and Yang 1998Go; Yang and Nielsen 2000Go; Yang et al. 2000Go). Although these models may have low power to detect positive selection if it occurred infrequently at only a limited number of sites, it has been shown that repeated episodes of positive selection are readily detectable using these methods. All analyses were implemented using the codeml program of PAML version 3.15 (Yang 1997Go). The ratio of nonsynonymous to synonymous substitution rates (dN/dS or {omega}) was compared in these models. An estimate significantly less than 1 implies purifying selection, whereas an estimate of 1 suggests the presence of neutral evolution. Positive selection is indicated if the estimated {omega} is significantly greater than 1.

We first determined whether there was evidence for heterogeneous rates of evolution along any of the lineages leading to the extant species we studied. These lineage-based models assume a single {omega} for all codon sites. We compared estimates from a model where a single {omega} was constrained on all branches of the phylogeny with a free-ratio model, where {omega} was allowed to vary independently on each branch. We used the standard likelihood ratio framework (Hocking 1985Go) to evaluate whether the free-ratio model fit the data significantly better than the single-{omega} model. Specifically, 2 times the difference in log-likelihood scores from each model were compared with a {chi}2 distribution with degrees of freedom (df) equal to the difference in estimated parameters between the models.

We next evaluated whether particular amino acid residues were subject to positive selection by using codon site models, where the mean value of {omega} was held constant on each branch, but {omega} was estimated independently for each codon. For these analyses, a series of nested models was run, and likelihood ratio tests (LRTs) were conducted to test for the presence of positive selection on individual amino acids (Yang et al. 2000Go). Model M0 represents the null model that all sites have a single-{omega} value. Model M1a estimates 2 different site classes that are either constrained (0 < {omega} < 1) or evolving neutrally ({omega} = 1). Model M2a is identical to M1a, except that it adds a third site class of amino acids that may be positively selected ({omega} > 1). In Model M3 (discrete), each codon is assigned to 1 of 3 site classes that have been estimated without any constraint. Models M7 and M8 both model {omega} according to a beta distribution, with the parameters of the distribution estimated from the data. Model M8 differs from M7 in that it contains an extra site category with {omega} ≥ 1. LRTs were constructed between Models M1a and M2a and between M7 and M8. If Models M2a and/or M8 fit the data significantly better than Models M1a or M7, respectively, and some of the {omega} values for individual amino acids are estimated according to an empirical Bayes procedure to be greater than 1, then positive selection is indicated.

Analysis of Amino Acid Substitutions
The ability to accurately detect positive selection using methods that rely only on the dN/dS ratio may be quite poor, particularly if adaptive evolution targeted only a limited subset of amino acid changes and/or occurred infrequently (Kosakovsky Pond and Frost 2005Go; McClellan et al. 2005Go). Additional methods for examining the effects of nonsynonymous substitutions on protein evolution have focused on changes to physicochemical properties of particular amino acid residues. Because these properties are important in determining the 3-dimensional structure and conformation of the protein, and hence its biochemical function, it may be more appropriate to evaluate selection in terms of the magnitude of change in physicochemical property.

For each bHLH gene, we examined whether nonsynonymous substitutions caused significantly more or less change in 31 different physicochemical amino acid properties than expected under neutrality, as implemented in the program TreeSAAP version 3.2 (Wooley et al. 2003Go). For all possible pairwise amino acid changes, the range of effect size for each of the 31 properties was determined and equally divided into 8 magnitude categories. Lower number categories represented more conservative changes, and higher category numbers indicated more radical changes. Ancestral nucleotide sequences at each node of the phylogeny were then reconstructed using the baseml program of PAML v.3.15, with the GTR model of nucleotide substitution. For each inferred amino acid substitution in the data, the magnitude of effect size for each physicochemical property was determined and assigned to 1 of the 8 categories. An expected distribution that modeled the effects of neutral evolution was calculated by characterizing the 9 possible nucleotide changes from each codon in the data set, evaluating each amino acid substitution for the magnitude of change in each property, and assigning each change to 1 of the 8 categories defined above. These changes were summed across the data to construct a set of relative frequencies of change for all of the magnitude classes. If the distribution of observed changes fit the data significantly better than the expected distribution according to {chi}2 goodness-of-fit tests, we rejected the null hypothesis that physicochemical property changes in each bHLH gene were neutral. For all properties that differed significantly from neutrality, Z-scores were then calculated in each magnitude category to determine which classes contributed to this deviation. Significantly positive Z-scores indicated an overrepresentation of substitutions of that magnitude relative to neutrality, and significantly negative Z-scores indicated an underrepresentation of those substitutions relative to neutral expectations. We used a Bonferroni correction to account for the effects of multiple comparisons by adjusting the critical significance level for both goodness-of-fit and Z-score tests to P < 0.001 (Sokal and Rohlf 1995Go). Substitutions of magnitude categories 1 and 2 were considered to be the most conservative, whereas substitutions of categories 7 and 8 were the most radical. Radical substitutions affecting a particular property that occurred more frequently than expected by chance constituted the signature of adaptive evolution (McClellan et al. 2005Go).

Domain-Specific Differences in Rates of Amino Acid Substitution
In order to determine if rapidly evolving amino acid sites were clustered into particular functional domains of the bHLH genes, we first partitioned the sequence of each bHLH gene into binding domains (including the interaction and bHLH domains) and nonbinding domains (acidic and C-terminal), according to figure 1. We used a method described previously by Lu and Rausher (2003)Go to evaluate whether the dN/dS ratios were significantly different between the domains for each gene. Briefly, the dN/dS ratio was estimated separately for each domain using Model M0 from the codeml program of PAML. We next attempted to find a value of {omega} that satisfied each of 2 criteria: 1) its value lied in between the {omega} value for each gene and 2) it was significantly different than the {omega} for each gene. If such a value could be found, then it would suggest that the confidence intervals around the estimates of {omega} did not overlap between the 2 genes and the {omega} values were significantly different from each other. In order to assess significance, we performed LRTs by comparing the log likelihood of the model where {omega} was estimated to the log likelihood of the same model where {omega} was constrained to particular values.

We also compared the locations of the most rapidly evolving sites, as identified by Model M8 in PAML. Because our analysis of the bHLH2 gene indicated that approximately 15% of sites occurred in the rapidly evolving site class with {omega} = 1.2 (see below), we used empirical Bayes estimation to designate the 15% of sites with the greatest estimated {omega} ratio as the most rapidly evolving sites. We then noted the functional domain where each identified site occurred (binding or nonbinding) and used a G-test (correcting for differences in domain size) to examine the hypothesis that rapidly evolving sites occurred more frequently in the nonbinding domain than expected by chance. We performed the same analysis with the 15% of sites to be evolving most rapidly in the bHLH1 gene as well.

Comparing dN/dS between Genes
In order to determine whether rates of evolution differed between the 2 bHLH genes, we compared dN/dS ratios using 2 separate methods. The first procedure was described above and attempted to find a value of {omega} that was significantly different than the estimated {omega} for each gene (Lu and Rausher 2003Go). We used Model M0 from PAML to estimate the {omega} ratio for the full sequence of each bHLH gene.

The second approach followed the methods of Yang and Swanson (2002)Go, where variation in site heterogeneity was estimated between previously defined partitions in a combined data set consisting of both genes. We partitioned the data by gene and ran Model C, which estimated different substitution patterns and equilibrium codon frequencies between the partitions, but the same transition/transversion rate ratio ({kappa}) and nonsynonymous/synonymous rate ratio ({omega}). We compared that model with the nested-Model E, which was identical to Model C, except that it estimated {kappa} and {omega} separately between the partitions. We used a likelihood ratio test with 2 df to determine if Model E fit the data significantly better than Model C, which would indicate that {kappa} and/or {omega} evolved at different rates between the 2 genes. In these analyses, a significant likelihood ratio statistic cannot unambiguously determine that {omega} differs between the genes because in both models, {omega} is simultaneously estimated with {kappa}. However, we compared the point estimates of each parameter between the partitions in Model E to evaluate whether 1 or both parameters changed substantially between the partitions. Finally, to determine if differences in rates between the genes could be attributed to particular functional domains, we ran 2 additional analyses that partitioned either the binding or the nonbinding domains of each gene. We then compared which domains contributed most to variation in evolutionary rates.


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
In order to characterize the selective forces that have operated on the bHLH anthocyanin transcriptional regulators, we cloned and sequenced the entire coding region from 2 bHLH genes from 13 species of Ipomoea. Sequence lengths varied among species from 1,878 to 1,896 bp in bHLH1, whereas bHLH2 was slightly larger and ranged from 1,980 to 2,055 bp. The amino acid alignments of both genes are provided in Supplementary Material online. The effective number of codons used and GC nucleotide content (calculated with DNAsp v. 4.0; Rozas et al. 2003) were highly similar among species and genes. There were no indications in any of the sequences of premature stop codons or frameshift mutations that would appear to cause a nonfunctional enzyme, even though insertion–deletion (indels) mutations were common.

Bayesian phylogenetic reconstructions of the ITS sequences for these 13 taxa recovered the established species relationships (fig. 2; Miller et al. 1999Go). The individual bHLH gene trees also consistently recovered the relationships among species. This indicates that the gene sequences we analyzed were likely to be orthologous, but it should be noted that the molecular genetic analyses to test protein function have not been performed for most species. However, conserved function of these bHLH transcription factors has been maintained since the split of the monocots and dicots (Lloyd et al. 1992Go; Quattrocchio et al. 1993Go), suggesting similar function among species of Ipomoea. The only major incongruity among the gene trees was in the placement of Ipomoea hochstetteri. The bHLH1 gene tree reconstructed it as subtending the clade containing I. nil and Ipomoea hederaceae, whereas the ITS and bHLH2 consensus trees placed it basal to all Ipomoea species except Ipomoea violaceae. For all further analyses, we used the topology derived from the ITS sequences (fig. 2), although analyses using each bHLH gene tree provided qualitatively similar results (data not shown).


Figure 2
View larger version (9K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— A Bayesian phylogeny of the 13 species used in this study, derived from ITS sequence data. This topology was used in all analyses. Branch lengths connote no information.

 
Domain-Specific Differences in Rates of Amino Acid Substitution
The nonbinding domain shows elevated {omega} ratios relative to the binding domain in bHLH2 but not in bHLH1 (table 2). In bHLH2, the {omega} ratio in the nonbinding domain is more than 2 times greater than the binding domain, whereas the difference between the domains in bHLH1 is more modest and does not vary significantly (table 2). To further characterize differences in domain-specific rates of amino acid substitution, we determined whether the 15% of sites with the highest {omega} ratio in each gene, as described by Bayesian estimation, are located more frequently in the nonbinding domains. G-tests indicate that these rapidly evolving sites do occur more frequently in the nonbinding domains for both genes (bHLH1: G = 11.26, df = 1, P < 0.001; bHLH2: G = 26.87, df = 1, P < 0.0001), suggesting that rapidly evolving sites in both genes are significantly clustered in the nonbinding domains.


View this table:
[in this window]
[in a new window]

 
Table 2 Estimates of {omega} from Codon-Based Model M0 of PAML, Partitioned for the Binding and Nonbinding Domains for the bHLH1 and bHLH2 Genes

 
In addition, indel mutations occur more frequently in the nonbinding domains. For bHLH1, no indels are present in either the interaction or the bHLH domains, but 7 indels occur in the nonbinding domains. The bHLH2 gene contains considerably more indels than bHLH1, and they again appear to avoid the binding domains (binding = 3; nonbinding = 26). Finally, in regions outside of indels, invariant amino acid sites are found more frequently in the binding domains as compared with the nonbinding domains for both bHLH1 (G = 6.09, df = 1, P < 0.05) and bHLH2 (G = 14.64, df = 1, P < 0.001), indicating that rates of evolution differ between the domains. Although these data are consistent with previous analyses of plant transcription factors showing that certain domains have elevated rates of evolution relative to other regions, the overall low {omega} ratios in the nonbinding domains of each bHLH regulator indicate strong constraint on these regions as well (table 2).

Rates and Patterns of Amino Acid Substitution
In order to determine the selective forces operating on these bHLH genes, we first examined whether there was heterogeneity in dN/dS ratios among lineages for each gene by performing branch tests. LRTs comparing single-ratio and free-ratio models do not differ significantly for either gene (bHLH1: {Lambda} = 30.46, df = 22, P > 0.1; bHLH2: {Lambda} = 20.7, df = 22, P > 0.5), providing no reason to reject the null hypothesis that the genes have evolved at constant rates along lineages.

We then compared models that evaluate the evolutionary forces acting on individual codon sites within each gene. For bHLH1, Models M1a, M2a, M3, M7, and M8 of PAML all result in nearly identical log-likelihood scores (table 3), providing no reason to reject Model M1a that classifies sites as either highly constrained or evolving neutrally. Model M1a indicates that 85% of sites are located in the highly constrained site class, with {omega} estimated as 0.087. Furthermore, for both of the selection models (M2a and M8) and the discrete Model (M3), estimates of {omega} are not greater than 1 for any site class, again providing no evidence for positive selection in the bHLH1 gene. Empirical Bayes estimation of individual amino acid sites belonging to the "selected" site class in Model M8 indicates that only 9 amino acids sites throughout the entire gene have an estimated {omega} greater than 1. For these sites, the maximum {omega} is 1.29 ± 0.369, and none of these sites is estimated to have a posterior probability greater than 0.73 of belonging to the rapidly evolving site class. These results provide consistently strong evidence that the bHLH1 gene has experienced substantial selective constraint and little evidence for positive selection at any amino acid site.


View this table:
[in this window]
[in a new window]

 
Table 3 Results of Codon-Based Models of Molecular Evolution for the bHLH1 and bHLH2 Genes in Ipomoea, Using the Tree Topology Derived from ITS Sequence Data

 
A similar analysis of the bHLH2 gene provides nearly identical results to bHLH1 and is unable to identify putatively, positively selected sites in this gene either. Models M1a–M3 result in log-likelihood scores that are indistinguishable from each other (table 3). Model M1a estimates that approximately 82% of sites are highly constrained with {omega} = 0.136. Even though additional parameters are estimated in the M2a and M3 models, they do not fit the data significantly better than a model that assumes all sites to be either highly constrained or evolving neutrally. In fact, the discrete model (M3), which freely estimates site classes based on the data, defines only 2 different classes, one with {omega} = 0.15 and the other with {omega} = 1.19, a value not substantially greater than 1. Approximately 85% of the amino acid sites are assigned to the more constrained class of sites in this model. Model M8 fits the data significantly better than Model M7 ({Lambda} = 11.0, df = 2, P < 0.01), and because 15% of sites are located in the positively selected site class with {omega} = 1.20, weak positive selection may be indicated with these comparisons. However, it has been found that a poor fit of the data to a beta distribution may result in a high frequency of significant tests when comparing Models M7 and M8 even in the absence of positive selection. As a result, Swanson et al. (2003)Go devised a likelihood ratio test to account for these apparently elevated type I error rates. Briefly, the original Model M8 that contains an extra site class constrained to have {omega} ≥ 1 is compared with a more restricted null model (M8a), where the extra site class is constrained to have {omega} = 1. This test has the added benefit of directly testing whether the estimated {omega} in the extra site class is significantly greater than 1. The likelihood ratio score is compared with a {chi}2 distribution with 1 df. When performing this analysis with the sequence data from the bHLH2 gene, Model M8 does not significantly differ from Model M8a ({Lambda} = 0.94, df = 1, P > 0.3), indicating that the estimated {omega} = 1.20 is not significantly different than 1 and that there is little indication of positive selection in this gene either.

Analysis of Amino Acid Substitutions
Methods relying on dN/dS ratios typically have low power to detect positive selection, especially if selection targeted only a limited number of amino acid sites or occurred infrequently. As a result, a new class of models that evaluates the effects of amino acid substitutions on the magnitude of change in physicochemical amino acid properties has recently been developed. We used the program TreeSAAP (Wooley et al. 2003Go) to model how 31 different physicochemical properties were affected by amino acid substitutions in both bHLH genes. Consistent overrepresentation of radical amino acid changes (i.e., categories 7 and 8) would suggest repeated adaptive substitution (McClellan et al. 2005Go).

According to both goodness-of-fit tests and Z-scores, amino acid substitutions in the bHLH1 gene affect the magnitude of change of 4 physicochemical properties more frequently than would be expected if all substitutions are equally likely to occur (table 4). Conservative substitutions of magnitude category 3 occur more frequently than expected by chance for 2 of the properties (β-structure tendencies and hydropathy), whereas slightly more radical substitutions (i.e., those found in category 6) affecting alpha-helical tendencies and the power to be at the C-terminal also are overrepresented in the data set (table 4). However, for none of the properties do the most radical amino acid substitutions (i.e., those found in categories 7 and 8) occur more frequently than expected under neutrality, indicating that these substitutions do not appear to have large effects on the physical structure of the protein. These results provide additional evidence that the entire bHLH1 gene is highly constrained and has not experienced repeated episodes of adaptive evolution.


View this table:
[in this window]
[in a new window]

 
Table 4 A List of Statistically Significant Physicochemical Amino Acid Properties for A) bHLH1 and B) bHLH2, As Implemented in the TreeSAAP v. 3.2 Software Program (Wooley et al. 2003Go)

 
For bHLH2, 13 different properties show significant deviations from neutral expectations (table 4). Interestingly, for 12 of these 13 properties, amino acid substitutions of particular magnitude categories occur less frequently than expected by chance, suggesting that purifying selection eliminates these classes of substitutions. Of these 12 properties, 3 of them also show an overrepresentation of particular types of amino acid substitutions, but these occur only in the most highly conservative classes (categories 1–3). For only one property (turn tendencies) are marginally radical amino acid substitutions (category 6) overrepresented. For 2 properties related to polarity, radical amino acid substitutions from categories 7 and 8 occur significantly less often than expected due to random substitution, suggesting that drastically altering the polarity of the protein is strongly selected against. Consistent with our analyses comparing dN/dS ratios, these results indicate that amino acid substitutions have not radically altered the physicochemical makeup of the protein, providing little evidence that repeated positive selection in either bHLH gene contributed to adaptive differentiation of flower color in Ipomoea.

Comparing dN/dS between Genes
In order to determine whether bHLH1 and bHLH2 evolved at different rates, we followed previously described methods from Lu and Rausher (2003)Go. Model M0 from PAML estimates the {omega} ratio for bHLH1 and bHLH2 to be 0.192 and 0.289 for the 2 genes, respectively. The log-likelihood scores for these models each fit the data significantly better than a model, where {omega} is constrained to the intermediate value of {omega} = 0.241 (bHLH1: {Lambda} = 5.84, df = 1, P < 0.02; bHLH2: {Lambda} = 5.3, df = 1, P < 0.03). Therefore, we have 1) found a value of {omega} that lies in between the estimated values for each gene and 2) that value of {omega} is significantly different than the values individually estimated by PAML for each gene, satisfying both criteria and allowing us to conclude that the dN/dS ratios for bHLH1 and bHLH2 are significantly different from each other.

A second method for comparing the dN/dS ratio between the genes provides similar results, consistent with the idea that the bHLH2 gene is evolving faster than bHLH1. By evaluating a combined data set partitioned according to each gene, we observe significant differences between Models C and E ({Lambda} = 18.9, df = 2, P < 0.0001), indicating that both the transition/transversion rate ratio ({kappa}) and {omega} vary significantly between the genes. The estimated {omega} is higher for bHLH2 than for bHLH1 (0.289 vs. 0.193) and {kappa} is higher for bHLH1 than for bHLH2 (2.47 vs. 1.94).

The higher estimated dN/dS ratio for bHLH2 can be attributed to an increase in the nonsynonymous substitution rate as opposed to a decrease in the synonymous substitution rate. When Model M0 is run for each gene separately in the site-based analyses, the tree lengths for synonymous and nonsynonymous substitutions differ. The ratio of bHLH2 to bHLH1 tree lengths for synonymous substitutions (synonymous substitutions per codon) is 1.12, indicating that synonymous substitutions occur at roughly similar rates between the 2 loci. However, the ratio of nonsynonymous tree lengths between bHLH2 and bHLH1 is 1.68, which is 50% greater than the rate of synonymous substitution. These different rates between the genes as a whole can be attributed to a higher dN/dS ratio in the nonbinding domains. When we compare only the binding domains of each gene, we find no significant difference between the genes for Models C and E ({Lambda} = 0.98, df = 2, P > 0.6). The freely estimated {omega} ratios for each gene from Model E are similar (bHLH1: 0.136; bHLH2: 0.130). However, when we run the analysis only comparing the nonbinding domains between the genes, we obtain a significant likelihood ratio statistic ({Lambda} = 36.68, df = 2, P < 0.0001). The estimated {omega} ratio for this region of the 2 genes is nearly twice as high for bHLH2 than for bHLH1 (0.408 vs. 0.210). These differences are due to greater rates of nonsynonymous substitution in bHLH2 as the ratio of the nonsynonymous tree lengths is nearly twice the ratio of the synonymous tree lengths (1.751 vs. 0.893). These results indicate that bHLH2 is experiencing significantly reduced selective constraint as compared with bHLH1 and this difference is restricted to the nonbinding domains. Furthermore, measures of nucleotide composition were similar among species and among genes, suggesting that selection for codon usage does not appear to account for differences in subsitution rates between the genes.


    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Regulatory Gene Evolution
Recent investigations into the patterns of molecular evolution of plant transcription factors have revealed 2 common patterns (Purugganan and Wessler 1994Go; Purugganan et al. 1995Go; Rausher et al. 1999Go; Langercrantz and Axelsson 2000Go; Barrier et al. 2001Go; Remington and Purugganan 2002Go): 1) transcription factors frequently exhibit elevated rates of nonsynonymous substitutions compared with the structural genes they regulate and 2) elevated rates are often restricted to certain domains, whereas other regions remain highly conserved. Purugganan (1998)Go has termed these patterns "rapid mosaic" evolution, but the functional significance of these regions remains an open question. In some cases, it is clear that the rapidly evolving regions play essential roles in protein function (Gong et al. 1999Go; Kroon 2004Go), whereas in others it appears that most substitutions are essentially neutral (e.g., Chang et al. 2005Go). Results from our study largely conform to the pattern of rapid mosaic evolution, although the rate acceleration in the nonbinding domains is not as great as is commonly observed. In bHLH2, the rate in the nonbinding domain (0.41) is approximately 2-fold lower than that exhibited by variable regions in some transcription factors (e.g., 0.8 in a bHLH gene in the Poaceae; Purugganan and Wessler 1994Go), but comparable to that exhibited by the tb1 gene in grasses (0.39; Lukens and Doebley 2001Go). Moreover, the dN/dS ratio in the nonbinding domain of bHLH2 in Ipomoea is substantially higher than that in the structural genes of the anthocyanin pathway (0.034–0.277; Lu and Rausher 2003Go). By contrast, although rapidly evolving sites in bHLH1 are clustered in the nonbinding domains, the dN/dS ratio is not significantly different than that of the binding domains (table 2). Furthermore, the dN/dS ratios for both the binding and nonbinding domains of bHLH1 fall within the range of values for the structural genes. Therefore, bHLH1 does not appear to exhibit as rapid evolution in nonbinding domains as many plant transcription factors, including bHLH2.

Our results for bHLH1 also differ somewhat from those reported for its orthologous copies in the Poaceae (Purugganan and Wessler 1994Go) and in Cornus (Fan et al. 2004Go). Although the Poaceae study reports very similar dN/dS ratios for the binding domains (0.14 for Ipomoea vs. 0.12 for Poaceae), the ratio in the nonbinding domains is 4.5 times greater (0.89 vs. 0.21) for the Poaceae than for Ipomoea. In Cornus, the ratios in all 4 of the individual domains (interaction = 0.20; acidic = 0.48; bHLH = 0.38; C-terminal = 0.42; Fan et al. 2004Go) show somewhat higher values than either the binding or nonbinding domains in Ipomoea. However, despite sites in Cornus with estimated {omega} ratios that approach 1.0, no class of sites consistently demonstrated positive selection. Because attempts to detect selection on the grass bHLH genes have not been undertaken, it is currently unknown whether the more rapid rate of evolution in the Poaceae is due to relaxed constraint or an increase in the frequency of adaptive substitution.

Adaptive Evolution
The primary objective of this study was to determine whether accelerated rates of evolution in bHLH1 and bHLH2 nonbinding domains are due primarily to increased positive selection or decreased constraint. Our results provide very little support for the positive selection hypothesis. Although within the nonbinding domains approximately 20% of sites exhibit high {omega} ratios, in neither paralogs is there a class of sites for which {omega} is significantly greater than 1.0. Moreover, in neither copy do radical amino acid substitutions occur in substantially greater frequencies than would be expected by chance. Additionally, the absence of indels in the binding domains implies that these regions are structurally more constrained than the nonbinding domains. In fact, it has been shown that indel mutations in the binding domains can reduce or completely abolish transcription of target anthocyanin genes in bHLH orthologs of maize and P. hybrida (Liu et al. 1998Go; Spelt et al. 2002).

Further evidence for relaxed constraint is provided by the observation that ectopic expression of the maize R-like bHLH anthocyanin regulator (which appears orthologous to bHLH1 in Ipomoea) elicits expression of anthocyanin pigmentation in Petunia, Nicotiana, and Arabidopsis (Lloyd et al. 1992Go; Quattrocchio et al. 1993Go). Despite the maintenance of function between these widely diverged taxa for over 100 Myr, there is little sequence homology among species in the nonbinding domains, regions known to be essential for transcriptional activation and dimerization with other bHLH proteins. This pattern suggests that proper function of these domains appears to depend not on particular amino acids being present at particular locations but more loosely on conservation of general properties of these regions such as size, shape, and charge. This is evident from the significant underrepresentation of particular classes of substitutions affecting 12 physicochemical amino acid properties in Ipomoea bHLH2 (table 4), suggesting that these properties remain constrained even though amino acid substitutions are common.

These patterns are very similar to those found for another anthocyanin pathway transcription factor in Ipomoea. Chang et al. (2005)Go report that despite a very high dN/dS ratio (0.80) in the nonbinding domain of Ipmyb1, adaptive substitution could not be detected either by analysis of dN/dS ratios or by analysis of properties of substituted amino acids. Based on this evidence, it seems doubtful that the elevated rates of substitution in the nonbinding regions of these genes contribute substantially to adaptive floral color differentiation in Ipomoea. More generally, the failure to detect adaptive substitution in other plant transcription factors (e.g., Lukens and Doebley 2001Go; Remington and Purugganan 2002Go; Fan et al. 2004Go), along with the common observation that indels are restricted to nonbinding regions (Lukens and Doebley 2001Go; Chang et al. 2005Go), suggests that constraint is relaxed and most substitutions are neutral in variable domains of these genes. By contrast, positive selection has been implicated in the evolution of substitutions in the binding domains of several plant transcription factors (Jia et al. 2003Go, 2004Go). These patterns suggest that, to the extent that phenotypic change is caused by transcription factor evolution, it will involve primarily substitutions within conserved binding domains.

Functional Constraint
A secondary objective of this investigation was to test the hypothesis that bHLH2 would evolve more slowly than bHLH1 because of its greater pleiotropy. In both Ipomoea and Petunia, multiple functions are ascribed to bHLH2 and its orthologs, including production of seed coat color and trichomes, acidification of the vacuole, and production of secondary metabolites involved in plant defense (Spelt et al. 2002; Park et al. 2004Go, 2007Go). In contrast, regulation of anthocyanin synthesis is the only function currently assigned to the orthologs of bHLH1 in the flowering plant species that have been characterized (Goff et al. 1992Go; Quattrocchio et al. 1998Go; Spelt et al. 2000Go).

Surprisingly, we observed the opposite pattern: in Ipomoea, the bHLH2 gene evolves at a significantly higher rate than bHLH1, due primarily to an increase in the rate of nonsynonymous substitutions in its nonbinding domains. These apparently conflicting results potentially can be resolved when we consider that the orthologs of bHLH1 are involved in more protein–protein interactions than orthologs of bHLH2 (Kroon 2004Go). This suggests that despite more pleiotropic effects of bHLH2 relative to bHLH1, the nonbinding domains of the Ipomoea bHLH2 gene may be released from some of the constraint associated with ensuring that all protein-binding sites are maintained. Therefore, this may account for the observed significant increase in the {omega} ratio in bHLH2 relative to bHLH1.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Supplementary materials are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
We thank L. Xu for assistance in the laboratory; R. Miller for providing ITS sequence data; S. Smith and D. Des Marais for helpful discussions with analyses; and D. McClellan for assistance with the TreeSAAP software. This work was supported by an award from the National Science Foundation.


    Footnotes
 
Patricia Beldade, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 

    Barrier M, Robichaux RH, Purugganan MD. Accelerated regulatory gene evolution in an adaptive radiation. Proc Natl Acad Sci USA. (2001) 98:10208–10213.[Abstract/Free Full Text]

    Carroll SB. Evolution at two levels: on genes and form. PLoS Biol (2005) 3:1159–1166.[Web of Science]

    Chang SM, Lu Y, Rausher MD. Neutral evolution of the nonbinding region of the anthocyanin regulatory gene Ipmyb1 in Ipomoea. Genetics (2005) 170:1967–1978.[Abstract/Free Full Text]

    Doebley J. Genetics, development and plant evolution. Curr Opin Genet Dev (1993) 3:865–872.[CrossRef][Medline]

    Doebley J, Lukens L. Transcriptional regulators and the evolution of plant form. Plant Cell (1998) 10:1075–1082.[Free Full Text]

    Fan C, Purugganan MD, Thomas DT, Wiegmann BM, Xiang QY. Heterogeneous evolution of the Myc-like anthocyanin regulatory gene and its phylogenetic utility in Cornus L. (Cornaceae). Mol Phylogenet Evol (2004) 33:580–594.[CrossRef][Web of Science][Medline]

    Goff SA, Cone KC, Chandler VL. Functional analysis of the transcription activator encoded by the maize B-gene: evidence for a direct functional interaction between two classes of regulatory proteins. Gene Dev (1992) 6:864–875.[Abstract/Free Full Text]

    Gong ZZ, Yamagishi E, Yamazaki M, Saito K. A constitutively expressed Myc-like gene involved in anthocyanin biosynthesis from Perilla frutescens: molecular characterization, heterologous expression in transgenic plants and transactivation in yeast cells. Plant Mol Biol (1999) 41:33–44.[CrossRef][Web of Science][Medline]

    Heim MA, Jakoby M, Werber M, Martin C, Weisshaar B, Bailey PC. The basic helix-loop-helix transcription factor family in plants: a genome-wide study of protein structure and functional diversity. Mol Biol Evol (2003) 20:735–747.[Abstract/Free Full Text]

    Hernández-Hernández T, Martínez-Castilla LP, Alvarez-Buylla ER. Functional diversification of B MADS-box homeotic regulators of flower development: adaptive evolution in protein-protein interaction domains after major gene duplication events. Mol Biol Evol (2007) 24:465–481.[Abstract/Free Full Text]

    Hocking DR. The analysis of linear models (1985) Monterey (CA): Brooks/Cole Pub. Co.

    Huelsenbeck JP, Ronquist F. MRBAYES: Bayesian inference of phylogeny. Bioinformatics (2001) 17:754–755.[Abstract/Free Full Text]

    Jia L, Clegg MT, Jiang T. Excess nonsynonymous substitutions suggest that positive selection episodes operated in the DNA-binding domain evolution of Arabidopsis R2R3-MYB genes. Plant Mol Biol (2003) 52:627–642.[CrossRef][Web of Science][Medline]

    Jia L, Clegg MT, Jiang T. Evolutionary dynamics of the DNA-binding domains in putative R2R3-myb genes identified from rice subspecies indica and japonica genomes. Plant Physiol (2004) 134:575–585.[Abstract/Free Full Text]

    King MC, Wilson AC. Evolution at two levels in humans and chimpanzees. Science (1975) 188:107–116.[Free Full Text]

    Koes RE, Quattrocchio F, Mol JNM. The flavonoid biosynthetic pathway in plants: function and evolution. Bioessays (1994) 16:123–132.[CrossRef][Web of Science]

    Kosakovsky Pond SL, Frost SDW. Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol (2005) 22:1208–1222.[Abstract/Free Full Text]

    Kroon AR. Transcription regulation of the anthocyanin pathway in Petunia hybrida [Dissertation] (2004) [Amsterdam (Netherlands)]: Vrije Universiteit.

    Langercrantz U, Axelsson T. Rapid evolution of the family of CONSTANS LIKE genes in plants. Mol Biol Evol (2000) 17:1499–1507.[Abstract/Free Full Text]

    Liu Y, Wang L, Kermicle J, Wessler S. Molecular consequences of Ds insertion into and excision from the helix-loop-helix domain of the maize R gene. Genetics (1998) 150:1639–1648.[Abstract/Free Full Text]

    Lloyd AM, Walbot V, Davis RW. Arabidopsis and Nicotiana anthocyanin production activated by maize regulators R and C. Science (1992) 258:1773–1775.[Abstract/Free Full Text]

    Lu Y, Rausher MD. Evolutionary rate variation in anthocyanin pathway genes. Mol Biol Evol (2003) 20:1844–1853.[Abstract/Free Full Text]

    Lukens L, Doebley J. Molecular evolution of the teosinte branched gene among maize and related grasses. Mol Biol Evol (2001) 18:627–638.[Abstract/Free Full Text]

    Machado ICS, Sazima M. A comparative study in floral biology of two weed species Ipomoea hederifolia and I. quamoclit Convolvulaceae. Rev Bras Biol (1987) 47:425–436.

    Manos PS, Miller RE, Wilkin P. Phylogenetic analysis of Ipomoea, Argyreia, Stictocardia, and Turbina suggests a generalized model of morphological evolution in morning glories. Syst Bot (2001) 26:585–602.

    Massari ME, Murre C. Helix-loop-helix proteins: regulators of transcription in eucaryotic organisms. Mol Cell Biol (2000) 20:429–440.[Free Full Text]

    McClellan DA, Palfreyman EJ, Smith MJ, Moss JL, Christensen RG, Sailsberry JK. Physicochemical evolution and molecular adaptation of the Cetacean and Artiodactyl cytochrome b proteins. Mol Biol Evol (2005) 22:437–455.[Abstract/Free Full Text]

    Miller RE, Rausher MD, Manos PS. Phylogenetic systematics of Ipomoea (Convolvulaceae) based on ITS and waxy sequences. Syst Bot (1999) 24:209–227.[CrossRef]

    Mol J, Grotewold E, Koes R. How genes paint flowers and seeds. Trends Plant Sci (1998) 3:212–217.[CrossRef][Web of Science]

    Morita Y, Saitoh M, Hoshino A, Nitasaka E, Iida S. Isolation of cDNAs for R2R3-MYB, bHLH, and WDR transcriptional regulators and identification of c and ca mutations conferring white flowers in the Japanese morning glory. Plant Cell Physiol (2006) 47:457–470.[Abstract/Free Full Text]

    Nielsen R, Yang Z. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics (1998) 148:929–936.[Abstract/Free Full Text]

    Park KI, Choi J, Hoshino A, Morita Y, Iida S. An intragenic tandem duplication in a transcriptional regulatory gene for anthocyanin biosynthesis confers pale-colored flowers and seeds with fine spots in Ipomoea tricolor. Plant J (2004) 38:840–849.[CrossRef][Web of Science][Medline]

    Park KI, Ishikawa N, Morita Y, Choi JD, Hoshino A, Iida S. A bHLH regulatory gene in the common morning glory, Ipomoea purpurea, controls anthocyanin biosynthesis in flowers, proanthocyanidin and phytomelanin pigmentation in seeds, and seed trichome formation. Plant J (2007) 49:641–654.[CrossRef][Web of Science][Medline]

    Posada D, Crandall KA. Modeltest: testing the model of DNA substitution. Bioinformatics (1998) 14:817–818.[Abstract/Free Full Text]

    Purugganan MD. The molecular evolution of development. Bioessays (1998) 20:700–711.[CrossRef][Web of Science][Medline]

    Purugganan MD, Rounsley SD, Schmidt RJ, Yanofsky MF. Molecular evolution of flower development: diversification of the plant MADS-Box regulatory gene family. Genetics (1995) 140:345–356.[Abstract]

    Purugganan MD, Wessler SR. Molecular evolution of the plant R regulatory gene family. Genetics (1994) 138:849–854.[Abstract]

    Quattrocchio F, Wing JF, Leppen HTC, Mol JNM, Koes RE. Regulatory genes controlling anthocyanin pigmentation are functionally conserved among plant species and have distinct sets of target genes. Plant Cell (1993) 5:1497–1512.[Abstract]

    Quattrocchio F, Wing JF, van der Woude K, Mol JNM, Koes R. Analysis of bHLH and MYB domain proteins: species-specific regulatory differences are caused by divergent evolution of target anthocyanin genes. Plant J (1998) 13:475–488.[CrossRef][Web of Science][Medline]

    Rausher MD. The evolution of flavonoids and their genes. In: The science of flavonoids—Grotewold E, ed. (2006) New York: Springer. 175–212.

    Rausher MD, Fry JD. Effects of a locus affecting floral pigmentation in Ipomoea purpurea on female fitness components. Genetics (1993) 134:1237–1247.[Abstract]

    Rausher MD, Miller RE, Tiffin P. Patterns of evolutionary rate variation among genes of the anthocyanin biosynthetic pathway. Mol Biol Evol (1999) 16:266–274.[Abstract]

    Remington DL, Purugganan MD. GAI homologues in the Hawaiian Silversword alliance (Asteraceae-Madiinae): molecular evolution of growth regulators in a rapidly diversifying plant lineage. Mol Biol Evol (2002) 19:1563–1574.[Abstract/Free Full Text]

    Ronquist F, Huelsenbeck JP. MYBAYES 3: Bayesian phylogenetic inference under mixed models. Bioinformatics (2003) 19:1572–1574.[Abstract/Free Full Text]

    Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics (2003) 19:2496–2497.[Abstract/Free Full Text]

    Sokal RR, Rohlf FJ. Biometry (1995) 3rd ed. New York: WH Freeman and Co.

    Spelt C, Quattrocchio F, Mol J, Koes R. ANTHOCYANIN1 of Petunia controls pigment synthesis, vacuolar pH, and seed coat development by genetically distinct mechanisms. Plant Cell (2002) 14:2121–2135.[Abstract/Free Full Text]

    Spelt C, Quattrocchio F, Mol JNM, Koes R. anthocyanin1 of Petunia encodes a basic helix-loop-helix protein that directly activates transcription of structural anthocyanin genes. Plant Cell (2000) 12:1619–1631.[Abstract/Free Full Text]

    Swanson WJ, Nielsen R, Yang Q. Pervasive adaptive evolution in mammalian fertilization proteins. Mol Biol Evol (2003) 20:18–20.[Abstract/Free Full Text]

    Weigel D, Meyerowitz EM. Activation of floral homeotic genes in Arabidopsis. Science (1993) 261:1723–1726.[Abstract/Free Full Text]

    Wooley S, Johnson J, Smith MJ, Crandall KA, McClellan DA. TreeSAAP: selection on amino acid properties using phylogenetic trees. Bioinformatics (2003) 19:671–672.[Abstract/Free Full Text]

    Yang Z. PAML: a program package for phylogenetic analysis by maximum likelihood. CABIOS (1997) 13:555–556.[Medline]

    Yang Z, Nielsen R. Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol Biol Evol (2000) 17:32–43.[Abstract/Free Full Text]

    Yang Z, Nielsen R, Goldman N, Pedersen AM. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics (2000) 155:431–449.[Abstract/Free Full Text]

    Yang Z, Swanson WJ. Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes. Mol Biol Evol (2002) 19:49–57.[Abstract/Free Full Text]

Accepted for publication September 27, 2007.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/12/2816    most recent
msm216v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Streisfeld, M. A.
Right arrow Articles by Rausher, M. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Streisfeld, M. A.
Right arrow Articles by Rausher, M. D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?