Skip Navigation


MBE Advance Access originally published online on November 29, 2006
Molecular Biology and Evolution 2007 24(2):465-481; doi:10.1093/molbev/msl182
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/2/465    most recent
msl182v2
msl182v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hernández-Hernández, T.
Right arrow Articles by Alvarez-Buylla, E. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hernández-Hernández, T.
Right arrow Articles by Alvarez-Buylla, E. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Functional Diversification of B MADS-Box Homeotic Regulators of Flower Development: Adaptive Evolution in Protein–Protein Interaction Domains after Major Gene Duplication Events

Tania Hernández-Hernández, León Patricio Martínez-Castilla and Elena R. Alvarez-Buylla

Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad Universitaria, Coyoacán, México D.F., México

E-mail: ealvarez{at}miranda.ecologia.unam.mx.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
B-class MADS-box genes have been shown to be the key regulators of petal and stamen specification in several eudicot model species such as Arabidopsis thaliana, Antirrhinum majus, and Petunia hybrida. Orthologs of these genes have been found across angiosperms and gymnosperms, and it is thought that the basic regulatory function of B proteins is conserved in seed plant lineages. The evolution of B genes is characterized by numerous duplications that might represent key elements fostering the functional diversification of duplicates with a deep impact on their role in the evolution of the floral developmental program. To evaluate this, we performed a rigorous statistical analysis with B gene sequences. Using maximum likelihood and Bayesian methods, we estimated molecular substitution rates and determined the selective regimes operating at each residue of B proteins. We implemented tests that rely on phylogenetic hypotheses and codon substitution models to detect significant differences in substitution rates (DSRs) and sites under positive adaptive selection (PS) in specific lineages before and after duplication events. With these methods, we identified several protein residues fixed by PS shortly after the origin of PISTILLATA-like and APETALA3-like lineages in angiosperms and shortly after the origin of the euAP3-like lineage in core eudicots, the 2 main B gene duplications. The residues inferred to have been fixed by positive selection lie mostly within the K domain of the protein, which is key to promote heterodimerization. Additionally, we used a likelihood method that accommodates DSRs among lineages to estimate duplication dates for AP3–PI and euAP3–TM6, calibrating with data from the fossil record. The dates obtained are consistent with angiosperm origins and diversification of core eudicots. Our results strongly suggest that novel multimer formation with other MADS proteins could have been crucial for the functional divergence of B MADS-box genes. We thus propose a mechanism of functional diversification and persistence of gene duplicates by the appearance of novel multimerization capabilities after duplications. Multimer formation in different combinations of regulatory proteins can be a mechanistic basis for the origin of novel regulatory functions and a gene regulatory mechanism for the appearance of morphological innovations.

Key Words: B MADS-box genes • flower development • positive Darwinian selection • functional diversification • protein evolution • morphological novelties


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
The neo-Darwinian synthesis has provided a solid framework to understand evolutionary processes in terms of changes in the genetic composition of populations (Dobzhansky 1951Go). However, this framework has not considered ontogenetic processes explicitly (Gilbert et al. 1996Go). Any comprehensive theory aiming at explaining the emergence of novel morphological traits and their fixation within populations must feature development because it determines the mapping functions of genetic variation into morphological variation (Alberch 1980Go). Comparative studies of the molecular basis of development can eventually bridge the gap between molecular evolution and phenotypic evolution in a macroevolutionary scale.

In angiosperms, the relatively abrupt and extensive diversification that occurred shortly after they appeared in the fossil record (Magallón et al. 1999Go; Magallón and Sanderson 2001Go) has been considered a classic example of species radiation and has been linked to the appearance of novel structures that allowed a unique reproductive mode: flowers. The genetic bases of flower development have been widely studied in eudicot model species such as Arabidopsis thaliana and Antirrhinum majus since the 1990s. These studies yielded the ABC combinatorial model for floral organ determination (Coen and Meyerowitz 1991Go). Four of the 5 genes in the ABC model belong to the Type II plant MADS-box genes, which have been shown to be key regulators in several plant developmental traits (Alvarez-Buylla, Liljergren, et al. 2000Go; Alvarez-Buylla, Pelaz, et al. 2000Go; Ng and Yanofsky 2001Go). Further studies have extended the interactions between ABC proteins to another set of floral MADS-box proteins encoded by the SEPALLATA genes subfamily to achieve their regulatory functions (Egea-Cortines and Davies 2000Go; Pelaz et al. 2000Go; Honma and Goto 2001). Several studies have yielded evidence indicating that the basic function of floral ABC proteins is conserved across angiosperms and to some extent even in gymnosperms (Bowman 1997Go; Egea-Cortines et al. 1999Go; Kramer and Irish 1999Go, 2000Go; Sündstrom et al. 1999Go; Sündstrom and Engström 2002Go; Whipple et al. 2004Go: Kim et al. 2005Go). A recent network dynamic model that considers the concerted action of ABC and interacting non-ABC proteins has proposed an explanation for the robustness of the floral genetic developmental program among core eudicots (Espinosa-Soto et al. 2004Go).

In a previous study that focused on the complete family of MADS-box genes for A. thaliana, we reported that positive Darwinian or adaptive selection might have been important in fixing specific protein residues after some of the main duplication events leading to the paralogous groups of the gene family (Martínez-Castilla and Alvarez-Buylla 2003Go). However, we were not able to establish whether or not adaptive evolution played a major role in the functional diversification of orthologous genes in the major angiosperm lineages. By comparing sequences of orthologous genes from several angiosperm species in a phylogenetic context, in this work, we address this question in one of the main subfamilies of floral regulatory genes, the so-called B MADS-box genes.

There are 2 lineages of B type genes in angiosperms, named APETALA3-like (AP3) and PISTILLATA-like (PI), after their characterization in A. thaliana. As described in the ABC model of flower development, when coexpressed with A function genes in the floral meristem, B genes specify petal formation, whereas when coexpressed with C function genes, they specify stamen formation (Bowman et al. 1991Go; Coen and Meyerowitz 1991Go; Meyerowitz et al. 1991Go). AP3 and PI proteins interact to form a heterodimer that indirectly regulates PI expression and directly binds to the AP3 promoter in a self-activating regulatory loop that maintains the B function in the second and third whorls of the meristem during flower development (Krizek and Meyerowitz 1996Go; Hill et al. 1998Go; Honma and Goto 2000Go).

It has been postulated that the obligate heterodimerization of PI and AP3 proteins observed in core eudicots (Davies et al. 1996Go; Riechmann et al. 1996Go) evolved from the homodimerization found in gymnosperms, perhaps via a transitory state of facultative homo-heterodimerization (Winter et al. 2002Go). The evolution of heterodimerization in B-class proteins could have been fundamental for the establishment of the floral genetic developmental program in angiosperm lineages, promoting their floral morphological diversification. In this study, we address whether or not adaptive evolution was important during the origin of obligate B-protein heterodimerization, particularly following the critical AP3–PI duplication toward the base of angiosperms.

In a recent study, it was estimated that the duplication leading to the AP3 and PI lineages in angiosperms occurred 260 million years ago (MYA) (Kim et al. 2004Go), shortly after the angiosperm–gymnosperm divergence (fig. 1B), after which the B MADS-box genes underwent a complex pattern of independent duplications. A main duplication occurred in the AP3 lineage, leading to 2 AP3 sublineages distinguished by characteristic motifs in their C-terminal region (Kramer et al. 1998Go). The paleoAP3 lineage is found in basal angiosperms, monocots, magnoliids, and basal eudicots with the proteins possessing a paleoAP3 motif in their C-terminal. Most core eudicots possess a copy of a paleoAP3-like gene, named TM6-like genes after its description in tomato (Pnueli et al. 1991Go). In contrast, the euAP3 lineage is found exclusively in core eudicots B-class genes, possessing the characteristic euAP3 C-terminal motif (see figs. 1A, 1B, and 4A). Experimental evidence based on Arabidopsis genetic transformations using chimeric B proteins have shown that the C-terminal domains of AP3 and PI as well as the 2 distinct AP3 genes paleoAP3 and euAP3 are functionally divergent (Lamb and Irish 2003Go). A further study in Petunia hybrida showed that PethDEF (euAP3-like) and PethTM6 (paleoAP3-like) are also functionally divergent, with PethTM6 having a more important role in stamen specification and weaker expression in petals (Vandenbussche et al. 2004Go).


Figure 1
View larger version (43K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— A representation of the angiosperm phylogeny reconstructed with molecular data, modified from APG II (2003) (A). The main duplications of B MADS-box genes (B) are coincident with the origin of angiosperms (AP3–PI duplication) and the origin of the core eudicots clade (euAP3–TM6). The evolution and duplications of B genes are also coincident with the evolution of obligate heterodimerization between AP3 and PI paralogues found in Arabidopsis from homodimerization found in gymnosperms (C). It is also possible that the evolution of perianth morphology is related with B MADS-box gene duplications and heterodimerization evolution. At the base of angiosperms, there is not a clear differentiation between sepals and petals, which in turn characterize flowers of core eudicot species (D).

 
Core eudicots are the most abundant angiosperm lineage, comprising almost 73% of extant angiosperm species (Drinnan et al. 1994Go), thus representing the largest evolutionary radiation within the flowering plants. Members of this group show a relatively invariant differentiation of their floral perianth into distinct calyx and corolla (sepals and petals). Congruent with this morphological pattern, the expression patterns and function of B genes also seem to be well conserved (Bowman 1997Go; Kramer and Irish 1999Go). In contrast, B MADS-box genes have divergent expression patterns in noncore eudicots (Kramer and Irish 1999Go; Kim et al. 2005Go), whose flowers do not have a well differentiated calyx and corolla (fig. 1D). Molecular phylogenies show a possible duplication of A, B, C, and the subfamily of SEP1/2/4 floral MADS-box genes close to the base of the core eudicot clade (Kramer and Hall 2005Go; Zahn et al. 2005Go). In the case of B genes, this duplication gave rise to the euAP3 and TM6 (paleoAP3-like genes) lineages (Kramer and Hall 2005Go). Based on sequence analyses and experimental evidence, it has been hypothesized that this duplication played an important role in the origin of clearly differentiated petals in core eudicots (Kramer et al. 1998Go; Lamb and Irish 2003Go).

Using statistical comparisons in a likelihood framework, we tested for significant differences in the substitution rates (DSRs) and for positive selection (PS) at individual sites of the protein along the main B gene lineages, to test key hypotheses regarding the evolutionary roles played by the 2 main duplications: 1) the one leading to AP3 and PI lineages in angiosperms, to explore the evolutionary forces that could have played a role in driving the evolution of heterodimerization and 2) the one leading to the euAP3 and TM6 lineages because this duplication could have been key in the evolution of the floral developmental mechanisms characteristic of the core eudicots. Additionally, we estimated divergence dates with these data and compared our results with the information in the angiosperm fossil record.

Our results strongly suggest that shortly after the duplication that lead to the AP3 and PI genes, functional diversification driven by PS acting on different sites within the K domain, which is key for heterodimerization, occurred along both duplicated gene lineages. We report an estimated date for this duplication at 290 MYA, which is closer to the divergence of angiosperms and gymnosperms registered in the fossil record as compared with a previously reported date (Kim et al. 2004Go). Our results suggest an early origin for heterodimerization of B MADS-box proteins in angiosperms shortly after they diverged from gymnosperms. Moreover, the estimated date for the euAP3TM6 duplication coincides with the origin of the core eudicots lineage. This result, together with the strong signal for PS along the euAP3 branch lineage, suggests a possible functional divergence of AP3 duplicates that might have been important for the evolution of the core eudicot floral developmental genetic program.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Sequences
Full-length DNA sequences for coding regions of B MADS-box genes were obtained from GenBank at the National Center for Biotechnology Information Database (http://www.ncbi.nlm.nih.gov/) using Blast and key word searches for each plant family sensu the APG II (2003), aimed at obtaining a homogeneous taxonomic sampling across angiosperm lineages. Accession numbers as well as species names and plant families for each gene are presented in table 1. We sorted the sequences into 3 data sets: one for B MADS-box genes, another including a richer sampling exclusively for PI genes, and a third with a richer sampling for AP3 genes to obtain improved alignments and phylogenetic resolution inside each lineage.


View this table:
[in this window]
[in a new window]

 
Table 1. Sequences Used in This Study

 
Alignment Strategy
Downloaded sequences were edited to identify correct open reading frames. We obtained preliminary amino acid alignments for each of our 3 data sets using MUSCLE (Edgar 2004Go). Independently, amino acid sequences were analyzed using MEME, an algorithm based on likelihood methods that finds highly conserved regions or motifs in groups of related protein sequences (Bailey and Elkan 1994Go, 1995Go). Preliminary protein alignments were edited manually using BioEdit (Hall 1999Go), first following previously reported alignments (Kramer et al. 1998Go) and finally following MEME results. Especially in regions of the protein with ambiguous alignment (the C-terminal region), we attempted to recover MEME motifs during manual edition. Corresponding nucleotide sequences were aligned following these amino acid alignments to obtain final codon alignments. We constructed a second data set by editing these original codon alignments using Gblocks (Castresana 2000Go). Gblocks eliminate poorly aligned positions and divergent regions, and we used it especially to eliminate regions with a high number of in-dels, particularly in the C-terminal, without losing characteristic conserved motifs. Results from these alignments are emphasized. Alignments are shown in the supplementary material.

Phylogenetic Analysis
We performed the phylogenetic analysis using Bayesian methods as implemented in MrBayes 3.1.1 (Huelsenbeck and Ronquist 2001Go) with our edited and nonedited codon alignments. The substitution DNA model was selected with the help of Modeltest by using the Akaike Information Criterion for first + second and third codon positions separately (Posada and Crandall 1998Go). Models found were implemented in MrBayes, and partitioned analyses were performed when we found different models for different codon positions. Searches were started from a random tree with a plain prior on 4 different Markov chains in 2,500,000 generations saving every 100th tree. At convergence (around 20,000 generations), confirmed visually from the graphs of the log likelihood scores, we discarded the first 3,000 trees as burn-in and constructed a consensus using the Bayesian posterior probabilities (PPs) to evaluate clades support.

Statistical Tests for DSR and PS Detection
To detect the selective regimes operating on each coding site (codon) of B proteins, we performed statistical tests based on models of DNA substitution by codons that rely on estimates of the ratio {omega} = dN/dS of nonsynonymous substitution rates (dN) relative to synonymous substitution rates (dS). Estimated {omega} values can provide an unambiguous criterion to evaluate the selective regimes operating at the molecular level, expecting {omega} values {cong} 1 for neutrality, {omega} < 1 for purifying selection, and {omega} > 1 for positive adaptive selection.

We estimated likelihood scores using 2 different kinds of codon models. The first kind comprises the so-called site-specific models (Nielsen and Yang 1998Go; Yang et al. 2000Go) and were designed to detect heterogeneous selection pressures and PS among sites (codons) if the average nonsynonymous substitution rate over all lineages is higher than the average synonymous rate. The second kind, the so-called branch-site models, has the same objective but exclusively in particular lineages of a phylogeny, allowing the {omega} ratio to vary across sites and across lineages and detecting codon specific PS or DSRs in preselected branches of a given tree regardless of the average dN/dS ratio. Likelihood scores obtained given such codon models were compared against null models that imply homogeneity in substitution rates and neutrality or purifying selection at all sites and branches, statistically accepting PS and DSR only if the improvement in likelihood scores obtained with the PS/DSR codon models was significant. Model comparisons were performed using likelihood ratio tests (LRTs), which require nested hypotheses of models being compared. Degrees of freedom are equal to the number of extra parameters in the alternative hypothesis. The LRT statistic equals twice the difference of log likelihood scores of models compared (2{Delta}l = 2(l2l1)), and its distribution can be approximated with a chi-squared test; thus, statistical significance was evaluated using a test of goodness of fit for this statistic (Yang and Nielsen, 2002Go). The calculations were performed following the algorithms implemented in version 3.14 of the PAML package (http://abacus.gene.ucl.ac.uk/software/paml.html).

The site-specific models used were 0, 1a, 2a, 3, 7, and 8 according to the nomenclature and methods described in Yang et al. (2000)Go and in the PAML documentation. These models do not account for molecular rate shifts or PS in particular lineages, thus averaging over all sequences in the phylogeny. PS is difficult to detect with these models because it often operates episodically on few amino acids in a small number of taxa. In contrast, branch-site models have more power to detect DSR and PS on specific lineages because they allow for branches prespecified in the phylogeny to have a class of sites with a different {omega} from the rest of the tree (model D) or for a class of sites with {omega} > 1 (model A) (Yang and Nielsen 2002Go; Bielawski and Yang 2004Go). Using branch-site models, we tested branches of main lineages before and after important duplication events during the evolution of B genes, which represent potential points for the evolution of possible newly acquired biochemical properties and roles in the floral developmental program following these duplications. We repeated this procedure for different branches in the PI, AP3, and B MADS-box gene Bayesian phylogenies (indicated with arrows in figs. 2 and 3). Branches selected are assigned as "foreground" and as such have additional site classes that allow for PS, whereas the other branches in the phylogeny comprise the "background" branches. First, we used codon model D as an indicator of functional divergence as described in Bielawski and Yang (2004Go) that was designed to detect substitution rate shifts not necessarily due to PS. Then, for PS detection, we used codon model A (Yang and Nielsen 2002Go) that was designed to infer PS in the preselected tree branches.


Figure 2
View larger version (36K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— B type MADS-box gene Bayesian phylogeny. Constructed from the codon alignment edited with Gblocks. Numbers above or below branches represent the Bayesian PPs for clades. Branch lengths are proportional to number of nucleotide substitutions. We ran 2,500,000 generations, saving each 100 trees and discarding the first 300 as burn-in. Branches selected as hypothesis tested for significant DSR and PS are indicated with arrows.

 

Figure 3
View larger version (34K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 3.— Bayesian phylogenies for AP3 genes (A) and PI genes (B) constructed from the codon alignment edited with Gblocks and with the nonedited alignment, respectively. Numbers above or below branches represent the Bayesian PPs for clades. Branch lengths are proportional to number of nucleotide substitutions. For each data set, we ran 2,500,000 generations, saving each 100 trees and discarding the first 300 as burn-in. Different branches selected as hypothesis tested for significant DSR and PS are indicated with arrows.

 
Simulation studies have shown that Yang's branch-site tests to detect PS in specific branches are sensitive to model assumptions and unable to distinguish between relaxation of selective constraints and PS (Zhang 2004Go). Although relaxation of constraints can suggest important changes in the biochemical properties of proteins, we additionally implemented the modified models and the more conservative branch-site LRT test 2 described in Zhang et al. (2005)Go, which are robust to violations of model assumptions.

In the case of alternative models that allow for PS (model 2, 8, and model A), the PP that a site (codon) was drawn from a given dN/dS class was calculated using a Bayes Empirical Bayes approach that accounts for sampling errors in maximum likelihood estimates of model parameters and that have shown to be conservative and to have desirable statistical properties (Yang et al. 2005Go). We report only those sites with a PP > 0.9 to belong to a class under PS or {omega} > 1. To evaluate the impact of alignment ambiguities and in-dels in our results, we repeated every test with our Gblocks edited alignments.

Estimation of Divergence Dates
We estimated divergence dates for major duplication events in the phylogenies reconstructed for B genes and AP3 genes using the heuristic rate-smoothing algorithm (the AHRS algorithm) described in Yang (2004)Go and implemented in PAML version 3.14. The calibration date used for estimating the AP3–PI duplication date in the B MADS-box phylogeny (fig. 2) was 121 MYA as minimum for eudicot origins inferred from tricolpate pollen fossil grains described in Doyle and Hotton (1991)Go, and for the euAP3–TM6 duplication in the AP3 phylogeny (fig. 3A), we used a calibration date of 132 MYA as minimum for the origins of angiosperms inferred from fossil pollen grains described in Brenner (1996)Go. Following the AHRS algorithm, the branch lengths for the topologies obtained in our Bayesian analysis were estimated first by maximum likelihood following the pruning algorithm of Felsenstein (1981)Go. Then the substitution rates and divergence dates were obtained as preliminary values. A stochastic model of changes in the substitution rates was adjusted with the estimated branch lengths, and the obtained values were used to classify branches into groups according to their rates following maximization by maximum likelihood, thus providing an objective method to consider differences in the branch lengths and substitution rates for different branches, and to enable their classification into different groups. In the final step of the AHRS algorithm, the divergence times and substitution rates were estimated (Yang 2004Go).


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Phylogenetic Analyses of B MADS-Box Lineages
We obtained phylogenetic hypotheses for the entire set of B MADS-box sequences as well as AP3 and PI sequences separately using Bayesian methods with edited and nonedited alignments and evaluated clade support with the PP of each node. Figure 2 shows the consensus phylogeny obtained for B gene sequences with edited alignments. As expected, we recovered the AP3–PI clade, which comprises angiosperm sequences exclusively, with a high PP support value (0.99 PP). As sister sequences of this AP3–PI clade, we recovered the clade of B-sister genes (1.00 PP) in a poorly supported position (0.34 PP). At the base of angiosperm B genes and B-sister genes, we recovered a grade of gymnosperm B genes.

Figure 3A and B shows Bayesian phylogenies for AP3 and PI lineages, respectively, rooted with the sequences of the basal angiosperm Nymphaea. In both trees, the main clades of angiosperms sensu APG II (2003) were recovered as monophyletic with reasonable PP values. The relationships between main clades were different in each paralogous lineage, revealing a different evolutionary trajectory that may indicate differences in their functional diversification processes.

The trees with higher PP values at nodes for AP3 genes were obtained with nonedited alignments in a partitioned analysis, with a substitution model including 2 substitution types for first and second codon positions and 6 substitution types for third positions. In the case of PI gene trees, higher support values were obtained with complete sequences using 6 substitution types in the model.

Heterogeneous Substitution Rates within B MADS-Domain Protein Sites
We used different site-specific codon substitution models to make inferences regarding among-site substitution rate heterogeneity in B MADS-box proteins and then addressed whether or not some sites had been fixed by PS. We performed analyses for the AP3, PI, and the entire set of B gene sequence alignments with their corresponding phylogenetic hypotheses separately, repeating every test with edited alignments that contained almost exclusively unambiguously aligned positions. Results were nearly identical with both data sets, and we focus on those derived from alignments that exclude ambiguous regions. In a first analysis, we did not consider differences in rates at particular branches within the phylogeny, thus assuming homogeneous selective pressures along all angiosperm and gene lineages. Results are summarized in supplementary table A (Supplementary Material online), whereas model comparisons performed with LRTs as well as the LRT statistic values and P values are given in table 2.


View this table:
[in this window]
[in a new window]

 
Table 2. LRTs Done to Detect Heterogeneous Selection Regimes between Sites and PS in B MADS-Box Genes, PI, and AP3 Genes

 
Anisimova et al. (2001)Go have pointed out the impact that sequence divergence can have on the power of LRT tests for detection of PS among sites that use models 0, 1, 2, 3, 7, and 8 from Yang et al. (2000)Go. They measure sequence divergence as tree length because given the appropriate algorithm, tree branches reflect expected number of nucleotide changes. To evaluate sequence divergence in our data sets objectively, we used 2 approaches. First, we used maximum likelihood to estimate the branch lengths of our trees using the codon substitution model M0, to obtain branch lengths that reflect number of nucleotide changes per codon. Graphs showing the branch lengths for these trees are shown in the supplementary material. They show that the divergence within our data set is not high, and thus, we do not expect false positives due to saturated substitution rates (also Yang Z, personal communication, see Discussion at http://www.rannala.org/phpBB2). As a second test, we obtained branch lengths for our trees including only third codon positions by using the NJ algorithm, setting the GTR + I + G (general time reversible model with a proportion of invariant sites and a gamma distribution for the variation of rates among sites) nucleotide substitution model as distance measure. Here, branch lengths represent the expected number of changes at third codon sites. In this case, large branch lengths can indicate saturation. Distances for each branch were plotted and are shown in the supplementary material. To guarantee avoiding false positives due to saturation, we decided to exclude from the substitution rate analyses the sequences with distances greater than 2.0 from others. As expected, this was the case of out-group sequences but also of Peperomia hirta AP3 and PI-like sequences in the B MADS-box gene phylogeny, as well as SagmAP3, LilrDEF, TaccAP3, ChlsAP3, and CucpAP3 in the AP3 phylogeny. These sequences were also excluded from the substitution rates analyses.

We used the Model M0 to estimate a general {omega} value for each data set. The {omega} was very similar for all data sets used ({omega} = 0.151 for all B MADS-box genes, {omega} = 0.163 for PI genes, and {omega} = 0.170 for AP3 genes; supplementary table A, Supplementary Material online), suggesting a similar selection regime in both lineages and strong purifying selection operating on the majority of sites. Among the models implemented, M2a and M8 imply PS by adding a class of sites allowed to be under {omega} > 1. M8 is an elaborate codon model that introduces the beta distribution to model among-site heterogeneity and adds one extra class for sites under PS. Using this model and edited alignments, we did not detect any sites that could have been fixed by PS in AP3 and PI genes. However, when nonedited alignments were used, a few sites that could have been fixed by PS in AP3 and PI genes were detected (see Results in supplementary table A, Supplementary Material online). Although LRTs done to detect PS with site-specific models failed, LRTs performed to evaluate among-site heterogeneity in {omega} ratios between sites (M0 vs. M3 with K = 3, M0 vs. M3 with K = 2, and M3 K = 2 vs. M3 K = 3) without implying PS resulted in a rejection of the null hypotheses in all cases, with both edited and nonedited alignments (table 2).

Shifts in Substitution Rates and Changes in Protein Residues Fixed by PS after the AP3PI Duplication
It is difficult to detect PS using site-specific models if it has occurred only on particular branches or in certain lineages within the phylogeny because averaging over all lineages might hide the signal for PS or DSR. It is reasonable to expect molecular rate shifts and positive adaptive selection after gene duplications when functional diversification processes affect one or both duplicates. Therefore, we decided to assess whether or not signatures of molecular rate shifts were detectable and whether or not PS had fixed certain amino acids at particular moments of B MADS-box gene evolution, especially following the main duplication events. To achieve this, we used branch-site–specific codon models that detect DSR and PS at specific coding sites and in particular branches of the phylogenetic tree. We focused on 2 key duplications during the evolution of B genes that coincide with important events in angiosperm evolution (See fig. 1B): 1) the AP3PI duplication before angiosperm diversification, which is key for the origin and evolution of heterodimerization and during which PS might have been important in fixing some key protein residues and 2) the euAP3TM6 duplication within the AP3 genes in the clade recovered for the core eudicots, which correlates with the origin of differentiated petals in a bipartite perianth with distinct calyx and corolla in this important angiosperm group. We also selected branches before duplications as contrasting hypotheses. Branches tested for DSR and PS in the B gene phylogeny are indicated in figure 2 (B-H1–B-H3), whereas branches tested in the AP3 and PI lineages are indicated in figure 3A for AP3 (AP3-H1 to AP3-H3) and in figure 3B for PI (PI-H1) phylogenies.

LRTs performed to detect DSR in selected branches are shown in table 3, wherein we compare model D with model M3 with 2 site classes. Model D is useful in detecting DSR in branches selected as foreground lineages without assuming PS (Bielawski and Yang 2004Go), thus giving insights into functional diversification processes after duplications. The ancestral branch leading to the AP3–PI angiosperm clade (B-H1) was found, however, to have been under significant DSR with both alignments. This could imply slight changes in evolutionary rates early after the origin of angiosperms, before their crown group diversification. With this LRT, we also detected significant DSR after the AP3–PI duplication in the branches leading to both lineages.


View this table:
[in this window]
[in a new window]

 
Table 3. LRT Tests for Significant DSRs for Each Foreground Branch

 
To evaluate whether or not the detected molecular rate shifts could have been due to PS at particular sites along those branches we used codon model A, which featured an extra class of sites with {omega} values calculated from the data as free parameters in foreground or selected branches, thus allowing them to assume values of {omega} > 1 (Yang and Nielsen 2002Go; Zhang et al. 2005Go). LRTs, parameter estimates for classes under PS, and sites detected are listed in table 4, and a summarized diagram of results for the lineage-site analyses is shown in figure 4. We implemented Test 1 and Test 2 as described in Zhang et al. (2005)Go. In particular, Test 2 has been shown to be conservative and more robust than Test 1 with simulated and empirical data sets (Zhang et al. 2005Go). Unless otherwise stated, all results presented are for the edited alignments.


View this table:
[in this window]
[in a new window]

 
Table 4. Likelihood Scores and LRTs 1 and 2 Comparisons for PS in Selected Lineages

 

Figure 4
View larger version (36K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 4.— (A) Schematic representation of the evolution of B MADS-box genes within angiosperms and gymnosperms and the C-terminal motifs characteristic of each gene lineage. Thick lines in the gene phylogeny show branches with significant DSRs detected with LRT comparisons. Arrows indicate branches where PS was detected with statistical significance. The sites found to be under PS (PP > 0.9) in those branches are indicated with the number in our complete alignment and corresponding residue in the Arabidopsis PI or AP3 protein. The protein domain where they are located is shown inside brackets. Estimated dates for the AP3–PI and euAP3–TM6 duplications are shown. (B) Sites detected to be under PS in PI, AP3, and euAP3 lineages are mapped in the K domain of PISTILLATA and APETALA3 from Arabidopsis. Lines indicate the 3 putative amphipathic {alpha}-helices, K1, K2, and K3; and regions critical for PI/AP3 and for PI/SEP3 interaction according to Yang and Jack (2004)Go.

 
PS was significantly supported for the ancestral branch leading to the angiosperm AP3–PI clade (B-H1) only using edited alignments. Interestingly, with both data sets, we detected a clear signal of PS after the AP3–PI duplication, in the PI lineage (B-H2) and in the AP3 lineage (B-H3), where the statistical support for PS was even stronger (table 4). We detected several sites under PS in both lineages with the Bayes Empirical Bayes approach, but we report here only the ones with PP values greater than 0.9. Remarkably, all sites inferred to have been fixed by PS are within the K domain. In figure 4B, we show the position of sites detected mapped in the K domain of B-protein sequences from A thaliana following Yang and Jack (2004)Go. According to their position in the PISTILLATA and APETALA3 protein sequences of A. thaliana, we detected that site 100N with 0.93 PP and 142A with 0.94 PP were fixed by PS in the PI branch lineage. Both fall within the K domain region. In the branch leading to AP3 clade, we detected site 130E with 0.95 PP, also in the K domain.

PS after the euAP3TM6 Duplication in Core Eudicots
To gain insight into the functional diversification of AP3 and PI genes in core eudicots, we tested for DSR and PS in B genes of this clade. Branches selected as hypotheses or foreground branches are indicated with arrows in figure 3A and B. We focused especially on the euAP3TM6 duplication of AP3 genes of core eudicots, in a manner analogous to that used for the AP3PI duplication in angiosperms. Table 3 shows the LRT comparisons performed with log likelihood scores and the results obtained for DSR tests in foreground branches tested on AP3 and PI phylogenies. According to our results, there are no significant DSRs in the branches leading to core eudicot AP3 or PI genes. Nor did we detect DSR in the branch leading to the euAP3 lineage in the AP3 phylogeny, but interestingly we observed a strong signal of significant DSR in the TM6 branch lineage.

Two alternative scenarios could account for our observation of DSR in the TM6 and euAP3 branches. On one hand, when DSRs were detected, PS could have acted throughout sites. Alternatively, when DSRs were not detected, PS could have acted on only a very small proportion of sites in foreground branches, thus yielding a hidden signal in DSR tests. To distinguish between these possibilities, we applied model A to the same branch-selection tests. Results are shown in table 4. We did not obtain a signal of PS for the ancestral core eudicot AP3 (AP3-H1) or PI (PI-H1) branches or in the branch leading to core eudicot TM6 lineage, but PS was detected with a strong signal in the branch leading to the euAP3 genes (AP3-H3) (see fig. 4A). The conflicting results of DSR and PS analyses in the euAP3 (DSR tests negative, PS positive) and TM6 (DSR tests positive, PS negative) branches could be an indication that there were DSRs within the TM6 branch early in the origin of this lineage but that these were not due to PS. In contrast, in the euAP3 lineage, substitution rates were overall homogeneous, probably because their functional diversification depended on a very small number of amino acid substitutions that were indeed fixed by PS.

Some of the protein residues inferred to have been fixed by PS in the euAP3 lineage were also found within the K domain, whereas others were found within the C-terminal domain (fig. 4). According to their correspondence to the APETALA3 protein sequence of A. thaliana, sites 148R (0.93 PP), 162C (0.94 PP), and 191K (0.95 PP) are within the K domain, whereas sites 305P (0.99 PP), 313S (0.98 PP), 316I (0.98 PP), 317T (0.97 PP), and 318F (0.97 PP) correspond to sites on the characteristic euAP3 motif in the C-terminal region of the protein. It is noteworthy that all sites detected to have been fixed by PS were corroborated in the analysis with both data sets, for the edited and the nonedited alignments.

Estimation of Divergence Dates
We used the Bayesian phylogenies generated to estimate divergence times of major duplication events in B MADS-box genes. For the AP3–PI duplication, we obtained an estimated date of 290 MYA, earlier than ~260 MYA (230–290 MYA) estimated in previous studies (Kim et al. 2004Go) and closer to the angiosperm–gymnosperm divergence that occurred shortly after the origin of the seed plants 290–309.2 MYA (Mapes and Rothwell 1984Go, 1991Go). The date of the euAP3–TM6 duplication was estimated at 92.3 MYA, which is relatively later than Fagales fossils dated at 96.2 MYA and thought to be coincident with the radiation of the core eudicots (Doyle and Hotton 1991Go; Magallón and Sanderson 2001Go).


    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Several studies have addressed the evolutionary patterns of floral developmental genes at population levels, showing evidence of distinct forces driving the evolution of different loci and including the signal of adaptive evolution in some such as LFY or TLF1 (Purugganan and Suddith 1999Go; Olsen et al. 2002Go; Moore et al. 2005Go). In our study, we used likelihood based statistical methods to explore the evolutionary patterns of floral developmental B MADS-box genes of flowering plants, which have been shown to play key roles in petal and stamen development (Bowman et al. 1991Go; Coen and Meyerowitz 1991Go; Meyerowitz et al. 1991Go). This approach allowed us to detect changes in substitution rates in particular gene and angiosperm lineages and to infer whether or not these rate shifts were due to positive (adaptive) selection at specific residues in the different protein domains. With our substitution rate estimates and statistical tests, we outline here an evolutionary scenario for the functional diversification of B genes in which adaptive selection at strategic protein residues in duplicated gene lineages played a major role. We finally propose a model for the maintenance and diversification of regulatory gene duplicates that is relevant for the explanation of the genetical basis of the appearance of novel morphological traits.

We focused on the 2 main duplications along the evolutionary history of B MADS-box genes, the AP3–PI and the euAP3–TM6 duplication. Phylogenetic analyses show that the main AP3–PI duplication occurred at the base of angiosperms and that the euAP3–TM6 duplication occurred at the base of core eudicots (Kramer et al. 1998Go, 2004Go). Both duplications occurred before the crown group diversification of each plant lineage, linked with the appearance of novel morphological structures conforming flowers in angiosperms and the establishment of the floral pattern characteristic of core eudicots. Experimental studies have demonstrated that the AP3 and PI lineages as well as the euAP3 and TM6 lineages exhibit functional divergence, each having relatively distinct biochemical roles in the floral developmental genetic program (Lamb and Irish 2003Go; Vandenbussche et al. 2004Go). Our estimates of duplication dates indicate that the AP3–PI duplication occurred 290 MYA, 30 million years earlier than previously thought (Kim et al. 2004Go), placing this duplication closer to the origin and divergence of angiosperms and gymnosperms 290–309.2 MYA (Mapes and Rothwell 1984, 1991). We also estimated that the euAP3–TM6 duplication occurred 92.3 MYA, perhaps during the time span of origin and amazing radiation of the core eudicot clade around 96 MYA (Brenner 1996Go). Taken together with our PS and DSR test results, these dates suggest the relevance of the main B gene duplications in the origins of the genetical mechanisms of the floral developmental program at the base of angiosperms and its establishment in core eudicots.

In our statistical tests, we found a strong signal of PS after the AP3–PI duplication, compelling evidence that positive adaptive selection was instrumental for the rapid functional diversification of B genes during angiosperm species radiation. Moreover, the sites inferred to have been fixed by PS in AP3 and PI lineages are found within the K domain (fig. 4B), that is crucial for the protein–protein interactions among MADS-domain proteins required for achieving their regulatory activities (Fan et al. 1997Go; Yang et al. 2003Go). Yang and Jack (2004)Go observed a strong correlation between alterations in the protein–protein interactions detected with yeast 2 hybrid experiments and the plant rescue or overexpression floral phenotypes of lines expressing different B proteins with mutations in the K domain, thus suggesting a correlation between protein–protein interaction strength in these regulatory proteins and the emergence of different phenotypes.

It has been postulated that the K domain folds into 3 amphipathic {alpha}-helices referred to as K1, K2, and K3 separated by interhelical regions, each conferring different dimerization specificities (Yang et al. 2003Go). Remarkably, one of the sites inferred under PS, 100N in the PI lineage, falls within the K1 region that has been shown specifically to control the strength of the PI/AP3 interaction (Yang an Jack 2004). In addition, site 142A, found to have been fixed by PS also on the PI lineage, is in the K2 region, whereas site 130E in the AP3 lineage is in the interhelical region between K1 and K2, which has been shown to be important in PI for both the PI/AP3 and the PI/SEP3 interaction (Yang and Jack 2004Go). Congruent to a remarkable degree with our results is the finding that a PI protein mutant in site 142A (A127P in Yang and Jack 2004Go) showed severe defects in the PI/SEP3 interaction (Yang and Jack 2004Go), and other mutants with severe defects in protein–protein interactions reported in Yang and Jack (2004)Go include mutations in sites reported here as fixed by PS. These data provide partial experimental support for the functional relevance of the sites identified in this study to have been fixed by PS. The changes detected here in the protein–protein interaction domains of B sequences could have been fixed by PS for different multimeric combinations and downstream specificities affecting flower development and its evolution during angiosperm diversification. Our results indicate the direction of further experimental work.

It has been suggested that the obligate heterodimerization between the AP3 and PI proteins observed in core eudicots evolved from homodimerization because a B-like protein from Gnetum gnemon was shown to be capable to bind DNA in a sequence-specific manner as a homodimer in vitro (Winter et al. 2002Go, see fig. 1). LMADS1, an AP3 protein from the monocot Lilium longiflorum was also observed to be capable to form homodimers as well as heterodimers with the PISTILLATA protein of Arabidopsis in vitro (Tzeng and Yang 2001Go; Tzeng et al. 2004Go). These studies suggested that this "facultative" condition could be an intermediate evolutionary state between homodimerization and heterodimerization, which could have appeared after the monocot divergence 140–200 MYA (Winter et al. 2002Go). In contrast, our data suggest that heterodimerization appeared shortly after the split between angiosperms and gymnosperms. At this point in evolution, which coincides with the AP3–PI duplication, strategic changes in residues within the protein–protein interaction domain appear to have been fixed by PS. In concordance with our suggestion, experimental studies of B genes from the monocot Zea mays showed that B function is conserved between maize and Arabidopsis (Whipple et al. 2004Go). In maize, SL1 (AP3) and ZMM16 (PI) form obligate heterodimers to bind DNA and also interact forming heterodimers with their respective orthologs in Arabidopsis (ZMM16/AP3, SL1/PI) to be functional both in vitro and in vivo (Whipple et al. 2004Go). It has also been reported the obligate functional heterodimerization in rice (Moon et al. 1999Go).

The hypothesis of the intermediate "facultative" state between functional homodimerization and obligate heterodimerization relies on the assumption that monocots are the intermediate lineage between gymnosperms and core eudicots. However, the phylogenetic position of monocots relative to eudicots and basal angiosperms is not yet resolved (APG II 2003Go). Monocots represent relatively derived lineages, and it is possible that the observed in vitro homodimerization of LMADS1 in Lilium is a reversion to a plesiomorphic state. Alternatively, the Lilium in vitro results may not hold in vivo, and this taxon also could have a conserved B function with respect to Arabidopsis. Therefore, additional in vivo functional assays of homo and heterodimer B proteins in monocots and other basal angiosperms as well as a molecular evolutionary analysis with more monocot sequences are needed.

Another important duplication occurred during the evolution of AP3 genes that coincides with the origin of core eudicots, the euAP3–TM6 duplication, that could have had an impact in the evolution of flower development in this speciose group of flowering plants (Lamb and Irish 2003Go; Vandenbussche et al. 2004Go). The floral structure in core eudicots is characterized by a fixed number of floral organs in each whorl and a perianth with a distinct calyx and corolla, suggesting a canalization of the floral developmental program to morphologically differentiated sepals and petals, in contrast to the more plastic and undifferentiated perianth structure found in noncore eudicot species (Drinnan et al. 1994Go; Magallón et al. 1999Go; Lamb and Irish 2003Go). Core eudicots share 2 paralogous B genes that differ in their C-terminal characteristic motifs, named euAP3 genes and TM6-like genes. The latter shares the ancestral paleoAP3 motif in their C-terminal domain, which is found in monocots, basal angiosperms, and basal eudicots, whereas the euAP3 lineage is characteristic of core eudicots (fig. 4A). The appearance of the euAP3 lineage has been postulated to be important in the evolution of the floral developmental program characteristic of this lineage (Lamb and Irish 2003Go).

We found strong evidence for PS after the euAP3–TM6 duplication (figs. 3A and 4A) only in the euAP3 lineage, in coincidence with available evidence for the functional divergence of these lineages (Lamb and Irish 2003Go; Vandenbussche et al. 2004Go). In our analysis, we detected sites 148R, 162C, and 191K to have been fixed by PS in the branch leading to the euAP3 clade. Site 148R is in the K1 subdomain, important for PI/AP3 dimerization (Yang and Jack 2004Go). Site 162C is in the interhelical region between K1 and K2. In PI, those regions are important for PI/AP3 and PI/SEP3 dimerization. Site 191K falls within the interhelical region between K2 and K3, which in PI protein of Arabidopsis has been experimentally shown to be specifically important for the PI/SEP3 (or SEP1) dimerization. Mutations in K2 and K3 are shown to be defective in forming AP3/PI/SEP1 ternary complexes (Yang and Jack 2004Go). In the euAP3 lineage, we also detected several sites under PS that correspond to sites on the characteristic euAP3 motif in the C-terminal region of the protein. It was observed that a frameshift mutation due to a single nucleotide deletion in the ancestral C-terminal paleoAP3 motif leads to the appearance of the euAP3 motif (Kramer et al. 2006Go), and it has been shown experimentally that these motifs are responsible for the functional diversification of euAP3 and paleoAP3 genes (Lamb and Irish 2003Go).

Our results reveal that in contrast to the AP3–PI duplication that fostered the possibility for acquisition of new functions by fixation of amino acid changes by PS in both duplicates in a symmetrical pattern, the euAP3–TM6 duplication was followed by an asymmetrical pattern of diversification, in which one gene copy retained the ancestral states (TM6 lineage), whereas in the other, new residues were fixed by adaptive evolution (euAP3 lineage). Whipple et al. (2004)Go experiments suggest that paleoAP3 sequences from Z. mays (SILKY1) function relatively well in Arabidopsis, which only has a euAP3 paralogue. However, future experiments should be performed with core eudicot species possessing both duplicates and additional complementation analyses and not only overexpression experiments must be done to better address functional conservation between paleoAP3 proteins from monocots and AP3 proteins from core eudicots.

It has been suggested that new sequences in the C-terminal domain evolving in euAP3 genes in core eudicots could have been associated with the evolution of differentiated petals (Kramer et al. 1998Go; Lamb and Irish 2003Go). Since their discovery, the importance of characteristic motifs in the C-terminal domain of B genes in the functional specificity of the proteins and in the evolution of B function has been emphasized. This has stimulated a search for similar motifs in other floral regulatory genes (Johansen et al. 2002Go; Kramer et al. 2004Go). However, in a recent study in Pisum sativum (Fabales, eurosids I), it was shown that there is a functional conservation of PISTILLATA activity in its PsPI homolog, that is an atypical PI-type polypeptide that lacks the highly conserved C-terminal PI motif (Berbel et al. 2005Go), suggesting the importance of other domains in the functional diversification of B function. Our study emphasizes the importance of the K domain in addition to the COOH region in the functional diversification of B MADS-box genes. The strong signal for PS detected for particular sites within both the K and C-terminal domains in euAP3 genes also suggests the importance of these genes in the evolution of the floral developmental mechanism in core eudicots, perhaps by strengthening the interaction of MADS-box proteins and altering the specificity of multimers.


    Conclusions
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Our results strongly suggest that shortly after the duplication leading to the AP3 and PI genes, both lineages experienced a dramatic functional diversification driven by PS acting at specific sites of the K domain, whose main role is to promote heterodimerization with other MADS proteins to achieve different regulatory functions. In addition, we reported an estimated date for this duplication at 290 MYA, closer to the split of angiosperms and gymnosperms than the one reported previously. This suggests an early origin of heterodimerization capabilities of B MADS-box proteins in angiosperms, very close to the split from gymnosperms. Similarly, the euAP3–TM6 duplication in core eudicots seems to have been fundamental in the diversification process of these genes, also driven by PS acting at certain residues of the K and C-terminal motifs early after the formation of the euAP3 lineage.

Our study emphasizes the importance of the K domain in the evolutionary diversification of B genes. The importance of this domain in B gene function and evolution had been overshadowed by the discoveries of diagnostic C-terminal motifs and studies of their functional importance in B genes (Kramer et al. 1998Go, 2004Go; Lamb and Irish 2003Go). It has been speculated that the evolution of combinatorial multimer formation of MIKC-type MADS-domain proteins facilitated diversification of this gene family (Kaufmann et al. 2005Go). Our results support that the functional divergence of B MADS-box genes after duplications fostered different possible combinations for multimer formation that could have allowed the appearance of novel regulatory functions as the basis for the origin of morphological novelties.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Supplementary material is available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
We thank Susana Magallón for her guidance and helpful discussions, essential during the final stages of this study. We also thank Mark Olson for useful comments that helped to improve this manuscript. Part of this work was done during an academic stance of T.H.-H. at Lorenzo Segovia Lab in the Biotechnology Institute, Universidad Nacional Autónoma de México (UNAM). We thank him for sponsoring that stance and providing computer time. This work was supported by grants to E.A.B. from Programa de Apoyo a Proyectos de Investigación e Innovación Tecnológica-UNAM (IN 230002 and -IN212995), UC-MEXUS-ECO IE 271, and Consejo Nacional de Ciencia y Tecnología (CONACYT)-41848-Q, 31871-N and is based on T.H.-H. Masters thesis, whose graduated studies were supported with a fellowship from CONACYT and Dirección General de Estudios de Posgrado at UNAM. We also acknowledge Rigoberto V Pérez-Ruiz and Aida Navarrete for support in various technical and logistical tasks, respectively.


    Footnotes
 
Neelima Sinha: Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 

    Alberch P. (1980) Ontogenesis and morphological diversification. Am Zool 20:653–667.

    Alvarez-Buylla ER, Liljergren SJ, Pelaz S, Gold SE, Burgeff C, Ditta GS, Vergara-Silva F, Yanofsky MF. (2000) MADS-box gene evolution beyond flowers: expression in pollen, endosperm, guard cells, roots and trichomes. Plant J 25:5593.[CrossRef]

    Alvarez-Buylla ER, Pelaz S, Liljergren SJ, Gold SE, Burgeff C, Ditta GS, Ribas DP, Martínez-Castilla L, Yanofsky MF. (2000) An ancestral MADS-box gene duplication occurred before the divergence of plants and animals. Proc Natl Acad Sci USA 97:105328–5333.[Abstract/Free Full Text]

    Anisimova M, Bielawski JP, Yang Z. (2001) Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol Biol Evol 18:81585–1592.[Abstract/Free Full Text]

    [APG II] Angiosperm Phylogeny Group II. (2003) An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG II. Bot J Linn Soc 141:399–436.[CrossRef][ISI]

    Bailey TL and Elkan C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology(AAAI Press, Menlo Park (CA)) pp. 28–36.

    Bailey TL and Elkan C. (1995) The value of prior knowledge in discovering motifs with MEME. Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology(AAAI Press, Menlo Park (CA)) pp. 21–29.

    Berbel A, Navarro C, Ferrandiz C, Calas LA, Beltran JP, Madueno F. (2005) Functional conservation of PISTILLATA activity in a pea homolog lacking the PI motif. Plant Physiol 139:1174–185.[Abstract/Free Full Text]

    Bielawski JP and Yang Z. (2004) A maximum likelihood method for detecting functional divergence at individual codon sites, with application to gene family evolution. J Mol Evol 59:121–132.[ISI][Medline]

    Bowman JL. (1997) Evolutionary conservation of angiosperm flower development at the molecular and genetic levels. J