Skip Navigation


MBE Advance Access originally published online on June 22, 2007
Molecular Biology and Evolution 2007 24(9):1982-1990; doi:10.1093/molbev/msm127
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/9/1982    most recent
msm127v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hoffmann, F. G.
Right arrow Articles by Storz, J. F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hoffmann, F. G.
Right arrow Articles by Storz, J. F.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

The {alpha}D-Globin Gene Originated via Duplication of an Embryonic {alpha}-Like Globin Gene in the Ancestor of Tetrapod Vertebrates

Federico G. Hoffmann and Jay F. Storz

School of Biological Sciences, University of Nebraska, Lincoln

E-mail: jstorz2{at}unl.edu.


    Abstract
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Gene duplication is thought to play an important role in the co-option of existing protein functions to new physiological pathways. The globin superfamily of genes provides an excellent example of the kind of physiological versatility that can be attained through the functional and regulatory divergence of duplicated genes that encode different subunit polypeptides of the tetrameric hemoglobin protein. In contrast to prevailing views about the evolutionary history of the {alpha}-globin gene family, here we present phylogenetic evidence that the {alpha}A- and {alpha}D-globin genes are not the product of a single, tandem duplication of an ancestral globin gene with adult function in the common ancestor of extant birds, reptiles, and mammals. Instead, our analysis reveals that the {alpha}D-globin gene of amniote vertebrates arose via duplication of an embryonic {alpha}-like globin gene that predated the radiation of tetrapods. The important evolutionary implication is that the distinct biochemical properties of {alpha}D-hemoglobin (HbD) are not exclusively derived characters that can be attributed to a post-duplication process of neofunctionalization. Rather, many of the distinct biochemical properties of HbD are retained ancestral characters that reflect the fact that the {alpha}D-globin gene arose via duplication of a gene that had a larval/embryonic function. These insights into the evolutionary origin of HbD illustrate how adaptive modifications of physiological pathways may result from the retention and opportunistic co-option of ancestral protein functions.

Key Words: hemoglobin • gene duplication • alpha-D globin • neofunctionalization • molecular phylogenetics • gene family evolution


    Introduction
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Gene duplication can result in the acquisition of novel protein functions in cases where one duplicate copy retains the original function of the ancestral gene while the other copy accumulates new mutations that adapt the encoded protein to a new or modified physiological task (Ohno 1970Go; Kimura and Ohta 1974Go; Goodman et al. 1975Go; Ohta 1993Go, 1994Go; Hughes 1999Go). This functional specialization may entail changes in protein function as well as changes in the tissue specificity or developmental timing of gene expression. The globin superfamily of genes provides an excellent example of the kind of physiological versatility that can be attained through the functional and regulatory divergence of duplicated genes that encode different subunit polypeptides of the tetrametic hemoglobin protein (Goodman et al. 1975Go; Hardison 2001Go). In amniote vertebrates, hemoglobin has been optimized for oxygen transport under the vastly different physiological conditions encountered during the embryonic, fetal, and adult stages of development. For example, in eutherian mammals, regulatory switching between paralogous genes that encode functionally distinct subunit polypeptides can produce adult hemoglobins that are optimized for pulmonary/tissue oxygen transport, fetal hemoglobins that are optimized for placental/tissue oxygen transport, and embryonic hemoglobins that are optimized for diffusional oxygen scavenging from the amniotic fluid as well as oxygen transport during the transition from a vitelline to placental circulation (Hardison 1998Go, 2001Go; Nagel and Steinberg 2001Go; Brittain 2002Go).

In gnathostome vertebrates, the hemoglobin protein is a heterotetramer composed of two {alpha}-chain and two ß-chain subunits. In amniote vertebrates, each of the different subunit polypeptides is encoded by different sets of duplicated genes that are located on different chromosomes. In amphibians and ray-finned fishes, the {alpha}- and ß-like globin genes are closely linked on the same chromosome, reflecting the fact that the progenitors of the {alpha}- and ß-globin gene families arose via tandem duplication of an ancestral globin gene approximately 450-500 mya, prior to the radiation of amniote vertebrates (Goodman et al. 1975Go; Czelusniak et al. 1982Go; Goodman et al. 1987Go).

Members of the {alpha}- and ß-globin gene families have diversified in both biochemical properties and developmental timing of expression. The genes in each cluster are generally organized in a 5'-3' sequence that is co-linear with the temporal sequence of expression during development. The archetypal arrangement of the tetrapod {alpha}-globin gene family consists of two functional genes, each of which may be present in one or more copies: an {alpha}-like globin gene expressed early in embryonic development, {alpha}E-globin (known as {alpha}L-globin in amphibians, {zeta}-globin in mammals, and {pi}-globin in birds), and the adult {alpha}A-globin gene. Birds, mammals, and reptiles possess an additional member of the family, the adult {alpha}D-globin gene, whereas the {theta}-globin gene is found only in mammals. In nearly all mammals studied to date, the {alpha}-chains of adult hemoglobin are encoded by duplicate copies of the {alpha}A-globin gene, which are almost always identical in sequence and therefore encode identical polypeptides (Zimmer et al. 1980Go; Higgs et al. 1989Go). The {alpha}D-globin gene was only recently identified in mammals (Goh et al. 2005Go; Hughes et al. 2005Go; Cooper et al. 2006Go), and although it is transcriptionally active, the product of this gene is not known to be assembled into functional hemoglobin tetramers. In birds and reptiles, by contrast, the {alpha}-chains of adult hemoglobin are encoded by both the {alpha}D- and {alpha}A-globin genes. In most birds studied to date, {alpha}D-containing hemoglobin (HbD) constitutes the minor fraction of adult hemoglobin and {alpha}A-containing hemoglobin (HbA) constitutes the major fraction (Borgese and Bertles 1965Go; Brown and Ingram 1974Go; Hiebl et al. 1987Go). Thus, unlike the case in most mammals, the mature erythrocytes of adult birds and reptiles may contain a mixture of functionally distinct hemoglobin isoforms that have different biochemical properties. Specifically, HbD generally has a higher oxygen affinity and a higher cooperativity of oxygen binding than HbA (Cirotto and Geraci 1975Go; Baumann et al. 1984Go; Riggs 1998Go; Knapp et al. 1999Go). Thus, regulatory adjustments that alter the stoichiometric ratio of these two isoforms in circulating erythrocytes may modulate rates of oxygen flux in response to changes in metabolic demand (Hiebl et al. 1987Go; Hiebl et al. 1988Go).

It has traditionally been thought that the {alpha}D- and {alpha}A-globin genes arose via tandem duplication of an ancestral proto {alpha}A-globin gene with adult function in the common ancestor of birds and mammals (Czelusniak et al. 1982Go; Hardison 1998Go, 2001Go), and recent phylogenetic analyses lend support to this idea (Cooper et al. 2006Go). According to this scenario, biochemical properties of {alpha}D-globin that distinguish it from {alpha}A-globin may be derived characteristics that evolved under the influence of directional selection that favored a physiological division of labor between the two co-expressed gene duplicates. This scenario has intriguing evolutionary implications because functional studies have demonstrated striking similarities between HbD and embryonic hemoglobin (HbE) with respect to oxygen-binding affinity, cooperativity of oxygen binding, solubility, and responsiveness to chloride ions and other allosteric effectors (Cirotto and Geraci 1975Go; Baumann et al. 1984Go; Chapman et al. 1980Go; Chapman et al. 1982Go; Grigg et al. 1993Go; Knapp et al. 1999Go). If the {alpha}D- and {alpha}A-globins arose via tandem duplication of an ancestral proto {alpha}A-globin gene with adult function, then it would appear that {alpha}D-globin has convergently evolved a suite of biochemical properties that are characteristic of embryonic globins.

In contrast to prevailing views about the evolutionary history of the {alpha}-globin gene family, here we present evidence that the {alpha}D- and {alpha}A-globins are not the product of a single, tandem duplication of an ancestral globin gene with adult function in a common ancestor to extant birds, reptiles, and mammals. Instead, results of our phylogenetic analyses reveal that the {alpha}D-globin of amniote vertebrates arose via duplication of a proto {alpha}E-globin gene that predated the radiation of tetrapods. The important evolutionary implication is that the distinct biochemical properties of HbD are not exclusively derived characters that evolved during a post-duplication process of neofunctionalization. Rather, many of the distinct biochemical properties of HbD are retained ancestral characters that reflect the fact that the {alpha}D-globin gene arose via duplication of a gene that had a larval/embryonic function in the ancestor of all tetrapods.


    Material and Methods
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
DNA Sequence Data
We obtained DNA sequences for structural genes in the {alpha}-globin gene family from either Genbank or Ensembl (release 37, February 2006). When possible, we focused on sequences from a single genomic contig, genomic scaffold, or full chromosome, depending on the nature of the available data. We also included sequences from shorter records based on genomic DNA or cDNA in order to attain a broad and balanced taxonomic coverage of the tetrapod phylogeny. This provision allowed us to include sequences from amphibians (Xenopus and Pleurodeles), reptiles (Geochelone and Hydrophis), birds (Cairina), as well as some additional mammalian taxa. The basic annotation was derived from the database records in most cases, but we also identified globin genes in unannotated sequences by comparing known exon sequences to genomic contigs using the program BLAST 2 sequences (Version 2.2; Tatusova and Madden 1999), available from the NCBI website (http://www.ncbi.nlm.nih.gov/blast/bl2seq).

We restricted the phylogenetic analysis to functional copies of the four {alpha}-like globin genes and we excluded redundant sequences. We obtained sequences from functional members of the {alpha}-globin gene family in representatives from all tetrapod classes including amphibians, reptiles, birds, and mammals. The {alpha}-globin sequences from zebra fish (Danio rerio) and puffer fish (Sphoeroides nephelus) were used as outgroups. Alignment was based on the amino acid translation, and was carried out in ClustalW (Thompson et al. 1994Go) as implemented in BioEdit (Hall 1999Go). Having confirmed orthologous relationships among the larval/embryonic {alpha}-like globins among all vertebrate taxa (see Results), we use the name {alpha}E-globin to refer to the {alpha}L-globin of amphibians, the {pi}-globin of birds, and the {zeta}-globin of mammals.

Amino Acid Sequence Data
Due to the limited amount of DNA sequence data available for the {alpha}-like globin genes of reptiles, we also assembled a protein sequence data set to corroborate observations derived from the DNA data set. In the protein data set, we increased our representation of {alpha}-like globin genes from amphibians and reptiles, and we included translated amino acid sequences from a subset of the avian and mammalian taxa that were included in the DNA analyses. Specifically, we included translated sequences of {alpha}E-, {alpha}D-, and {alpha}A-globin from two representatives of the class Aves (duck and chicken) and translated sequences of {alpha}E-, {alpha}D-, {alpha}A-, and {theta}-globin from four representatives of the class Mammalia (an Australian marsupial, a New world marsupial, cow, and human). As with the DNA analysis, we included translated {alpha}-globin sequences from zebra fish and puffer fish as outgroups.

Phylogenetic Inference
We estimated phylogenetic relationships among the different {alpha}-like globin DNA sequences in our data set using neighbor-joining (NJ), maximum parsimony (MP), maximum likelihood (ML), and Bayesian approaches. Neighbor-joining and MP searches were conducted in PAUP version 4.b10 (Swofford 2002Go), ML analyses were implemented in Treefinder version May 2006 (Jobb et al. 2004Go), and Bayesian analyses were implemented in MrBayes version 3.1.2 (Ronquist and Huelsenbeck 2003Go). For the the NJ analyses, we selected the best-fit model of nucleotide substitution by using hierarchical Likelihood Ratio Tests as implemented in the program ModelTest version 3.7 (Posada and Crandall 1998Go). For the the MP analyses, all substitutions were equally weighted. In ML and Bayesian analyses, we allowed searches to jointly optimize branch lengths and parameter values for the best-fit model of nucleotide substitution. We then compared tree searches with and without site-specific rates for each of the three codon positions using an independent General-Time-Reversible model (GTR; Rodriguez et al. 1990Go) in which rate variation followed a discrete gamma distribution for each position. Support for nodes in NJ, MP, and ML analyses was evaluated using the bootstrap procedure. We employed 1000 pseudoreplicates in NJ and MP analyses, and 100 in ML analyses using the same parameter settings that were used in the phylogenetic searches. Bayesian analyses were run for 5 x 106 iterations of a Markov Chain Monte Carlo algorithm, with samples taken every 1000 iterations. Support for the nodes and parameter estimates were derived from a majority rule consensus of the last 2,500 trees sampled after convergence. In the case of the protein sequence data, we used ProtTest version 1.3 (Abascal et al. 2005Go) to find the best-fitting model of amino acid substitution, and set ML searches in Treefinder to jointly estimate the phylogeny and parameter values for the selected model.

Hypothesis Testing
To compare alternative tree topologies, we used the Shimodaira-Hasegawa topology test (Shimodaira and Hasegawa 1999Go), and the approximately unbiased test suggested by Shimodaira (2002Go). We computed site-specific likelihood scores for each site in the baseml program (Yang 1997Go), and used the CONSEL program (Shimodaira and Hasegawa 2001Go) to test whether differences among alternative topologies were statistically significant. Given that the hypotheses evaluated were mutually exclusive, we performed a SOWH test (Swofford et al. 1996Go), as suggested by Goldman et al. (2000)Go. This test uses a parametric bootstrapping approach to test for differences in topology. First we constrained our searches to find the ML tree that was congruent with the null hypothesis and its associated model of nucleotide substitution, partitioning data into the three different codon positions. Next, we simulated 1,000 data sets based on the null hypothesis ML phylogram, and its corresponding model of nucleotide substitution. For each of the 1,000 simulated data sets we calculated the difference in likelihood score, {Delta}, between the null hypothesis ML topology and the alternative hypothesis ML topology. Using an {alpha}-level of 0.01, the null hypothesis ML topology is rejected if ≥ 99% of the simulation-based {Delta}-values exceed the observed value.


    Results
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Sequence Data
We collected 120 functional {alpha}-like globin sequences from a diverse array of tetrapod taxa, including amphibians, reptiles, birds, and mammals, in addition to two fish (zebra fish HBAA1 and puffer fish HBA2). The final data set consisted of a 429 bp alignment excluding stop codons (Supplementary Material, Table 1).


View this table:
[in this window]
[in a new window]

 
Table 1 Results from the Parametric Bootstrap Test. We Simulated 1000 Sequence Data Sets Using the Tree and GTR + {Gamma} Model of Nucleotide Substitution that Corresponded to the Null Hypothesis. The Difference in Likelihood Score Between the Null Hypothesis ML Topology and the Alternative Hypothesis ML Topology is given as {Delta} = (–lnL H0) – (–lnL Halt)

 
Phylogenetic Inference
On the basis of previous work, the following phylogenetic results are predicted: (i) {alpha}E-globins from mammals and birds, {alpha}D-globin, and {theta}-globin should each form a monophyletic clade; (ii) {alpha}A-globin should be paraphyletic relative to {theta}-globin; (iii) mammalian {alpha}A-globin should be sister to mammalian {theta}-globin; and (iv) relationships within each set of orthologous sequences should match the expected phylogenetic relationships among species. The goals of our phylogenetic analysis were to resolve relationships among the four {alpha}-like globin paralogs and to assess the robustness of the inferred tree in comparison with trees predicted by alternative hypotheses regarding the evolution of the {alpha}-globin gene family. Neighbor-joining, MP, ML, and Bayesian phylogenetic analyses of our DNA alignment produced highly concordant results. Here, we present results from the ML analysis, with support from ML bootstrap (MLbs) and Bayesian posterior probabilities (BApp). Results of the NJ and MP analyses are available in the supplementary material (Supplementary Material, Figs 1 and 2). In all cases the sequences were grouped into three main lineages: fish {alpha}-globins (which were sister to tetrapod {alpha}-globins), tetrapod {alpha}E- and {alpha}D-globins, and tetrapod {alpha}A- and {theta}-globins (Fig. 1). The ML analysis of the protein data set was consistent with results based on the DNA sequences (Supplementary Material, Fig. 3).


Figure 1
View larger version (22K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Maximum likelihood phylogram describing relationships among the {alpha}-like globin genes of vertebrates, with {alpha}E-globin sequences (including amphibian {alpha}L-globin, avian {pi}-globin, and mammalian {zeta}-globin) shown in blue, {alpha}D-globin sequences shown in green, {alpha}A-globin sequences shown in black, and {theta}-globin sequences shown in red. Two fish {alpha}-globin sequences were used as outgroups. Searches were conducted in Treefinder version May 2006 (Jobb, von Haeseler, and Strimmer 2004Go) using an independent site-specific GTR+{Gamma} model of nucleotide substitution for each codon position. Numbers above the nodes correspond to maximum likelihood bootstrap support values, and those below the nodes correspond to Bayesian posterior probabilities.

 

Figure 2
View larger version (40K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— Schematic representations of alternative hypotheses regarding phylogenetic relationships among the {alpha}-like globin genes of amniote vertebrates. In A, {alpha}D-globin arose via duplication of an ancestral embryonic {alpha}E-globin that predated the radiation of tetrapods. In B, the {alpha}D-globin arose via duplication of an ancestral adult {alpha}A-globin that predated the radiation of tetrapods. Our results support hypothesis A. In C, {alpha}D-globin arose via duplication of an ancestral embryonic {alpha}E-globin that occurred after the divergence of amphibians from the common ancestor of amniote vertebrates, whereas in D, {alpha}D-globin arose via duplication of an ancestral adult {alpha}A-globin that occurred after the divergence of amphibians from the common ancestor of amniote vertebrates.

 

Figure 3
View larger version (28K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 3.— Two alternative hypotheses regarding the evolutionary history of the {alpha}-globin gene family. Panel A summarizes the prevailing view, in which the {alpha}E-globin and {alpha}A-globin genes arose via tandem duplication of the {alpha}-globin pro-ortholog in the ancestor of all extant tetrapods. This was then followed by a second duplication of the adult {alpha}A-globin gene which gave rise to the {alpha}D-globin gene in the common ancestor of all amniotes. Panel B shows the evolutionary scenario favored by our phylogenetic analyses. Under this scenario, the {alpha}D-globin gene arose via duplication of the {alpha}E-globin pro-ortholog in the common ancestor to all tetrapods.

 
We found strong support for monophyly of the clade containing all {alpha}E-globin sequences (MLbs = 78%, BApp = 98%). Surprisingly, we also found support for a sister-group relationship between a clade that included all embryonic {alpha}E-globin sequences (amphibian {alpha}L-globin, avian {pi}-globin, and mammalian {zeta}-globin), and a clade that included all {alpha}D-globin sequences in addition to an {alpha}A-globin gene of the slender sea-snake (MLbs = 59%, and BApp = 99%). Bootstrap support for the monophyly of the clade that grouped all {alpha}D-globin genes with the slender sea-snake {alpha}A-globin was over 70% in ML, and the posterior probability was just below the 95% threshold set in the Bayesian analysis. Although the putative {alpha}D-globin gene of the slender sea-snake grouped with other {alpha}D-globin sequences in the phylogeny reconstruction, this sequence does not share a second-codon deletion that is shared by all other {alpha}D-globin genes. Instead, the second codon of the sea-snake sequence codes for valine, as is the case for all amniote {alpha}A-globin genes in our database. This suggests a history of gene conversion where the 5' end of the sea-snake {alpha}D-globin coding sequence has been converted by the {alpha}A-globin paralog. Further insights into the nature and prevalence of interparalog gene conversion between the {alpha}A- and {alpha}D-globin genes should be possible when genomic sequence data become available for snakes and other squamate reptiles.

We also found strong support for a clade that included amphibian, reptilian, and avian {alpha}A-globins, as well as the mammalian {alpha}A- and {theta}-globins (MLbs = 76%, BApp = 98%). As expected, the {alpha}A-globin clade was found to be paraphyletic relative to mammalian {theta}-globin. The monophyly of the mammalian {theta}-globins, in turn, was strongly supported (MLbs = 80%, BApp = 99%).

Hypotheses Testing
The topology of the ML tree in Fig. 1 met a subset of our initial expectations regarding phylogenetic relationships within and among the different paralogs, although there were several minor discrepancies between the observed and expected relationships among mammalian taxa within each set of orthologous sequences. Eutherian and marsupial sequences were placed sister to each other in the case of {alpha}E-globin, {alpha}D-globin, and {theta}-globin. Adult {alpha}A-globins, however, departed from this pattern. Here, the deepest nodes corresponded to the split of the hedgehog {alpha}A-globins from a clade that contained the remaining therian {alpha}A-globins and therian {theta}-globins. Marsupial {alpha}A-globins were sister to the two tenrec paralogs, and this clade in turn was sister to the {theta}-globin clade. Statistically significant support for the sister relationship between therian {theta}-globins and the clade containing marsupial and tenrec {alpha}A-globins was only found in Bayesian analyses (BApp = 99%, MLbs < 50%). The presence of {theta}-globin sequences in both marsupial and eutherian mammals indicates that a copy of this gene was present in the therian lineage prior to the divergence of marsupials from eutherians. The lack of support for a sister relationship between the therian {alpha}A- and {theta}-globin sequences is most likely attributable to lack of power and/or long branch attraction between marsupial {alpha}A-globins and therian {theta}-globins.

The main discrepancy between our inferred tree and the tree predicted by the conventional view of {alpha}-globin gene family evolution concerns the placement of the {alpha}D-globin lineage. Our inferred tree indicates that {alpha}D-globin is more closely related to embryonic {alpha}E-globin than to adult {alpha}A-globin, and that the duplication that gave rise {alpha}D-globin preceded the radiation of extant tetrapods.

Given that our results contradict prevailing views of {alpha}-globin gene family evolution, we used constrained searches to explore differences in likelihood scores among the alternative hypotheses (Fig. 2). Specifically, we explored alternative placements of the {alpha}D-globin sequences that are predicted under competing evolutionary hypotheses, and compared differences among the alternative tree topologies in a statistical framework. The conventional view is that the duplication that gave rise to {alpha}D-globin occurred after the divergence of amphibians from the common ancestor of all amniote vertebrates (Fig. 3A). By contrast, our results suggest that this duplication took place in the common ancestor of all extant tetrapods (Fig. 3B). Accordingly, we carried out ML phylogenetic reconstructions in which {alpha}D-globin divergence occurred before the split of either the {alpha}E- or {alpha}A globins of amphibians (Fig. 2A and 2B), and compared these inferred trees to constrained trees in which amphibian {alpha}E- or {alpha}A globins arose prior to the divergence of {alpha}D-globin (Fig. 2C and 2D). Differences in likelihood scores associated with each of the four alternative hypotheses were not statistically significant in the SH and AU tests. However, the parametric-bootstrapping SOWH test was able to distinguish the best ML tree (Fig. 2A) from each of the three alternative constrained trees with a high level of statistical significance (P < 0.005; Table 1).


    Discussion
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
We used a large number of vertebrate genome sequences and recently developed phylogenetic methods to reevaluate evolutionary relationships among members of the {alpha}-globin gene family in vertebrates. Results of our phylogenetic analyses indicate that {alpha}D-globin is more closely related to embryonic {alpha}E-globin than to adult {alpha}A-globin, and that the tandem duplication that gave rise to {alpha}D-globin preceded the radiation of extant tetrapods. Our conclusion that the {alpha}D-globin gene arose via duplication of a proto {alpha}E-globin gene has important evolutionary implications for understanding the role of hemoglobin isoform differentiation in the oxygen-transport systems of birds and other archosaurs.

The prevailing view of {alpha}-globin gene family evolution has been that the {alpha}E- and {alpha}A-globin genes arose via tandem duplication of the {alpha}-globin pro-ortholog in the ancestor of all extant tetrapods (Czelusniak et al. 1982Go; Hardison 1991Go, 2001Go; Aguileta et al. 2006Go; Cooper et al. 2006Go; Fig. 2A). This was then followed by a second duplication of the adult {alpha}A-globin gene, which gave rise to {alpha}D-globin. This second duplication was initially thought to be restricted to modern reptiles and birds, but the recent discovery of a transcriptionally active copy of {alpha}D-globin in the {alpha}-globin gene cluster of mammals (Goh et al. 2005Go; Hughes et al. 2005Go; Cooper et al. 2006Go) suggests that the duplication must have occurred in the common ancestor of amniote vertebrates. Several previous studies have presented phylogenetic reconstructions consistent with a sister relationship between the {alpha}D- and {alpha}E-globins (Goodman et al. 1982Go; Fushitani et al. 1996Go; Gorr et al. 1998Go; Wheeler et al. 2004Go), but the complete history of gene duplication and divergence was never reconstructed nor were the evolutionary implications explored.

If the duplication leading to {alpha}D-globin had occurred after the lineage leading to amniotes diverged from the stem lineage of amphibians (the conventional view), we would expect {alpha}D-globin sequences to be basal to either the amniote {alpha}E- or amniote {alpha}A-globin clade, and to branch after the divergence of the amphibian {alpha}E- or {alpha}A-globins (Fig. 2A). However, monophyly of the group that includes the {alpha}E-globins to the exclusion of {alpha}D-globins is strongly supported in both ML and Bayesian analyses, as is the monophyly of the clade that includes the {alpha}A-globins of all amniotes and the {theta}-globins of mammals to the exclusion of {alpha}D-globins. We therefore conclude that the gene duplication that gave rise to {alpha}D-globin predated the deepest split within either the {alpha}E- or {alpha}A- globin clades, as illustrated in Fig 3B. This would imply that an {alpha}D-globin paralog was present in the ancestor of all tetrapods, regardless of whether {alpha}D-globin is allied to embryonic {alpha}E- or adult {alpha}A-globins. To date, there is no evidence for the presence of an {alpha}D-globin gene in amphibians, which suggests that it may have been secondarily lost.

Our evolutionary hypothesis that {alpha}D-globin is the product of a duplication of the {alpha}E-globin pro-ortholog is supported by analyses of both DNA and amino acid sequences, which indicate a sister relationship between the {alpha}D- and {alpha}E-globins (Fig. 3B). Support for this relationship is strong in Bayesian analyses (BApp = 99%), and moderate in the ML analysis (MLbs = 59%). The sister relationship of {alpha}D- and {alpha}E-globins is also strongly supported by the parametric-bootstrapping SOWH test. This indicates that the {alpha}D-globin gene, which is expressed in adult birds and reptiles, shares a common ancestor with {alpha}E-globin, which is only expressed during the earliest stages of embryonic development. Our analyses also provide strong support for monophyly of the clade that includes all {alpha}E-globins (amphibian {alpha}L-globins, mammalian {zeta}-globins, and avian {pi}-globins) to the exclusion of all other {alpha}-like globin sequences. This suggests an orthologous relationship among all {alpha}-like globin genes that are expressed during the earliest stages embryonic development, and is consistent with previous studies that confirmed an orthologous relationship between avian {pi}-globins and mammalian {zeta}-globins (Czelusniak et al. 1982Go; Proudfoot et al. 1982Go).

In the common ancestor of amniote vertebrates, it may be that the ancestral {alpha}D-globin was expressed early in development, and was later recruited for expression in the mature erythrocytes of adult birds and reptiles, although the ancestral mode of embryonic expression has been retained in some avian species (Godovac-Zimmermann and Braunitzer 1984Go). In most birds, the {alpha}D-globin gene is expressed at lower levels than {alpha}A-globin, but the relative expression levels of the two genes vary widely among different species (Borgese and Bertles 1965Go; Godovac-Zimmermann and Braunitzer 1984Go, 1985Go; Hiebl et al. 1986Go; Hiebl et al. 1987Go; Hiebl et al. 1989Go; Nothum, Braunitzer et al. 1989; Nothum, Weber et al. 1989; Ikehara et al. 1997Go).

The functional differentiation of HbA and HbD isoforms appears to have played an important role in the evolution of hypoxia tolerance in a number of birds, including high-soaring vultures (Hiebl et al. 1988Go; Weber et al. 1988Go; Hiebl et al. 1989Go) and migratory ducks and geese (Hiebl et al. 1986Go; Hiebl et al. 1987Go). The graded oxygen affinities of {alpha}A- and {alpha}D- containing hemoglobin isoforms provides a ‘cascade’ mechanism for fine-tuning hemoglobin-oxygen affinity in response to variation in ambient oxygen tension. Under conditions of high-altitude hypoxia, high-affinity HbD isoforms ensure efficient pulmonary oxygen loading in the parabranchial lung whereas low-affinity HbA isoforms ensure the efficient unloading of oxygen to the cells of aerobically metabolizing tissues (Hiebl et al. 1988Go; Weber et al. 1988Go).

One of the most striking examples of the role of hemoglobin isoform differentiation in high-altitude respiration involves Rüppell's griffon (Gyps rueppelli), an African vulture that nests at sea-level and is known to fly at altitudes of over 11,000 m. As a result of tandem duplication and divergence of the {alpha}A- and {alpha}D-globin genes, the red blood cells of these birds contain a heterogeneous mixture of four functionally distinct hemoglobin isoforms with the following rank-order of oxygen-binding affinities (low to high): HbA, HbA', HbD, and HbD'. The graded oxygen affinities of these four hemoglobin isoforms permits a cascade mechanism of pulmonary/tissue oxygen transport, and appears to provide an important regulatory reserve of oxygen transport capacity (Hiebl et al. 1988Go; Weber, Hiebl, and Braunitzer 1988Go). The fact that the {alpha}D-globin gene arose via duplication of a precursor gene with an embryonic/larval function suggests that the encoded protein was ‘pre-adapted’ to the task of pulmonary oxygen loading at low partial pressures of oxygen. The protein was pre-adapted to this new task in the sense that the requisite biochemical properties were present in the ancestral condition, and could therefore be co-opted for a modified function when the possession of a high-affinity adult hemoglobin became advantageous. Given what we know about the role of HbD in the evolution of hypoxia tolerance in birds, and possibly in other archosaurs (Shishikura 2002Go; Stoeckelhuber, Gorr et al. 2002Go), our insights into the evolutionary origin of the {alpha}D-globin gene illustrate how adaptive modifications of physiological pathways may result from the retention and opportunistic co-option of ancestral protein functions.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Supplementary Material is available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
We thank G. Ortí, E. Gering, A. Runck, J. C. Opazo, E. P. Lessa, W. Jeffery, and two anonymous reviewers for helpful comments and suggestions. This work was funded by a Postdoctoral Fellowship in Population Biology to FGH from the University of Nebraska, an NSF grant to JFS (DEB-0614342), a Layman Award to JFS, and an Interdisciplinary Research Grant to JFS from the Nebraska Research Council.


    Footnotes
 
William Jeffery, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Material and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 

    Abascal F, Zardoya R, Posada D. Prottest: selection of best-fit models of protein evolution. Bioinformatics (2005) 21:2104–2105.[Abstract/Free Full Text]

    Aguileta G, Bielawski JP, Yang Z. Proposed standard nomenclature for the alpha- and beta-globin gene families. Genes Genet Syst (2006) 81:367–371.[CrossRef][Web of Science][Medline]

    Baumann R, Goldbach E, Haller EA, Wright PG. Organic phosphates increase the solubility of avian haemoglobin D and embryonic chicken haemoglobin. Biochem J (1984) 217:767–771.[Web of Science][Medline]

    Borgese TA, Bertles JF. Hemoglobin heterogeneity: embryonic hemoglobin in the duckling and its disappearance in the adult. Science (1965) 148:509–511.[Abstract/Free Full Text]

    Brittain T. Molecular aspects of embryonic hemoglobin function. Mol Aspects Med (2002) 23:293–342.[CrossRef][Medline]

    Brown JL, Ingram VM. Structural studies on chick embryonic hemoglobins. J Biol Chem. (1974) 249:3960–3972.[Abstract/Free Full Text]

    Chapman BS, Hood LE, Tobin AJ. Minor early embryonic chick hemoglobin M. Amino acid sequences of the epsilon and alpha D chains. J Biol Chem. (1982) 257:651–658.[Abstract/Free Full Text]

    Chapman BS, Tobin AJ, Hood LE. Complete amino acid sequences of the major early embryonic alpha-like globins of the chicken. J Biol Chem. (1980) 255:9051–9059.[Abstract/Free Full Text]

    Cirotto C, Geraci G. Embryonic chicken hemoglobins. studies on the oxygen equilibrium of two pure components. Comp Biochem Physiol A (1975) 51:159–163.

    Cooper SJB, Wheeler D, De Leo A, Cheng J, Holland RAB, Marshall Graves JA, Hope RM. The mammalian alphaD-globin gene lineage and a new model for the molecular evolution of alpha-globin gene clusters at the stem of the mammalian radiation. Mol Phylogenet Evol. (2006) 38:439–448.[CrossRef][Web of Science][Medline]

    Czelusniak J, Goodman M, Hewett-Emmett D, Weiss ML, Venta PJ, Tashian RE. Phylogenetic origins and adaptive evolution of avian and mammalian haemoglobin genes. Nature (1982) 298:297–300.[CrossRef]

    Fushitani K, Higashiyama K, Moriyama EN, Imai K, Hosokawa K. The amino acid sequences of two alpha chains of hemoglobins from Komodo dragon Varanus komodoensis and phylogenetic relationships of amniotes. Mol Biol Evol. (1996) 13:1039–1043.[Abstract]

    Godovac-Zimmermann J, Braunitzer G. Hemoglobin of the adult white stork (Ciconia ciconia, Ciconiiformes). the primary structure of alpha a- and beta-chains from the only present hemoglobin component. Hoppe Seylers Z Physiol Chem. (1984) 365:1107–1113.[Web of Science][Medline]

    Godovac-Zimmermann J, Braunitzer G. The primary structure of alpha a- and beta-chains from Blue-and-yellow macaw (Ara ararauna, Psittaci) hemoglobin. No evidence for expression of alpha d-chains. Biol Chem Hoppe Seyler (1985) 366:503–508.[Web of Science][Medline]

    Goh SH, Lee YT, Bhanu NV, Cam MC, Desper R, Martin BM, Moharram R, Gherman RB, Miller JL. A newly discovered human alpha-globin gene. Blood (2005) 106:1466–1472.[Abstract/Free Full Text]

    Goldman N, Anderson JP, Rodrigo AG. Likelihood-based tests of topologies in phylogenetics. Syst Biol. (2000) 49:652–670.[CrossRef][Web of Science][Medline]

    Goodman M, Miyamoto MM, Czelusniak J. Pattern and process in vertebrate phylogeny revealed by coevolution of molecules and phylogenies. In: Molecules and morphology in evolution: conflict or compromise?—Patterson C, ed. (1987) New York: Cambridge University Press. 140–176.

    Goodman M, Moore GW, Matsuda G. Darwinian evolution in the genealogy of haemoglobin. Nature (1975) 253:603–608.[CrossRef][Medline]

    Goodman M, Weiss ML, Czelusniak J. Molecular evolution above the species level: branching pattern rates and mechanisms. Syst Zool (1982) 31:376–399.

    Gorr TA, Mable BK, Kleinschmidt T. Phylogenetic analysis of reptilian hemoglobins: trees rates and divergences. J Mol Evol. (1998) 47:471–485.[CrossRef][Web of Science][Medline]

    Grigg GC, Wells RMG, Beard LA. Allosteric control of oxygen binding by haemoglobin during embryonic development in the crocodile Crocodylus porosus: the role of red cell organic phosphates and carbon dioxide. J Exp Biol. (1993) 175:15–32.[Abstract]

    Hall TA. Bioedit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucl Acids Symp Ser (1999) 41:95–98.

    Hardison R. Evolution of globin gene families. In: Evolution at the molecular level—Selander RK, Clark AG, Whittam TS, eds. (1991) Sunderland, MA: Sinauer. 272–289.

    Hardison R. Hemoglobins from bacteria to man: evolution of different patterns of gene expression. J Exp Biol. (1998) 201:1099–1117.[Abstract]

    Hardison R. Organization, evolution, and regulation of the globin genes. In: Disorders of hemoglobin: genetics, pathophysiology, and clinical managment—Steinberg MH, Forget BG, Higgs DR, Nagel RL, eds. (2001) Cambridge: Cambridge University Press. 95–115.

    Hiebl I, Braunitzer G, Schneeganss D. The primary structures of the major and minor hemoglobin-components of adult Andean goose (Chloephaga melanoptera: Anatidae): the mutation Leu–--Ser in position 55 of the beta-chains. Biol Chem Hoppe Seyler (1987) 368:1559–1569.[Web of Science][Medline]

    Hiebl I, Schneeganss D, Braunitzer G. High-altitude respiration of birds. The primary structures of the alpha D-chains of the Bar-headed goose (Anser indicus), the Greylag goose(Anser anser) and the Canada goose (Branta canadensis). Biol Chem Hoppe Seyler (1986) 367:591–599.[Web of Science][Medline]

    Hiebl I, Weber RE, Schneeganss D, Braunitzer G. High-altitude respiration of Falconiformes. The primary structures and functional properties of the major and minor hemoglobin components of the adult White-headed vulture (Trigonoceps occipitalis, Aegypiinae). Biol Chem Hoppe Seyler (1989) 370:699–706.[Web of Science][Medline]

    Hiebl I, Weber RE, Schneeganss D, Kosters J, Braunitzer G. High-altitude respiration of birds. Structural adaptations in the major and minor hemoglobin components of adult Ruppell's griffon (Gyps rueppellii, Aegypiinae): a new molecular pattern for hypoxic tolerance. Biol Chem Hoppe Seyler (1988) 369:217–232.[Web of Science][Medline]

    Higgs DR, Vickers MA, Wilkie AO, Pretorius IM, Jarman AP, Weatherall DJ. A review of the molecular genetics of the human alpha-globin gene cluster. Blood (1989) 73:1081–1104.[Free Full Text]

    Hughes AL. Adaptive evolution of genes and genomes (1999) New York: Oxford University Press.

    Hughes JR, Cheng J, Ventress N, Prabhakar S, Clark K, Anguita E, De Gobbi M, de Jong P, Rubin E, Higgs DR. Annotation of cis-regulatory elements by identification, subclassification, and functional assessment of multispecies conserved sequences. Proc Natl Acad Sci USA (2005) 102:9830–9835.[Abstract/Free Full Text]

    Ikehara T, Eguchi Y, Kayo S, Takei H. Isolation and sequencing of two alpha-globin genes alpha(A) and alpha(D) in pigeon and evidence for embryo-specific expression of the alpha(D)-globin gene. Biochem Biophys Res Commun (1997) 234:450–453.[CrossRef][Web of Science][Medline]

    Jobb G, von Haeseler A, Strimmer K. Treefinder: a powerful graphical analysis environment for molecular phylogenetics. BMC Evol Biol. (2004) 4:18.[CrossRef][Medline]

    Kimura M, Ohta T. On some principles governing molecular evolution. Proc Natl Acad Sci USA (1974) 71:2848–2852.[Abstract/Free Full Text]

    Knapp JE, Oliveira MA, Xie Q, Ernst SR, Riggs AF, Hackert ML. The structural and functional analysis of the hemoglobin D component from chicken. J Biol Chem. (1999) 274:6411–6420.[Abstract/Free Full Text]

    Nagel RL, Steinberg MH. Role of epistatic (modifier) genes in the modulation of the phenotypic diversity of sickle cell anemia. Pediatr Pathol Mol Med (2001) 20:123–136.[CrossRef][Web of Science][Medline]

    Nothum R, Braunitzer G, Hiebl I, Kosters J, Schneeganss D. The hemoglobins of the adult blackbird (Turdus merula, Passeriformes). The sequence of the major (HbA) and minor component (HbD). Biol Chem Hoppe Seyler (1989) 370:309–316.[Web of Science][Medline]

    Nothum R, Weber RE, Kösters J, Schneeganss D, Braunitzer G. Amino-acid sequences and functional differentiation of hemoglobins A and D from swift (Apus apus, Apodiformes). Biol Chem Hoppe Seyler (1989) 370:1197–1207.[Web of Science][Medline]

    Ohno S. Evolution by gene duplication (1970) New York: Springer-Verlag.

    Ohta T. Pattern of nucleotide substitutions in growth hormone-prolactin gene family: a paradigm for evolution by gene duplication. Genetics (1993) 134:1271–1276.[Abstract]

    Ohta T. Further examples of evolution by gene duplication revealed through DNA sequence comparisons. Genetics (1994) 138:1331–1337.[Abstract]

    Posada D, Crandall KA. Modeltest: testing the model of DNA substitution. Bioinformatics (1998) 14:817–818.[Abstract/Free Full Text]

    Proudfoot NJ, Gil A, Maniatis T. The structure of the human zeta-globin gene and a closely linked, nearly identical pseudogene. Cell. (1982) 31:553–563.[CrossRef][Web of Science][Medline]

    Riggs AF. Self-association, cooperativity and supercooperativity of oxygen binding by hemoglobins. J Exp Biol. (1998) 201:1073–1084.[Abstract]

    Rodriguez F, Oliver JL, Marin A, Medina JR. The general stochastic model of nucleotide substitution. J Theor Biol. (1990) 142:485–501.[Web of Science][Medline]

    Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics (2003) 19:1572–1574.[Abstract/Free Full Text]

    Shimodaira H. An approximately unbiased test of phylogenetic tree selection. Syst Biol. (2002) 51:492–508.[CrossRef][Web of Science][Medline]

    Shimodaira H, Hasegawa M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol Biol Evol. (1999) 16:1114–1116.[Web of Science]

    Shimodaira H, Hasegawa M. Consel: for assessing the confidence of phylogenetic tree selection. Bioinformatics (2001) 17:1246–1247.[Abstract/Free Full Text]

    Shishikura F. The primary structure of hemoglobin d from the Aldabra giant tortoise, Geochelone gigantea. Zoolog Sci. (2002) 19:197–206.[CrossRef][Web of Science][Medline]

    Stoeckelhuber M, Gorr T, Kleinschmidt T. The primary structure of three hemoglobin chains from the Indigo snake (Drymarchon corais erebennus, Serpentes): first evidence for alphaD chains and two beta chain types in snakes. Biol Chem. (2002) 383:1907–1916.[CrossRef][Web of Science][Medline]

    Swofford DL. Paup* phylogenetic analysis using parsimony (* and other methods) version 4.0b10 (2002) Sunderland, MA: Sinauer.

    Swofford DL, Olsen GJ, Waddell PJ, Hillis DM. Phylogenetic inference. In: Molecular systematics 2ndedition—Hillis DM, Moritz C, Mable B, eds. (1996) Sunderland, MA: Sinauer. 407–514.

    Tatusova TA, Madden TL. Blast 2 sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett (1999) 174:247–250.[CrossRef][Web of Science][Medline]

    Thompson JD, Higgins DG, Gibson TJ. Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. (1994) 22:4673–4680.[Abstract/Free Full Text]

    Weber RE, Hiebl I, Braunitzer G. High altitude and hemoglobin function in the vultures Gyps rueppellii and Aegypius monachus. Biol Chem Hoppe Seyler (1988) 369:233–240.[Web of Science][Medline]

    Wheeler D, Hope RM, Cooper SJB, Gooley AA, Holland RAB. Linkage of the beta-like omega-globin gene to alpha-like globin genes in an australian marsupial supports the chromosome duplication model for separation of globin gene clusters. J Mol Evol. (2004) 58:642–652.[CrossRef][Web of Science][Medline]

    Yang Z. Paml: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci (1997) 13:555–556.[Free Full Text]

    Zimmer EA, Martin SL, Beverley SM, Kan YW, Wilson AC. Rapid duplication and loss of genes coding for the alpha chains of hemoglobin. Proc Natl Acad Sci USA (1980) 77:2158–2162.[Abstract/Free Full Text]

Accepted for publication June 18, 2007.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
J. C. Opazo, F. G. Hoffmann, and J. F. Storz
Differential loss of embryonic globin genes during the radiation of placental mammals
PNAS, September 2, 2008; 105(35): 12950 - 12955.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. F. Storz, F. G. Hoffmann, J. C. Opazo, and H. Moriyama
Adaptive Functional Divergence Among Triplicated {alpha}-Globin Genes in Rodents
Genetics, March 1, 2008; 178(3): 1623 - 1638.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
F. G. Hoffmann, J. C. Opazo, and J. F. Storz
Rapid Rates of Lineage-Specific Gene Duplication and Deletion in the {alpha}-Globin Gene Family
Mol. Biol. Evol., March 1, 2008; 25(3): 591 - 602.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
24/9/1982    most recent
msm127v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hoffmann, F. G.
Right arrow Articles by Storz, J. F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hoffmann, F. G.
Right arrow Articles by Storz, J. F.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?