MBE Advance Access originally published online on August 3, 2005
Molecular Biology and Evolution 2005 22(12):2386-2394; doi:10.1093/molbev/msi229
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Evolutionary Change of the Numbers of Homeobox Genes in Bilateral Animals
Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University
E-mail: jnam{at}caltech.edu.
| Abstract |
|---|
|
|
|---|
It has been known that the conservation or diversity of homeobox genes is responsible for the similarity and variability of some of the morphological or physiological characters among different organisms. To gain some insights into the evolutionary pattern of homeobox genes in bilateral animals, we studied the change of the numbers of these genes during the evolution of bilateral animals. We analyzed 2,031 homeodomain sequences compiled from 11 species of bilateral animals ranging from Caenorhabditis elegans to humans. Our phylogenetic analysis using a modified reconciled-tree method suggested that there were at least about 88 homeobox genes in the common ancestor of bilateral animals. About 5060 genes of them have left at least one descendant gene in each of the 11 species studied, suggesting that about 3040 genes were lost in a lineage-specific manner. Although similar numbers of ancestral genes have survived in each species, vertebrate lineages gained many more genes by duplication than invertebrate lineages, resulting in more than 200 homeobox genes in vertebrates and about 100 in invertebrates. After these gene duplications, a substantial number of old duplicate genes have also been lost in each lineage. Because many old duplicate genes were lost, it is likely that lost genes had already been differentiated from other groups of genes at the time of gene loss. We conclude that both gain and loss of homeobox genes were important for the evolutionary change of phenotypic characters in bilateral animals.
Key Words: homeobox genes molecular evolution gene duplication gene loss evolutionary developmental biology
| Introduction |
|---|
|
|
|---|
Homeobox genes that regulate morphogenesis were first discovered by Garber, Kuroiwa, and Gehring (1983)
Animal homeobox genes were previously classified into about 30 different groups or families based on their sequence similarity and protein domain structure (Burglin 1994
). Additional groups of homeobox genes were identified later (e.g., PBC, MEIS, PKNOX/PREP, TGIF, and IRO; Burglin 1997
), and now the homeobox genes in animals can be classified into at least 49 different gene families (see Burglin 2005
, for a more detailed classification). Member genes of the same family are often functionally related, and different families of homeobox genes are concerned with different aspects of development (Burglin 1994
). For example, the genes of the HOX, CDX, and EVX families and their cognate genes play important roles in different steps of body pattern formation during early embryogenesis of animals. The PAX6, SIX, VAX, and EMX gene families are concerned with the development of eyes, whereas the LIM and HMX gene families are important in the development of neurons (reviewed in Duboule 1994
). Because of their important roles in development, homeobox genes have been studied extensively by both developmental and evolutionary biologists.
Homeobox genes are generally highly conserved and control similar phenotypic characters among distantly related organisms (reviewed in De Robertis 1994
). However, they are also responsible for controlling different phenotypic characters among relatively closely related species (e.g., Galant and Carroll 2002
; Ronshaugen, N. McGinnis, and W. McGinnis 2002
). The formation of similar phenotypic characters can be explained by the conservation of shared homeobox genes. By contrast, different phenotypic characters are believed to be generated by duplication of homeobox genes and their functional differentiation. It has also been hypothesized that the loss of some homeobox genes are responsible for morphological differentiation (Ruddle et al. 1994
). Therefore, it is interesting to study the pattern of duplication and loss of homeobox genes to have some insights into the evolutionary change of phenotypic characters. It is likely that the number of homeobox genes is related to the complexity of organisms.
Although the patterns of gain and loss of homeobox genes belonging to some families have been studied (e.g., Zhang and Nei 1996
; Aparicio et al. 1997
; Kappen 2000
; Wada et al. 2003
; Amores et al. 2004
; Edvardsen et al. 2005
), no one appears to have studied the gain and loss of the entire set of homeobox genes covering diverse bilateral animals. We have therefore decided to study the evolutionary change of the homeobox gene superfamily examining 11 completely or nearly completely sequenced genomes from bilateral animals. For this purpose, we used a modified version of the reconciled-tree method (Goodman et al. 1979
; Page and Charleston 1997
) taking into account the ambiguity of gene tree. Although our estimates are crude and conservative, we can still obtain a rough picture of the gain and loss of homeobox genes and their significance for morphological evolution. The method used and the results obtained will be presented in this paper.
| Materials and Methods |
|---|
|
|
|---|
Identification of Homeodomain-Containing Proteins
To find homeodomain-containing proteins, we performed homology search using the tool Psi-Blast (Altschul et al. 1997
105 (see Supplementary Material online). We also searched for homeobox genes from the expressed sequence tag (EST) database of the tunicate from the DNA Data Bank of Japan (http://www.ddbj.nig.ac.jp/) because Wada et al. (2003)
Phylogenetic Analysis
Because the homeodomain is the only alignable region between different groups of homeodomain-containing proteins, we used only this domain (
60 aa) for phylogenetic analysis. The homeodomain sequences were aligned against the alignment of 207 query sequences (seed alignments) using the profile alignment of the ClustalX program (Thompson et al. 1997
). We then constructed a neighbor-joining tree (Saitou and Nei 1987
) using the computer program NJBOOT (Takezaki, Rzhetsky, and Nei 1995
) with the pairwise deletion option, proportional amino acid differences (p-distances), and 1,000 bootstrap resamplings (Nei and Kumar 2000
). Because of the large number of sequences used, other tree construction methods such as maximum-parsimony and maximum-likelihood methods were not used.
Each homeobox gene was assigned to one of the 49 previously defined groups according to the sequence similarity and protein domain structure. Domain structure was examined by using the computer program HMMER (Eddy 2001
) for each protein domain profile downloaded from the Pfam (http://pfam.wustl.edu/). A phylogenetic tree for the 49 families of genes was constructed to find their evolutionary relationships.
Estimation of the Number of Genes in the Ancestral Species
When there is a rooted tree of m species, the tree has m 1 ancestral nodes or species (Nei and Kumar 2000
). We are interested in estimating the number of homeobox genes in each of the ancestral species and how the number has changed in the evolutionary process. This can be studied by comparing the species tree with the gene tree for a given set of genes and constructing a reconciled tree (Goodman et al. 1979
; Page and Charleston 1997
). In this paper, we use a modified version of this reconciled-tree method in which multifurcating branching patterns are taken into account.
For simplicity, let us consider the tree of three species
, ß, and
in figure 1A and assume that species
, ß, and
have three, two, and two genes, respectively. Suppose that the gene tree inferred for the seven genes from the three species is given in figure 1B and that this tree represents the true gene tree. This gene tree can be decomposed into three groups of genes (I, II, and III), in which genes a, b, and c come from species
, ß, and
, respectively. Group I genes do not include any c gene but contain two pairs of genes a and b. Therefore, to reconcile this portion of the gene tree with the species tree under the principle of parsimony, we must assume that one deletion of gene c and one duplication of the ancestral gene of genes a and b occurred (fig. 1C). Similarly, to reconcile the group II genes with the species tree, we will have to assume that gene b was deleted. In the case of group III genes, we have to consider the deletion of genes a and b. In this case, we assume that one event of deletion occurred in the ancestral lineage of genes a and b. The reconciled tree in figure 1C therefore suggests that the ancestral species
had three genes (ancestral genes of the three groups of genes).
|
A similar inference indicates that the ancestral species
also contained three genes, i.e., two genes in group I, one gene in group II, and zero gene in group III (fig. 1C). Figure 1A now shows the change of the number of genes in the evolutionary lineages for
, ß, and
.
In the above estimation of the number of genes, we assumed that the gene tree is correct. In practice, however, some interior branches are often weakly supported in terms of bootstrap values. For example, one interior branch may have a low bootstrap value (<50% in the present study). In this case, the existence of this branch is questionable, so that the length of this branch is reduced to zero, and a condensed tree (Nei and Kumar 2000
) is constructed (fig. 1D). The real gene tree in this case will be one of the three possible trees given in figure 1EG. Tree E is identical with tree B, and we already estimated the number of genes in the ancestral species
and
. In the case of tree F, the reconciled tree is given by figure 1H, and the numbers of genes in species
and
are given in figure 1I. This tree is more parsimonious than tree A with respect to the change of gene number. Tree G is different from tree F in that group II genes are closer to gene c, but the number of genes estimated in the ancestral organisms becomes the same as those for tree B. Therefore, we assume that tree I gives the actual numbers of ancestral species. Note that if low bootstrap support was observed between two clades representing different family of genes, we did not apply this multifurcation rule.
When there are several branches with low bootstrap values, the numbers of genes in ancestral species are estimated by the same procedure as the above under the principle of parsimony. Therefore, one can estimate the number of genes for any number of ancestral species. Obviously, the number of genes estimated would be minimal, but because homeobox genes evolve very slowly, the present method appears to give reasonably good estimates (see below). When m is large, the computation can be quite complicated, and we have developed a computer program (available by request to J.N.).
| Results |
|---|
|
|
|---|
Number of Homeobox Genes in the Genome
Table 1 shows the numbers of nonredundant homeobox genes obtained from the annotated gene sets of 11 species. The majority of the homeobox genes encode only one homeodomain (single-homeobox genes), but some encode more than one domain (multihomeobox genes). The number of homeodomains encoded by a multihomeobox gene was less than 10 with some exceptions. All vertebrate species studied (pufferfish, zebrafish, frog, mouse, rat, and humans) had about 200 or more homeobox genes, and all invertebrate species (C. elegans, C. briggsae, fruitfly, mosquito, and tunicate) had about 100 or fewer homeobox genes. All the sequences used are presented as Supplementary Material online (see file 1).
|
Evolutionary Relationships of Different Families of Homeobox Genes
The majority of the homeobox genes were assigned into the 49 previously defined groups or families. The remaining homeobox genes were either highly divergent or multihomeobox genes. The list of the genes in each of the 49 groups and the multihomeobox genes is available from the Supplementary Material online. Figure 2 shows the evolutionary relationships of the 49 groups of genes. The homeobox gene superfamily was initially classified into two groups, the typical and atypical homeobox gene groups (Burglin 1994
|
At least 13 groups of homeobox genes (gene groups with orange boxes in fig. 2) encode other evolutionarily conserved domains in addition to the homeodomain according to the hidden Markov model searches for conserved domains using the Pfam database (see Burglin 2005
Figure 2 also shows the numbers of homeobox genes for each of the 49 families as well as for those of multihomeobox genes and unassigned genes. In most groups, vertebrates have about two to four times more genes than invertebrates. However, there are several gene groups that do not show this pattern. For example, the NOT gene, which is important for the development of notochords in zebrafish (Talbot et al. 1995
), has been found as a single-copy gene in three invertebrate species, fruitflies, mosquitoes, and tunicates, and three vertebrate species, zebrafish, pufferfish, and frog, but no gene has been found in three mammalian species, mouse, rat, and humans. Recently, it has been claimed that a mouse ortholog of the NOT gene was found (Abdelkhalek et al. 2004
; Plouhinec et al. 2004
), but the phylogenetic analysis presented actually suggests that it is a paralog of the NOT genes. There are also other families of genes where vertebrates have fewer or no more genes than those of invertebrates. Because the genome sequencing or annotation of several species has not been completed, the absence of some families of genes should be reexamined, though it may not change the general pattern of evolution. It is also possible that the genes have not been lost completely but only their homeoboxes are missing, and other regions of the gene are still functional as in the case of some PAX genes (Chi and Epstein 2002
).
Evolutionary Change of the Number of Homeobox Genes in Bilateral Animals
Knowing that there were already many homeobox genes in the MRCA of all the 11 species (archi-MRCA), we estimated the numbers of homeobox genes in all ancestral organisms and their increase and decrease in different stages of the evolution of bilateral animals. We constructed a phylogenetic tree of 2,031 homeodomain sequences compiled from single- and multihomeodomaincontaining proteins in relation to the species tree (fig. 3). The species tree is based on the observation that insects, tunicates, and vertebrates are coelomates but nematodes are pseudocoelomates (Coelomata hypothesis) as well as phylogenetic analyses using more than 100 nuclear proteins from several species (e.g., Blair et al. 2002
; Wolf, Rogozin, and Koonin 2004
; Philip, Creevey, and McInerney 2005
). (The Ecdysozoa hypothesis in which insects and nematodes are sister groups will be considered later.) Because the protein sequences used are generally closely related, we used a bootstrap cutoff point of 50% for generating a multifurcating node of gene trees. As mentioned above, the number of homeoboxes in multihomeobox genes varies with the gene, and therefore it is difficult to estimate the real number of ancestral genes for these genes. We therefore decided to regard each homeobox as one gene. However, this will not affect our results significantly because the number of multihomeobox genes included was small.
|
To check the reliability of our estimates, we first analyzed HOX family genes. The numbers of ancestral HOX genes at several evolutionary time points have already been estimated by several researchers (e.g., Holland and Garcia-Fernandez 1996
, ß,
,
, and
in figure 3A were 5, 6, 9, 43, and 39, respectively, whereas our estimates were 4, 6, 9, 24, and 26 in this order. This result suggests that our estimates of the numbers of genes in the ancestral species tend to be smaller than the previous ones. This has happened mainly because some of the paralogous homeodomains from different species had identical or nearly identical sequences, and therefore the interior branches involved sometimes had zero substitutions and showed low bootstrap values. In this case, the numbers of genes in ancestral organisms are expected to be underestimated. In particular, the number of genes in the MRCAs
and
was underestimated for this reason. However, we note that the number of genes gained at the exterior branch of each lineage is likely to have been overestimated because of underestimation of the numbers of ancestral genes.
Increase of Homeobox Genes in the Evolutionary Process
Keeping in mind this possibility of underestimation, we estimated the numbers of ancestral genes and the numbers of genes lost and gained for the entire homeobox gene superfamily. Figure 3B shows that there were at least 88 ancestral homeobox genes in the archi-MRCA. This archi-MRCA already had several genes in some homeobox families (e.g., six genes in PAX group) (fig. 2). However, some gene families (LBX, VAX/NOT, MEOX, and zinc-finger [ZF]) were not found in nematodes (fig. 2), and separation of these gene families from other families was not always clear-cut in the tree of 2,031 sequences (data not shown). Therefore, these gene families were not considered in the estimation of genes in the archi-MRCA. However, figure 2 shows that these groups of genes already diverged from other groups of genes before the nematode and mammalian split. Therefore, it is possible that the genes from these four families actually existed in the archi-MRCA. If this is the case, the number of homeobox genes in the archi-MRCA would increase to about 92.
After the divergence of coelomates and pseudocoelomates, the number of homeobox genes increased almost threefold in the vertebrate lineages. In invertebrates, however, the increase was small or moderate, and our results suggest that the number of homeobox genes did not merely increase during the evolutionary process, but the number sometimes decreased. For example, the MRCA of insects and vertebrates had at least 118 homeobox genes, but fruitflies have 102 at present. Tunicates also have fewer homeobox genes than the MRCA of tunicates and vertebrates. In the case of vertebrate lineages, the number of genes increased primarily in two time periods, that is, the early stages of coelomate evolution (between nodes
and ß in fig. 3B) and the early stages of vertebrate evolution (between nodes
and
in fig. 3B). The major increase of gene number in these two time periods are consistent with that of the total number of genes in the genome by X. Gu, Wang, and J. Gu (2002)
. Note that if a gene is duplicated and one of the two duplicate genes is lost during the same time interval, our approach cannot detect them, as mentioned earlier. Therefore, the gene losses estimated here are losses of fairly old duplicate genes.
The Ecdysozoa hypothesis (e.g., Aguinaldo et al. 1997
; H. Dopazo and J. Dopazo 2005
) proposes that insects are more closely related to nematodes than to vertebrates. We therefore examined the numbers of ancestral genes for all the MRCAs of the species tree (fig. 4). A conspicuous difference between the two hypotheses was observed in the archi-MRCA (node
), the numbers for the former and the latter hypotheses being 88 and 145, respectively. In other MRCAs, the number was nearly the same for the two hypotheses.
|
The Coelomata and the Ecdysozoa hypotheses are quite controversial now. However, the studies based on a large number of nuclear protein sequences (e.g., Blair et al. 2002
Retention and Loss of Ancestral Homeobox Genes in Each Species
It is interesting to know how many gene families of the archi-MRCA have left descendent genes in the 11 species and how many gene families have been lost during this evolutionary period. Figure 2 shows that in nematodes 12 of the 49 gene families in the archi-MRCA have been lost, whereas 32 of them have been retained, the remaining 5 gene families existing in neither archi-MRCA nor nematodes. In vertebrates, however, all the archi-MRCA genes were found. In addition, some new gene families originated in insects, tunicates, and vertebrates. The HOX gene family has the largest number of member genes in vertebrates, and the archi-MRCA also had a relatively high number of member genes (four genes). The highest number of archi-MRCA genes was observed in the LIM family, but the number of genes did not increase very much compared with the HOX gene family. Some gene families such as the BSH and VSX families had small numbers of genes in all species.
We also studied the numbers of ancestral homeobox genes lost during the time period from the archi-MRCA to the present species (table 2). Let us consider figure 1C to illustrate how we counted the numbers of genes lost. This figure shows three groups of genes (I, II, and III) that were derived from the three genes in the ancestral species
. In group I, genes a and b are retained, but gene c was lost. In group II, genes a and c are retained, but gene b was lost. Similarly, in group III genes, gene c is retained, but genes a and b were lost. Therefore, in species
, two ancestral genes are retained and one gene was lost. Similarly, the number of genes lost is two in species ß and one in species
. The total number of genes lost in each extant species is given by the sum of these losses for all gene families. Note that this number of gene losses can be computed for each MRCA.
|
Table 2 shows that the invertebrate lineages lost about 3038 genes of the 88 ancestral genes in the archi-MRCA, whereas the vertebrate lineages lost about 2528 genes during the same period. This suggests that invertebrates lost somewhat more genes than vertebrates. However, the difference is much smaller than that observed with full genome analysis, which suggests that about two-fold or more gene losses occurred in the lineage leading to C. elegans than the lineage leading to human (Hughes and Friedman 2004
and
), fishes lost somewhat smaller numbers of genes than other vertebrates. These results suggest that the degree of gene loss varies significantly among different families of homeobox genes, but it is not so different among different species. When the Ecdysozoa tree was used, the number of genes lost from the archi-MRCA is more than two times greater than that for the Coelomata tree (table 3). This indicates that the Coelomata tree is much more parsimonious than the Ecdysozoa tree. Therefore, our data support the former tree, as mentioned earlier.
|
| Discussion |
|---|
|
|
|---|
In this study, we showed that there were at least 88 homeobox genes in the archi-MRCA of bilateral animals when the Coelomata tree was used. Previously, we mentioned that our statistical method would give minimum estimates of the numbers of ancestral genes. However, our estimate of the total number of genes in the archi-MRCA is close to the current number of genes in nematodes and insects. This is also true with the number in each gene family. These observations suggest that our estimates may not be too far off from the true numbers. Furthermore, the similarity of the estimates for the archi-MRCA and those for nematodes and insects suggest that the archi-MRCA had the same degree of phenotypic complexity as that of current nematodes or insects. Because vertebrates gained more homeobox genes than invertebrates, it appears that this increase in the number of homeobox genes is responsible for the formation of more complex characters in vertebrates than in invertebrates.
We have also seen that many homeobox genes have been lost in the process of evolution of phenotypic characters. This loss of homeobox genes might have been either inactivation of redundant genes after gene duplication or loss of functionally differentiated genes (Ruddle et al. 1994
; Wagner, Amemiya, and Ruddle 2003
). The genes lost in our study are losses of fairly old duplicate genes, and therefore it is likely that the genes lost were already functionally differentiated from their paralogous genes at the time of gene loss. This raises the question of why genes could be lost so often. There are at least three possible explanations. First, without closely related paralogous genes, functional redundancy can be achieved by something called distributed robustness (reviewed in Wagner 2005
). In other words, loss (or mutation) of a homeobox gene can be buffered by the rewiring of functionally different parts of the regulatory network. If so, it is possible that losses of homeobox genes might not have caused any noticeable changes of phenotypes. Second, it is also possible that gene loss occasionally has had beneficial effects. For example, loss of genes may be related to the reduction of unused characters. Third, the phenotypic changes caused by the loss of homeobox genes might have been more or less neutral with respect to fitness. In the case of multifunctional genes, this is possible if the critical functions are shared by duplicate genes.
The gain and loss of homeobox genes are probably initially opportunistic, but these events may change the evolutionary courses of different organisms. However, the possible causes of gene loss mentioned above are speculative, and more detailed studies are needed to identify the real reason.
| Supplementary Material |
|---|
|
|
|---|
Supplementary files are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
- A file containing 207 query sequences.
- A file containing 2,031 homeodomain sequences.
- A file containing list of genes for each of 49 families and multihomeodomain proteins.
| Acknowledgements |
|---|
|
|
|---|
We thank David Geiser, Claude dePamphilis, Hong Ma, Jan Klein, Junyul Kim, Li Hao, Yoshi Suzuki, and Kerstin Kaufmann for valuable comments on an earlier version of the manuscript. We also thank Thomas Bürglin for sending his up-to-date classification of homeobox genes. This study was supported by the National Institutes of Health Grant GM20293 to M.N.
| Footnotes |
|---|
William Martin, Associate Editor
| References |
|---|
|
|
|---|
Abdelkhalek, H. B., A. Beckers, K. Schuster-Gossler et al. (14 co-authors). 2004. The mouse homeobox gene Not is required for caudal notochord development and affected by the truncate mutation. Genes Dev. 18:17251736.
Aguinaldo, A. M., J. M. Turbeville, L. S. Linford, M. C. Rivera, J. R. Garey, R. A. Raff, and J. A. Lake. 1997. Evidence for a clade of nematodes, arthropods and other moulting animals. Nature 387:489493.[CrossRef][Medline]
Altschul, S. F., T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:33893402.
Amores, A., T. Suzuki, Y. L. Yan, J. Pomeroy, A. Singer, C. Amemiya, and J. H. Postlethwait. 2004. Developmental roles of pufferfish Hox clusters and genome evolution in ray-fin fish. Genome Res. 14:110.
Aparicio, S., K. Hawker, A. Cottage, Y. Mikawa, L. Zuo, B. Venkatesh, E. Chen, R. Krumlauf, and S. Brenner. 1997. Organization of the Fugu rubripes Hox clusters: evidence for continuing evolution of vertebrate Hox complexes. Nat. Genet. 16:7983.[CrossRef][ISI][Medline]
Bharathan, G., B. J. Janssen, E. A. Kellogg, and N. Sinha. 1997. Did homeodomain proteins duplicate before the origin of angiosperms, fungi, and metazoa? Proc. Natl. Acad. Sci. USA 94:1374913753.
Blair, J. E., K. Ikeo, T. Gojobori, and S. B. Hedges. 2002. The evolutionary position of nematodes. BMC Evol. Biol. 2:17.[CrossRef][Medline]
Burglin, T. R. 1994. A comprehensive classification of homeobox genes. Pp. 2572 in D. Duboule, ed. Guidebook to the homeobox genes. Oxford University Press, New York.
. 1997. Analysis of TALE superclass homeobox genes (MEIS, PBC, KNOX, Iroquois, TGIF) reveals a novel domain conserved between plants and animals. Nucleic Acids Res. 25:41734180.
. 2005. Homeodomain proteins. Pp. 179222 in R. A. Meyers, ed. Encyclopedia of molecular cell biology and molecular medicine. Wiley-VCH Verlag GmbH & Co., Weinheim, Germany.
Chi, N., and J. A. Epstein. 2002. Getting your Pax straight: Pax proteins in development and disease. Trends Genet. 18:4147.[CrossRef][ISI][Medline]
De Robertis, E. M. 1994. The homeobox in cell differentiation and evolution. Pp. 1323 in D. Duboule, ed. Guidebook to the homeobox genes. Oxford University Press, New York.
Dopazo, H., and J. Dopazo. 2005. Genome-scale evidence of the nematode-arthropod clade. Genome Biol. (in press).
Duboule, D. 1994. Guidebook to the homeobox genes. Oxford University Press, New York.
Eddy, S. R. 2001. HMMER: profile hidden Markov models for biological sequence analysis. (http://hmmer.wustl.edu/).
Edvardsen, R. B., H. C. Seo, M. F. Jensen et al. (11 co-authors). 2005. Remodelling of the homeobox gene complement in the tunicate Oikopleura dioica. Curr. Biol. 15:R12R13.[CrossRef][ISI][Medline]
Galant, R., and S. B. Carroll. 2002. Evolution of a transcriptional repression domain in an insect Hox protein. Nature 415:910913.[CrossRef][Medline]
Garber, R. L., A. Kuroiwa, and W. J. Gehring. 1983. Genomic and cDNA clones of the homeotic locus Antennapedia in Drosophila. EMBO J. 2:20272036.[ISI][Medline]
Goodman, M., J. Czelusniak, G. W. Moore, A. E. Romero-Herrera, and G. Matsuda. 1979. Fitting the gene lineage into its species lineage, a parsimony strategy illustrated by cladograms constructed from globin sequences. Syst. Zool. 28:132163.[CrossRef]
Gu, X., Y. Wang, and J. Gu. 2002. Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution. Nat. Genet. 31:205209.[CrossRef][ISI][Medline]
Holland, P. W., and J. Garcia-Fernandez. 1996. Hox genes and chordate evolution. Dev. Biol. 173:382395.[CrossRef][ISI][Medline]
Hughes, A. L., and R. Friedman. 2004. Shedding genomic ballast: extensive parallel loss of ancestral gene families in animals. J. Mol. Evol. 59:827833.[CrossRef][ISI][Medline]
Kappen, C. 2000. Analysis of a complete homeobox gene repertoire: implications for the evolution of diversity. Proc. Natl. Acad. Sci. USA 97:44814486.
Koonin, E. V., N. D. Fedorova, J. D. Jackson et al. (18 co-authors). 2004. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol. 5:R7.[CrossRef][Medline]
Kumar, S., and S. B. Hedges. 1998. A molecular timescale for vertebrate evolution. Nature 392:917920.
McGinnis, W., M. S. Levine, E. Hafen, A. Kuroiwa, and W. J. Gehring. 1984. A conserved DNA sequence in homoeotic genes of the Drosophila Antennapedia and bithorax complexes. Nature 308:428433.[CrossRef][Medline]
Nei, M., and S. Kumar. 2000. Molecular evolution and phylogenetics. Oxford Press, New York.
Nei, M., P. Xu, and G. Glazko. 2001. Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms. Proc. Natl. Acad. Sci. USA 98:24972502.
Ogura, A., K. Ikeo, and T. Gojobori. 2005. Estimation of ancestral gene set of bilaterian animals and its implications to dynamic change of gene content in bilaterian evolution. Gene 345:6571.[CrossRef][ISI][Medline]
Page, R. D., and M. A. Charleston. 1997. From gene to organismal phylogeny: reconciled trees and the gene tree/species tree problem. Mol. Phylogenet. Evol. 8:349362.[CrossRef][ISI][Medline]
Philip, G. K., C. J. Creevey, and J. O. McInerney. 2005. The Opisthokonta and the Ecdysozoa may not be clades: stronger support for the grouping of plant and animal than for animal and fungi and stronger support for the Coelomata than Ecdysozoa. Mol. Biol. Evol. 22:11751184.
Plouhinec, J. L., C. Granier, C. Le Mentec, K. A. Lawson, D. Saberan-Djoneidi, J. Aghion, D. L. Shi, J. Collignon, and S. Mazan. 2004. Identification of the mammalian Not gene via a phylogenomic approach. Gene Expr. Patterns 5:1122.[CrossRef][Medline]
Ronshaugen, M., N. McGinnis, and W. McGinnis. 2002. Hox protein mutation and macroevolution of the insect body plan. Nature 415:914917.[CrossRef][Medline]
Ruddle, F. H., K. L. Bentley, M. T. Murtha, and N. Risch. 1994. Gene loss and gain in the evolution of the vertebrates. Dev. Suppl.:155161.
Saitou, N., and M. Nei. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406425.[Abstract]
Scott, M. P., and A. J. Weiner. 1984. Structural relationships among genes that control development: sequence homology between the Antennapedia, Ultrabithorax, and fushi tarazu loci of Drosophila. Proc. Natl. Acad. Sci. USA 81:41154119.
Scott, M. P., A. J. Weiner, T. I. Hazelrigg, B. A. Polisky, V. Pirrotta, F. Scalenghe, and T. C. Kaufman. 1983. The molecular organization of the Antennapedia locus of Drosophila. Cell 35:763776.[CrossRef][ISI][Medline]
Shu, D. G., S. C. Morris, J. Han, L. Chen, X. L. Zhang, Z. F. Zhang, H. Q. Liu, Y. Li, and J. N. Liu. 2001. Primitive deuterostomes from the Chengjiang Lagerstatte (Lower Cambrian, China). Nature 414:419424.[CrossRef][Medline]
Stellwag, E. J. 1999. Hox gene duplication in fish. Semin. Cell. Dev. Biol. 10:531540.[CrossRef][ISI][Medline]
Takezaki, N., A. Rzhetsky, and M. Nei. 1995. Phylogenetic test of the molecular clock and linearized trees. Mol. Biol. Evol. 12:823833.[Abstract]
Talbot, W. S., B. Trevarrow, M. E. Halpern, A. E. Melby, G. Farr, J. H. Postlethwait, T. Jowett, C. B. Kimmel, and D. Kimelman. 1995. A homeobox gene essential for zebrafish notochord development. Nature 378:150157.[CrossRef][Medline]
Thompson, J. D., T. J. Gibson, F. Plewniak, F. Jeanmougin, and D. G. Higgins. 1997. The ClustalX windows interface, flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 24:48764882.
Wada, S., M. Tokuoka, E. Shoguchi et al. (13 co-authors). 2003. A genomewide survey of developmentally relevant genes in Ciona intestinalis. II. Genes for homeobox transcription factors. Dev. Genes Evol. 213:222234.[CrossRef][ISI][Medline]
Wagner, A. 2005. Distributed robustness versus redundancy as causes of mutational robustness. Bioessays 27:176188.[CrossRef][ISI][Medline]
Wagner, G. P., C. Amemiya, and F. Ruddle. 2003. Hox cluster duplications and the opportunity for evolutionary novelties. Proc. Natl. Acad. Sci. USA 100:1460314606.
Wolf, Y. I., I. B. Rogozin, and E. V. Koonin. 2004. Coelomata and not Ecdysozoa: evidence from genome-wide phylogenetic analysis. Genome Res. 14:2936.
Zhang, J., and M. Nei. 1996. Evolution of Antennapedia-class homeobox genes. Genetics 142:295303.[Abstract]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. Nozawa, Y. Kawahara, and M. Nei From the Cover: Genomic drift and copy number variation of sensory receptor genes in humans PNAS, December 18, 2007; 104(51): 20421 - 20426. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Y. Low, H. L. Ng, C. J. Morton, M. W. Parker, P. Batterham, and C. Robin Molecular Evolution of Glutathione S-Transferases in the Genus Drosophila Genetics, November 1, 2007; 177(3): 1363 - 1375. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wang and J. Zhang Rapid evolution of primate ESX1, an X-linked placenta- and testis-expressed homeobox gene Hum. Mol. Genet., September 1, 2007; 16(17): 2053 - 2060. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Gitelman Evolution of the vertebrate twist family and synfunctionalization: a mechanism for differential gene loss through merging of expression domains Mol. Biol. Evol., September 1, 2007; 24(9): 1912 - 1925. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nei The new mutation theory of phenotypic evolution PNAS, July 24, 2007; 104(30): 12235 - 12242. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Nozawa and M. Nei Evolutionary dynamics of olfactory receptor genes in Drosophila species PNAS, April 24, 2007; 104(17): 7122 - 7127. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||


" mark) in invertebrates are either ZF multihomeobox genes that have their own vertebrate multihomeobox orthologs or unclassified genes that appear to have been derived from multihomeobox genes (data not shown). Note that the SIX family genes are typical homeobox genes and that the gene numbers for the HOX gene family are slightly higher than those of the genes in the HOX cluster. This is because other closely related genes (e.g., IPF) were also included in this family. We used the same notations as Burglin's (2005)




