MBE Advance Access published online on September 13, 2006
Molecular Biology and Evolution, doi:10.1093/molbev/msl117
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
* To whom correspondence should be addressed. The majority of proteins consist of multiple domains that are either repeated or combined in defined order. In this study, we survey the combination of protein domains defined at fold and fold superfamily levels in 185 genomes belonging to organisms that have been fully sequenced and introduce a method that reconstructs rooted phylogenomic trees from the content and arrangement of domains in proteins at a genomic level. We find that the majority of domain combinations were unique to Archaea, Bacteria or Eukarya, suggesting most combinations originated after life had diversified. Domain-repeat and domain repeat within multi-domain proteins increased notably in eukaryotes, mainly at the expense of single-domain and domain-pair proteins. This increased was mostly confined to Metazoa. We also find an unbalanced sharing of domain combinations that suggests Eukarya is more closely related to Bacteria than to Archaea, an observation that challenges the widely assumed eukaryote-archaebacterial sisterhood relationship. The occurrence and abundance of the molecular repertoire (interactome) of domain combinations was used to generate phylogenomic trees. These global interactome-based phylogenies described organismal histories satisfactorily, revealing the tripartite nature of life, and supporting controversial evolutionary patterns, such as the Coelomata hypothesis, the grouping of plants and animals, and the Gram-positive origin of bacteria. Results suggest strongly that the process of domain combination is not random but curved by evolution, rejecting the null hypothesis of domain modules combining in the absence of natural selection or an optimality criterion.
Accepted September 8, 2006
Research Article
Global Phylogeny Determined by the Combination of Protein Domains in Proteomes
Minglei Wang 1 and Gustavo Caetano-Anollés 1 *
Gustavo Caetano-Anollés, E-mail: gca{at}uiuc.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. K. Basu, L. Carmel, I. B. Rogozin, and E. V. Koonin Evolution of protein domain promiscuity in eukaryotes Genome Res., March 1, 2008; 18(3): 449 - 461. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Forslund, A. Henricson, V. Hollich, and E. L. L. Sonnhammer Domain Tree-Based Analysis of Protein Architecture Evolution Mol. Biol. Evol., February 1, 2008; 25(2): 254 - 264. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Wang, L. S. Yafremava, D. Caetano-Anolles, J. E. Mittenthal, and G. Caetano-Anolles Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world Genome Res., November 1, 2007; 17(11): 1572 - 1585. [Abstract] [Full Text] [PDF] |
||||

