Molecular Biology and Evolution, Vol 15, 583-589, Copyright © 1998 by Society for Molecular Biology and Evolution
MA Huynen and E van Nimwegen
We compare the frequency distribution of gene family sizes in the complete
genomes of six bacteria (Escherichia coli, Haemophilus influenzae,
Helicobacter pylori, Mycoplasma genitalium, Mycoplasma pneumoniae, and
Synechocystis sp. PCC6803), two Archaea (Methanococcus jannaschii and
Methanobacterium thermoautotrophicum), one eukaryote (Saccharomyces
cerevisiae), the vaccinia virus, and the bacteriophage T4. The sizes of the
gene families versus their frequencies show power- law distributions that
tend to become flatter (have a larger exponent) as the number of genes in
the genome increases. Power-law distributions generally occur as the limit
distribution of a multiplicative stochastic process with a boundary
constraint. We discuss various models that can account for a multiplicative
process determining the sizes of gene families in the genome. In
particular, we argue that, in order to explain the observed distributions,
gene families have to behave in a coherent fashion within the genome; i.e.,
the probabilities of duplications of genes within a gene family are not
independent of each other. Likewise, the probabilities of deletions of
genes within a gene family are not independent of each other.
ORIGINAL ARTICLE
The frequency distribution of gene family sizes in complete genomes
Santa Fe Institute, New Mexico, USA. huynen@embl-heidelberg.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. J. Oliver, D. Petrov, D. Ackerly, P. Falkowski, and O. M. Schofield The mode and tempo of genome size evolution in eukaryotes Genome Res., May 1, 2007; 17(5): 594 - 601. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhao, J. H. Thomas, N. Chen, J. A. Sheps, and D. L. Baillie Comparative Genomics and Adaptive Selection of the ATP-Binding-Cassette Gene Family in Caenorhabditis Species Genetics, March 1, 2007; 175(3): 1407 - 1418. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Shoja and L. Zhang A Roadmap of Tandemly Arrayed Genes in the Genomes of Human, Mouse, and Rat Mol. Biol. Evol., November 1, 2006; 23(11): 2134 - 2141. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. W. Hahn, T. De Bie, J. E. Stajich, C. Nguyen, and N. Cristianini Estimating the tempo and mode of gene family evolution from comparative genomic data Genome Res., August 1, 2005; 15(8): 1153 - 1160. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Kunin, S. A. Teichmann, M. A. Huynen, and C. A. Ouzounis The properties of protein family space depend on experimental design Bioinformatics, June 1, 2005; 21(11): 2618 - 2622. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Shakhnovich, E. Deeds, C. Delisi, and E. Shakhnovich Protein structure and evolutionary history determine sequence space topology Genome Res., March 1, 2005; 15(3): 385 - 392. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Meinel, A. Krause, H. Luz, M. Vingron, and E. Staub The SYSTERS Protein Family Database in 2005 Nucleic Acids Res., January 1, 2005; 33(suppl_1): D226 - D229. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Chothia, J. Gough, C. Vogel, and S. A. Teichmann Evolution of the Protein Repertoire Science, June 13, 2003; 300(5626): 1701 - 1703. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. G. Berg and C. G. Kurland Evolution of Microbial Genomes: Sequence Acquisition and Loss Mol. Biol. Evol., December 1, 2002; 19(12): 2265 - 2276. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. C. Conant and A. Wagner GenomeHistory: a software tool and its application to fully sequenced genomes Nucleic Acids Res., August 1, 2002; 30(15): 3378 - 3386. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Lespinet, Y. I. Wolf, E. V. Koonin, and L. Aravind The Role of Lineage-Specific Gene Family Expansion in the Evolution of Eukaryotes Genome Res., July 1, 2002; 12(7): 1048 - 1059. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Yanai, A. Derti, and C. DeLisi Genes linked by fusion events are generally of the same functional category: A systematic analysis of 30 microbial genomes PNAS, July 3, 2001; 98(14): 7940 - 7945. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Perez-Rueda and J. Collado-Vides The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12 Nucleic Acids Res., April 15, 2000; 28(8): 1838 - 1847. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. K. Jordan, K. S. Makarova, J. L. Spouge, Y. I. Wolf, and E. V. Koonin Lineage-Specific Gene Expansions in Bacterial and Archaeal Genomes Genome Res., April 1, 2001; 11(4): 555 - 565. [Abstract] [Full Text] |
||||






