MBE Advance Access originally published online on November 9, 2005
Molecular Biology and Evolution 2006 23(3):530-540; doi:10.1093/molbev/msj054
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Evolutionary Conservation of Expression Profiles Between Human and Mouse Orthologous Genes
Department of Ecology and Evolutionary Biology, University of Michigan
E-mail: jianzhi{at}umich.edu.
| Abstract |
|---|
|
|
|---|
Mouse models are often used to study human genes because it is believed that the expression and function are similar for the majority of orthologous genes between the two species. However, recent comparisons of microarray data from thousands of orthologous human and mouse genes suggested rapid evolution of gene expression profiles under minimal or no selective constraint. These findings appear to contradict nonarray-based observations from many individual genes and imply the uselessness of mouse models for studying human genes. Because absolute levels of gene expression are not comparable between species when the data are generated by species-specific microarrays, use of relative mRNA abundance among tissues (RA) is preferred to that of absolute expression signals. We thus reanalyze human and mouse genome-wide gene expression data generated by oligonucleotide microarrays. We show that the mean correlation coefficient among expression profiles detected by different probe sets of the same gene is only 0.38 for humans and 0.28 for mice, indicating that current measures of expression divergence are flawed because the large estimation error (discrepancy in expression signal detected by different probe sets of the same gene) is mistakenly included in the between-species divergence. When this error is subtracted, 84% of human-mouse orthologous gene pairs show significantly lower expression divergence than that of random gene pairs. In contrast to a previous finding, but consistent with the common sense, expression profiles of orthologous tissues between species are more similar to each other than to those of nonorthologous tissues. Furthermore, the evolutionary rate of expression divergence and that of coding sequence divergence are found to be weakly, but significantly positively correlated, when RA and the Euclidean distance are used to measure expression-profile divergence. These results highlight the importance of proper consideration of various estimation errors in comparing the microarray data between species.
Key Words: gene expression conservation humans mouse orthologous genes microarray
| Introduction |
|---|
|
|
|---|
Patterns and mechanisms of DNA and protein sequence evolution have been extensively studied in the past three decades (Li 1997
Based on oligonucleotide microarray data sets obtained from humans and mice (Su et al. 2002
), Yanai, Graur, and Ophir (2004)
found that the expression profiles of orthologous genes differ substantively between the two species, suggesting little selective constraint in the evolution of gene expression. Additionally, based on the expression similarity among 32 human and mouse tissues, they found orthologous tissues between species (e.g., human liver and mouse liver) to be less similar than nonorthologous tissues within species (e.g., human liver and human testis). Because tissue functions are determined by the genes expressed in the tissue, these results imply that the human liver is functionally more similar to the human testis than to the mouse liver, which is contrary to the common sense. Based on both oligonucleotide and cDNA microarrays, Khaitovich et al. (2004)
found that expression-level divergence between primate species increases linearly with divergence time and that functional genes and expressed pseudogenes have similar rates of expression evolution (Khaitovich et al. 2004
). Because pseudogenes evolve without any selective constraint, these results suggest that gene expression evolution is largely neutral, without the influence of either positive or purifying selection (Khaitovich et al. 2004
).
These findings are surprising for several reasons. First, it is well established that coding sequences and functions of most orthologous genes are conserved across species (Li 1997
). Because a gene must be expressed properly to function in the cell, it is puzzling why gene function should be conserved when the expression changes quickly. Second, the expression pattern and function of human genes are often inferred from their mouse orthologs (e.g., Hinds et al. 1993
), based on the assumption that these properties are conserved between the two species. The success of many mouse models of human genes and diseases suggests the validity of this assumption. Studies using traditional nonarray-based methods such as the northern analysis showed that the expression profiles of human-mouse orthologs are overall similar, although a quantitative genome-wide measure of mean similarity is difficult to obtain, due to large variations in experimental designs among these individual-gene studies. Third, a recent microarray-based study of the nematode Caenorhabditis elegans showed that transcriptome evolution is significantly faster in laboratory mutation-accumulation strains than in naturally isolated strains (Denver et al. 2005
). Because the sizes of the laboratory populations are much smaller than those of the wild populations, the observation of Denver et al (2005)
is best explained by purifying selection acting on expression divergence in nature. The rate of expression divergence would have been similar between laboratory and wild populations if gene expression were not under any selection (Kimura 1983
).
With these considerations, we reexamined the expression divergence between orthologous genes, based on the oligonucleotide microarray data of human and mouse genes generated by Su et al. (2004)
. Our choice of this data set is not only because it (or its earlier version) has been used in a number of evolutionary studies (Li 1997
; Makova and Li 2003
; Huminiecki and Wolfe 2004
; Yanai, Graur, and Ophir 2004
; Jordan, Marino-Ramirez, and Koonin 2005
; Yang, Su, and Li 2005
) but also because this data set is one of the largest for humans and mice, the expression divergence between which is of special importance to the biomedical community. Our analysis showed that there is a large error in measuring gene expression using microarrays. When this error is subtracted, the majority of orthologous genes show significantly lower expression-profile divergence between humans and mice than expected under neutrality.
| Materials and Methods |
|---|
|
|
|---|
Mapping Expression Data to Ensembl Genes
We used the Gene Atlas V2 microarray data set of humans and mice (http://symatlas.gnf.org/). The data set was generated by hybridization of RNA from 79 human and 61 mouse tissues onto the designed Affymetrix microarray chips (humans: U133A/GNF1H; mice: GNF1M) (Su et al. 2004
To compare our results with those of Yanai, Graur, and Ophir (2004)
, we also analyzed the expression data set used in their study, which is from Su et al. (2002)
. The results obtained from the two data sets were consistent with each other when the same method was used. We also used the expressional values computed by the robust multiarray averaging (RMA) (Bolstad et al. 2003
; Irizarry et al. 2003
). The results obtained from MAS 5.0 and RMA algorithms were similar. Thus, only the results derived from the data set Gene Atlas V2 and calculated by MAS 5.0 are presented here.
Human-Mouse Orthologs
Homology information of human and mouse genes was obtained from Ensembl EnsMart (http://www.ensembl.org/Multi/martview) (Kasprzyk et al. 2004
). There are several homologous relationships between human and mouse genes annotated by Ensembl. We only considered those pairs of genes annotated as unique best reciprocal hit (meaning that they were unique reciprocal best hits in all-against-all BlastZ searches) as orthologous. A total of 10,607 pairs of human-mouse orthologs have expression information from the microarray data we use. Among these genes, 64.5% of human genes and 86.9% of mouse genes were represented by a single probe set, while the others have multiple probe sets on the chips. Affymetrix probes with name suffixes _x_at and _s_at are believed to be prone to cross-hybridization, compared to other probes (Affymetrix Technical Support, Data Analysis Fundamentals, Appendix B; http://www.affymetrix.com/) and have been considered "suboptimal" (Huminiecki and Wolfe 2004
; Yang, Su, and Li 2005
). But our analysis showed that the quality of these probes is not worse than other probes (see below). We therefore considered all probe sets equally.
The number of synonymous substitutions per synonymous site (dS) and the number of nonsynonymous substitutions per nonsynonymous site (dN) between human and mouse orthologs were retrieved form Ensembl EnsMart. In this database, dS and dN were estimated by codeml of the PAML package (Yang 1997
) using the likelihood method.
Analysis of Gene Expression Data
For the purpose of studying the divergence of expression profiles between human-mouse orthologs, we analyzed 26 common tissues from the two species (see fig. 2 for the tissues examined; note that mouse lower spinal cord was used as the homologous tissue of human spinal cord).
|
For calculating the expressional divergence between a pair of orthologous genes, the expression signal in humans and that in mice must be comparable. Log2-transformed signal intensity (S) is commonly used to quantitatively measure the level of gene expression. But it has intrinsic problems for comparing expression data derived from different Affymetrix microarrays. First, probes are separately designed for the human and mouse orthologous genes and do not target the same sequences. Therefore, the human probes and mouse probes have different affinities to their target RNAs (Binder et al. 2004a
![]() | (1) |
and
are the expression signal intensities of gene i in human tissue j and mouse tissue j, respectively. When the RMA algorithm was used to measure expression, S was calculated by antilog of the default output value. It should be noted that by using RA we lose the information of the absolute expression level in all tissues, but as aforementioned, the absolute expression levels of orthologs are practically incomparable.
The divergence between expression profiles of human and mouse orthologous genes is measured by "1 Pearson's correlation coefficient (r)" and "Euclidean distance (d)" using the RA values of the 26 pairs of tissues. Pearson's r between human and mouse gene i is computed by
![]() | (2) |
![]() | (3) |
Our analysis showed that different probe sets on the same chip often give very different S values for a given gene in a given tissue. This difference is most likely due to the variation in affinity among probe sets for a given gene. Let dH denote the Euclidean distance between the expression profiles estimated by two randomly picked probe sets for the same human gene, dM denote the Euclidean distance for the corresponding mouse gene, and d be the Euclidean distance defined in equation (3). We estimate the net distance (D) between human and mouse orthologous genes by
![]() | (4) |
The tissue expression dendrograms were calculated from the matrix of distances among tissues, which were estimated from the RA values of 10,607 gene pairs in humans and mice. Pearson's correlation coefficient between human tissue j1 and mouse tissue j2 is computed by
![]() | (5) |
![]() | (6) |
| Results and Discussion |
|---|
|
|
|---|
The Mean Expression Divergence Between Human-Mouse Orthologs Is Lower Than That Between Random Gene Pairs
To study whether the expression profile is conserved between human and mouse orthologous genes, it is necessary to know the expected value of expression divergence under complete neutrality. Ideally, this value should be estimated using expressed pseudogenes. However, it is unlikely that a functionless pseudogene generated before the separation of primates and rodents would still be retained and expressed in humans and mice. Jordan, Marino-Ramirez, and Koonin (2005)
|
Errors in Measuring Gene Expression
Although the overall rate of transcriptome evolution between humans and mice appears lower than the neutral expectation, it is unclear what proportion of genes are under purifying selection in their expression evolution and how strong the selection is. To address this question, it is necessary to evaluate the error in measuring gene expression by microarrays. It is quite common that more than one probe set is used to represent a gene on an oligonucleotide microarray. Theoretically, if the same transcripts are targeted and if there is no cross-hybridization, all probe sets designed for the same gene should provide the same, or at least similar, expression signals. However, this is often not the case. For example, there are two probe sets on the human chip and two on the mouse chip for the gene RUTBC1. The S as well as RA values obtained from the two probe sets on the same chip are quite different even for the same tissues (fig. 2). In this example, the r between the RA values generated from the two mouse probe sets (0.23) is even lower than the average r between the RA values generated from a human probe set and a mouse probe set (0.40). In other words, the apparent low r between species is largely attributable to the estimation error of gene expression within species. For many of the 3,762 genes with multiple probe sets on the human chip, the r values between the expression profiles generated by two randomly picked probe sets of the same gene are much lower than 1 (fig. 3a). In fact, r has a mean of 0.375 and a median of 0.368. There are 1,385 genes with multiple probe sets on the mouse chip. The mean r is 0.277 and the median r is 0.235 between the expression profiles generated by two randomly picked probe sets of the same mouse gene (fig. 3a). These low r values show that the expression level is not precisely measured by the microarrays. Rather, there are large errors associated with the estimates. Similar results were obtained when d was used to measure the difference between expression profiles detected by different probe sets targeting the same gene (fig. 3b).
|
It is unclear what factors caused such a great variation between expressional levels detected by different probe sets. Affymetrix probes with name suffixes _x_at and _s_at are thought to be prone to cross-hybridization, compared to other probes, and have been considered suboptimal (Huminiecki and Wolfe 2004
Expression Profiles of 84% of Human-Mouse Orthologs Are Significantly Lower Than Expected Under Neutrality
Despite the high estimation error shown in figure 3, the expression differences between human and mouse orthologs are higher than those detected by different probe sets within species. This indicates that for many genes the expression profile is not completely conserved between the two species. For estimating the proportion of human-mouse orthologs that diverge significantly slower than expected under neutrality, Euclidean distance d is preferred over 1 r because the correlation coefficient r ignores any linear changes which may exist between expression profiles. We computed the net expression distance D between humans and mice by subtracting the expression distance between probe sets within species from the expression distance between species (see eq. 4 in Materials and Methods). This procedure is analogous to the estimation of the net genetic difference between populations by subtracting the variation within populations (Nei 1987
). D can be interpreted as the detectable expression divergence given the estimation error. Randomly paired human-mouse genes should have no expression similarity; thus, the Euclidean distances do not require correction. We found that d has a relatively wide distribution for random gene pairs (fig. 4). Five percent of d is smaller than d5% = 0.0897. If the D value of a human-mouse orthologous gene pair is smaller than d5%, we may claim that the expression divergence of this gene pair has been under selective constraint because the probability that the evolution has been neutral is lower than 5% (fig. 4). Using this criterion, we found that the detectable expression divergence of 83.9% of genes is significantly lower than expected under complete neutrality. A simple interpretation is that the expressions of these genes are under purifying selection. However, our result may also reflect the large estimation error of the current microarray technology and consequently low detectable expression divergence between species. More accurately, our findings suggest that at least for 84% of genes the current data do not provide evidence for neutrality. Note that this estimate was derived from 4,564 orthologous gene pairs in which multiple probe sets are available for at least one species so that the estimation error could be evaluated. Under the assumption that the probe design for a gene is independent of the rate of gene expression evolution, our result is applicable to the entire genome. We also computed the values of dH and dM by averaging the Euclidean distances of all possible combinations of probe sets of the same gene, instead of using two randomly picked probe sets. The results are very similar (Supplementary Fig. S2, Supplementary Material online).
|
Orthologous Tissues Between Species Are More Similar Than Nonorthologous Tissues in Terms of Expression Profile
It is expected that orthologous tissues between species (e.g., human liver and mouse liver) should have similar expression profiles because they carry out similar physiological functions. However, Yanai, Graur, and Ophir (2004)
|
Correlation Between the Rate of Expression-Profile Divergence and That of Coding Sequence Divergence
It has been controversial as whether there is a positive correlation between the rate of expression evolution and the rate of coding sequence evolution across many genes in a genome. In earlier studies, this question was addressed by comparing duplicate genes within species (Wagner 2000
|
We believe that the main reason why the positive correlation between expression-profile divergence and coding sequence divergence was not previously observed in the comparison of human and mouse orthologous genes (Yanai, Graur, and Ophir 2004
As mentioned, 1 r and d are commonly used to measure gene expression divergence. Compared to d, 1 r is more often adopted by evolutionists, such as in the studies of duplicated genes in yeast (Wagner 2000
; Gu et al. 2002
), nematode (Castillo-Davis, Hartl, and Achaz 2004
; Conant and Wagner 2004
), humans (Makova and Li 2003
; Huminiecki and Wolfe 2004
), mice (Huminiecki and Wolfe 2004
), and mustard (Haberer et al. 2004
). However, d reportedly performs better (Slonim et al. 2000
) and has been used to compare orthologous genes (Yanai, Graur, and Ophir 2004
; Jordan, Marino-Ramirez, and Koonin 2005
) and to cluster coexpressed genes (Wen et al. 1998
; de Bivort, Huang, and Bar-Yam 2004
). To our surprise, we did not observe positive correlations between 1 r and either dN, dS, or dN/dS. But this result is consistent with that of Jordan et al. (2004)
, although they used Spearman's rank correlation instead of Pearson's correlation (r) to measure the human-mouse expression-profile similarity. One may think that the expression divergences measured by d and 1 r should be positively correlated. However, the mathematical properties of d and 1 r are different. For example, any linear transformations of S do not affect r, while they may influence d. In addition, 1 r is bounded between 0 and 2, whereas d can increase infinitely. In our data, 1 r and d have a weak, but significant, negative correlation (Supplementary Fig. S3, Supplementary Material online). Although both measures are commonly used, which better describes the expression divergence between orthologous genes remains unanswered. It is possible that the advantages of these two measures vary depending on the conditions used. It is also important to note that neither 1 r nor d measures the number of genetic changes (i.e., number of nucleotide substitutions) underlying the observed expression-profile divergence. Because the molecular mechanism of gene expression regulation is complex and not well understood, no distance measures currently exist to quantify the genetic changes underlying expression-profile divergence (Leung and Cavalieri 2003
).
Final Remarks
There are two ways to compare the transcriptomes of two species using DNA oligonucleotide microarrays. The first approach is to use a single array to detect gene expression in multiple species, while the second approach is to use species-specific arrays. Using a single array is only applicable to closely related species and is subject to biases caused by interspecific sequence differences (Hsieh et al. 2003
; Preuss et al. 2004
; Gilad et al. 2005
). Using multiple species-specific arrays is applicable to any species pairs, but we found that the expression divergence between species is substantially overestimated. This overestimation results from the large variation in sensitivity among different probe sets. Thus, precise measurement of expression divergence between species is still a challenging task. cDNA microarrays have also been used to assess the expression divergence between species (Ranz et al. 2003
; Renn, Aubin-Horth, and Hofmann 2004
). Our method of analysis (e.g., use of RA instead of S) applies to cDNA array data as well. We think that advances in both microarray technology and statistical methodology are needed to better characterize the evolution of gene expression, which is central to our understanding of the mechanism of biological evolution (Rodriguez-Trelles, Tarrio, and Ayala 2005
).
| Supplementary Material |
|---|
|
|
|---|
Supplementary Figures S1, S2, and S3 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
Figure S1.Pairwise comparison of expression profiles of two probe sets of the same human genes. The 3,762 genes in group B contain both optimal and suboptimal probe sets, whereas the 1,097 genes in group B contain only optimal probe sets. The distributions show that the expression profiles detected by two probe sets of the same gene are more similar for group A genes than for group B genes (P < 1027, Mann-Whitney U test), implying that suboptimal probe sets produce more consistent expressional profiles than optimal probe sets.
Figure S2.Net distances (D) of expressional profiles between human and mouse orthologs and Euclidean distances (d) of random human-mouse gene pairs. The distribution of the random pairs represents the neutral expectation of expressional divergences. The black area left to the vertical dashed line (d5% = 0.0899) shows the 5% smallest d values. A total of 86.6% of 4,564 human-mouse orthologous genes have D smaller than d5%, suggesting that the detectable expression-profile divergence of 86.6% of genes is lower than the neutral expectation at the 5% significance level. In this figure, we computed the values of dH and dM by averaging the Euclidean distances of all possible combinations of probe sets of the same gene, instead of using two randomly picked probe sets as in figure 4.
Figure S3.Correlation between two measures of expression divergence between human-mouse orthologous genes. A total of 10,607 pairs of orthologs are used.
| Acknowledgements |
|---|
|
|
|---|
We thank Itai Yanai for providing details of their published microarray analysis, James McDonald for technical assistance, Steve Qin and Xionglei He for discussion, and Wendy Grus and Soochin Cho for valuable comments on an earlier version of the manuscript. This study was supported by research grants from National Institutes of Health and University of Michigan to J.Z.
| Footnotes |
|---|
Douglas Crawford, Associate Editor
| References |
|---|
|
|
|---|
Affymetrix. 2001. Technical note: new statistical algorithms for monitoring gene expression on GeneChip probe arrays. Affymetrix, Santa Clara, Calif.
Binder, H., T. Kirsten, I. L. Hofacker, P. F. Stadler, and M. Loeffler. 2004a. Interactions in oligonucleotide hybrid duplexes on microarrays. J. Phys. Chem. B 108:1801518025.[CrossRef]
Binder, H., T. Kirsten, M. Loeffler, and P. F. Stadler. 2004b. Sensitivity of microarray oligonucleotide probes: variability and effect of base composition. J. Phys. Chem. B 108:1800318014.[CrossRef]
Bolstad, B. M., R. A. Irizarry, M. Astrand, and T. P. Speed. 2003. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19:185193.
Brenner, S., M. Johnson, J. Bridgham et al. (24 co-authors). 2000. Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nat. Biotechnol. 18:630634.[CrossRef][Web of Science][Medline]
Castillo-Davis, C. I., D. L. Hartl, and G. Achaz. 2004. Cis-regulatory and protein evolution in orthologous and duplicate genes. Genome Res. 14:15301536.
Cavalieri, D., J. P. Townsend, and D. L. Hartl. 2000. Manifold anomalies in gene expression in a vineyard isolate of Saccharomyces cerevisiae revealed by DNA microarray analysis. Proc. Natl. Acad. Sci. USA 97:1236912374.
Conant, G. C., and A. Wagner. 2004. Duplicate genes and robustness to transient gene knock-downs in Caenorhabditis elegans. Proc. Biol. Sci. 271:8996.
de Bivort, B., S. Huang, and Y. Bar-Yam. 2004. Dynamics of cellular level function and regulation derived from murine expression array data. Proc. Natl. Acad. Sci. USA 101:1768717692.
Denver, D. R., K. Morris, J. T. Streelman, S. K. Kim, M. Lynch, and W. K. Thomas. 2005. The transcriptional consequences of mutation and natural selection in Caenorhabditis elegans. Nat. Genet. 37:544548.[CrossRef][Web of Science][Medline]
Enard, W., P. Khaitovich, J. Klose et al. (13 co-authors). 2002. Intra- and interspecific variation in primate gene expression patterns. Science 296:340343.
Gilad, Y., S. A. Rifkin, P. Bertone, M. Gerstein, and K. P. White. 2005. Multi-species microarrays reveal the effect of sequence divergence on gene expression profiles. Genome Res. 15:674680.
Gu, Z., D. Nicolae, H. H. Lu, and W. H. Li. 2002. Rapid divergence in expression between duplicate genes inferred from microarray data. Trends Genet. 18:609613.[CrossRef][Web of Science][Medline]
Haberer, G., T. Hindemitt, B. C. Meyers, and K. F. Mayer. 2004. Transcriptional similarities, dissimilarities, and conservation of cis-elements in duplicated genes of Arabidopsis. Plant Physiol. 136:30093022.
Hinds, H. L., C. T. Ashley, J. S. Sutcliffe, D. L. Nelson, S. T. Warren, D. E. Housman, and M. Schalling. 1993. Tissue specific expression of FMR-1 provides evidence for a functional role in fragile X syndrome. Nat. Genet. 3:3643.[CrossRef][Web of Science][Medline]
Hsieh, W. P., T. M. Chu, R. D. Wolfinger, and G. Gibson. 2003. Mixed-model reanalysis of primate data suggests tissue and species biases in oligonucleotide-based gene expression profiles. Genetics 165:747757.
Hubbell, E., W. M. Liu, and R. Mei. 2002. Robust estimators for expression analysis. Bioinformatics 18:15851592.
Huminiecki, L., and K. H. Wolfe. 2004. Divergence of spatial gene expression profiles following species-specific gene duplications in human and mouse. Genome Res. 14:18701879.
Irizarry, R. A., B. Hobbs, F. Collin, Y. D. Beazer-Barclay, K. J. Antonellis, U. Scherf, and T. P. Speed. 2003. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4:249264.[Abstract]
Jordan, I. K., L. Marino-Ramirez, and E. V. Koonin. 2005. Evolutionary significance of gene expression divergence. Gene 345:119126.[CrossRef][Web of Science][Medline]
Jordan, I. K., L. Marino-Ramirez, Y. I. Wolf, and E. V. Koonin. 2004. Conservation and coevolution in the scale-free human gene coexpression network. Mol. Biol. Evol. 21:20582070.
Kasprzyk, A., D. Keefe, D. Smedley, D. London, W. Spooner, C. Melsopp, M. Hammond, P. Rocca-Serra, T. Cox, and E. Birney. 2004. EnsMart: a generic system for fast and flexible access to biological data. Genome Res. 14:160169.
Khaitovich, P., I. Hellmann, W. Enard, K. Nowick, M. Leinweber, H. Franz, G. Weiss, M. Lachmann, and S. Paabo. 2005. Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science 309:18501854.
Khaitovich, P., G. Weiss, M. Lachmann, I. Hellmann, W. Enard, B. Muetzel, U. Wirkner, W. Ansorge, and S. Paabo. 2004. A neutral model of transcriptome evolution. PLoS Biol. 2:682689.
Kimura, M. 1983. The neutral theory of molecular evolution. Cambridge University Press, Cambridge.
Kumar, S., K. Tamura, and M. Nei. 2004. MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinform. 5:150163.
Leung, Y. F., and D. Cavalieri. 2003. Fundamentals of cDNA microarray data analysis. Trends Genet. 19:649659.[CrossRef][Web of Science][Medline]
Li, W.-H. 1997. Molecular evolution. Sinauer Associates, Sunderland, Mass.
Makova, K. D., and W. H. Li. 2003. Divergence in the spatial pattern of gene expression between human duplicate genes. Genome Res. 13:16381645.
Murtagh, F. D. 1985. "Multidimensional clustering algorithms" in CompStat lectures 4. Physica-Verlag,Vienna, Austria.
Nei, M. 1987. Molecular evolutionary genetics. Columbia University Press, New York.
Nei, M., and S. Kumar. 2000. Molecular evolution and phylogenetics. Oxford University Press, New York.
Oleksiak, M. F., G. A. Churchill, and D. L. Crawford. 2002. Variation in gene expression within and among natural populations. Nat. Genet. 32:261266.[CrossRef][Web of Science][Medline]
Pal, C., B. Papp, and L. D. Hurst. 2001. Highly expressed genes in yeast evolve slowly. Genetics 158:927931.
Preuss, T. M., M. Caceres, M. C. Oldham, and D. H. Geschwind. 2004. Human brain evolution: insights from microarrays. Nat. Rev. Genet. 5:850860.[CrossRef][Web of Science][Medline]
Ranz, J. M., C. I. Castillo-Davis, C. D. Meiklejohn, and D. L. Hartl. 2003. Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science 300:17421745.
Renn, S. C., N. Aubin-Horth, and H. A. Hofmann. 2004. Biologically meaningful expression profiling across species using heterologous hybridization to a cDNA microarray. BMC Genomics 5:42.[CrossRef][Medline]
Rifkin, S. A., J. Kim, and K. P. White. 2003. Evolution of gene expression in the Drosophila melanogaster subgroup. Nat. Genet. 33:138144.[CrossRef][Web of Science][Medline]
Rocha, E. P., and A. Danchin. 2004. An analysis of determinants of amino acids substitution rates in bacterial proteins. Mol. Biol. Evol. 21:108116.
Rodriguez-Trelles, F., R. Tarrio, and F. J. Ayala. 2005. Is ectopic expression caused by deregulatory mutations or due to gene-regulation leaks with evolutionary potential? Bioessays 27:592601.[CrossRef][Web of Science][Medline]
Ruan, Y., P. Le Ber, H. H. Ng, and E. T. Liu. 2004. Interrogating the transcriptome. Trends Biotechnol. 22:2330.[CrossRef][Web of Science][Medline]
Slonim, D., P. Tamayo, J. P. Mesirov, T. Golub, and E. Lander. 2000. Class prediction and discovery using gene expression data. Pp. 263272. Proceedings of the 4th annual international conference on computational molecular biology (RECOMB 2000). Universal Academy Press, Tokyo, Japan.
Su, A. I., M. P. Cooke, K. A. Ching et al. (14 co-authors). 2002. Large-scale analysis of the human and mouse transcriptomes. Proc. Natl. Acad. Sci. USA 99:44654470.
Su, A. I., T. Wiltshire, S. Batalov et al. (13 co-authors). 2004. A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl. Acad. Sci. USA 101:60626067.
Townsend, J. P., D. Cavalieri, and D. L. Hartl. 2003. Population genetic variation in genome-wide gene expression. Mol. Biol. Evol. 20:955963.
Velculescu, V. E., L. Zhang, B. Vogelstein, and K. W. Kinzler. 1995. Serial analysis of gene expression. Science 270:484487.
Wagner, A. 2000. Decoupled evolution of coding region and mRNA expression patterns after gene duplication: implications for the neutralist-selectionist debate. Proc. Natl. Acad. Sci. USA 97:65796584.
Wen, X., S. Fuhrman, G. S. Michaels, D. B. Carr, S. Smith, J. L. Barker, and R. Somogyi. 1998. Large-scale temporal gene expression mapping of central nervous system development. Proc. Natl. Acad. Sci. USA 95:334339.
Wyckoff, G. J., C. M. Malcom, E. J. Vallender, and B. T. Lahn. 2005. A highly unexpected strong correlation between fixation probability of nonsynonymous mutations and mutation rate. Trends Genet. 21:381385.[CrossRef][Web of Science][Medline]
Yanai, I., D. Graur, and R. Ophir. 2004. Incongruent expression profiles between human and mouse orthologous genes suggest widespread neutral evolution of transcription control. OMICS 8:1524.[CrossRef][Web of Science][Medline]
Yang, J., A. I. Su, and W.-H. Li. 2005. Gene expression evolves faster in narrowly than in broadly expressed mammalian genes. Mol. Biol. Evol. 22:21132118.
Yang, Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13:555556.
Zhang, J., and X. He. 2005. Significant impact of protein dispensability on the instantaneous rate of protein evolution. Mol. Biol. Evol. 22:11471155.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Lu, P. Huggins, and Z. Bar-Joseph Cross species analysis of microarray expression data Bioinformatics, June 15, 2009; 25(12): 1476 - 1483. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Lin, S. Liu, H. Brockway, J. Seok, P. Jiang, W. H. Wong, and Y. Xing Using high-density exon arrays to profile gene expression in closely related species Nucleic Acids Res., May 27, 2009; (2009) gkp420v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. V. Koonin Darwinian evolution in the light of genomics Nucleic Acids Res., March 1, 2009; 37(4): 1011 - 1034. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-Y. Liao and J. Zhang Coexpression of Linked Genes in Mammalian Genomes Is Generally Disadvantageous Mol. Biol. Evol., August 1, 2008; 25(8): 1555 - 1565. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-Y. Liao and J. Zhang Null mutations in human and mouse orthologs frequently result in different phenotypes PNAS, May 13, 2008; 105(19): 6987 - 6992. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R Kensche, V. van Noort, B. E Dutilh, and M. A Huynen Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution J R Soc Interface, February 6, 2008; 5(19): 151 - 170. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Lu, X. He, and S. Zhong Cross-species microarray analysis with the OSCAR system suggests an INSR->Pax6->NQO1 neuro-protective pathway in aging and Alzheimer's disease Nucleic Acids Res., July 13, 2007; 35(suppl_2): W105 - W114. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Osato, Y. Suzuki, K. Ikeo, and T. Gojobori Transcriptional Interferences in cis Natural Antisense Transcripts of Humans and Mice Genetics, June 1, 2007; 176(2): 1299 - 1306. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xing, Z. Ouyang, K. Kapur, M. P. Scott, and W. H. Wong Assessing the Conservation of Mammalian Gene Expression Using High-Density Exon Arrays Mol. Biol. Evol., June 1, 2007; 24(6): 1283 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Gu and Z. Su Tissue-driven hypothesis of genomic evolution and sequence-expression correlations PNAS, February 20, 2007; 104(8): 2779 - 2784. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-Y. Liao, N. M. Scott, and J. Zhang Impacts of Gene Essentiality, Expression Pattern, and Gene Compactness on the Evolutionary Rate of Mammalian Proteins Mol. Biol. Evol., November 1, 2006; 23(11): 2072 - 2080. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-Y. Liao and J. Zhang Low Rates of Expression Profile Divergence in Highly Expressed Genes and Tissue-Specific Genes During Mammalian Evolution Mol. Biol. Evol., June 1, 2006; 23(6): 1119 - 1128. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

















