Molecular Biology and Evolution 18:691-699 (2001)
© 2001 Society for Molecular Biology and Evolution
ARTICLE |
A General Empirical Model of Protein Evolution Derived from Multiple Protein Families Using a Maximum-Likelihood Approach
Department of Zoology, University of Cambridge, Cambridge, England
| Abstract |
|---|
|
|
|---|
Phylogenetic inference from amino acid sequence data uses mainly empirical models of amino acid replacement and is therefore dependent on those models. Two of the more widely used models, the Dayhoff and JTT models, are estimated using similar methods that can utilize large numbers of sequences from many unrelated protein families but are somewhat unsatisfactory because they rely on assumptions that may lead to systematic error and discard a large amount of the information within the sequences. The alternative method of maximum-likelihood estimation may utilize the information in the sequence data more efficiently and suffers from no systematic error, but it has previously been applicable to relatively few sequences related by a single phylogenetic tree. Here, we combine the best attributes of these two methods using an approximate maximum-likelihood method. We implemented this approach to estimate a new model of amino acid replacement from a database of globular protein sequences comprising 3,905 amino acid sequences split into 182 protein families. While the new model has an overall structure similar to those of other commonly used models, there are significant differences. The new model outperforms the Dayhoff and JTT models with respect to maximum-likelihood values for a large majority of the protein families in our database. This suggests that it provides a better overall fit to the evolutionary process in globular proteins and may lead to more accurate phylogenetic tree estimates. Potentially, this matrix, and the methods used to generate it, may also be useful in other areas of research, such as biological sequence database searching, sequence alignment, and protein structure prediction, for which an accurate description of amino acid replacement is required.
Key Words: amino acid replacement general reversible model maximum likelihood protein evolution
Key Words: amino acid replacement general reversible model maximum likelihood protein evolution
| Introduction |
|---|
|
|
|---|
The majority of likelihood methods used for reconstructing phylogenies from amino acid sequences rely on empirical models of protein evolution. These models need good replacement matrices, which represent the relative rates of amino acid replacement at homologous sites in a protein, to accurately estimate the true evolutionary distances and relationships among species. Unfortunately, none of the current methods used to estimate these replacement matrices are entirely satisfactory.
Dayhoff and colleagues (Dayhoff, Eck, and Park 1972
; Dayhoff, Schwartz, and Orcutt 1978
) used a parsimony-based counting method to generate accepted point mutation (PAM) matrices from the limited amount of protein sequence data available at the time. To achieve this, phylogenetic trees were estimated for multiple protein families, along with the ancestral sequences within those trees, using maximum parsimony (MP). This information was then used to estimate the relative rates of all amino acid replacements by simply counting both the inferred numbers of different amino acid replacements that occurred on all of the lineages of the trees and the numbers of occasions on which no change in amino acids was inferred.
Jones, Taylor, and Thornton (1992)
applied a faster, automated procedure based on Dayhoff and colleagues' (Dayhoff, Eck, and Park 1972
; Dayhoff, Schwartz, and Orcutt 1978
) approach and used it to produce a replacement matrix from a much larger database. After estimating the phylogenetic tree for each protein family in the database, their method selected a pair of sequences from a phylogeny that were nearest-neighbors and were >85% identical and counted the differences between them. The pair of sequences was then discarded to avoid recounting changes on any given branch of a phylogeny. This process was repeated for all such pairs of sequences in all protein families from their database. The 85% identity rule was used to reduce the number of multiple changes recorded as single replacements. Both of these approaches, which we refer to as counting methods, produce matrices of counts that may be used to estimate Markov process models of amino acid replacement (Swofford et al. 1996
; Liò and Goldman 1998
). The two models described above, the most widely used for the phylogenetic analysis of amino acid sequences of globular proteins, are both estimated using these counting methods and are known as the Dayhoff model (Dayhoff, Schwartz, and Orcutt 1978
) and the JTT model (Jones, Taylor, and Thornton 1992
).
The counting methods effectively employ MP to estimate amino acid replacement matrices and are therefore susceptible to its inherent problems. In particular, MP intrinsically assumes that for any given site in an alignment, only one change takes place along any single branch in a tree. This can lead to a serious underestimation of the true number of replacements that have occurred in branches where multiple changes have occurred and may consequently lead to systematic error in any model estimated using counts of replacements. In addition, MP inferences of ancestral sequences may introduce still further inaccuracies (Yang, Kumar, and Nei 1995
). The Dayhoff model may be affected by both of these problems. The 85% rule of the JTT method attempts to reduce the impact of these problems by reducing the expected number of multiple hits that are neglected. Without completely solving the problem, this also renders the method very wasteful because it discards all of the information available in the sequences that are <85% identical. The JTT method avoids making inferences of ancestral sequences, but at the further cost of using an inefficient method for avoiding the repeated counting on branches of phylogenetic trees, discarding many sequences which have >85% identity only with previously used sequences.
Adachi and Hasegawa (1996)
, Yang, Nielsen, and Hasegawa (1998)
, and Adachi et al. (2000)
used maximum-likelihood (ML) methods to estimate models of amino acid replacement for vertebrate mitochondrial, mammalian mitochondrial, and chloroplast sequences, respectively. For an alignment of sequences related by a single phylogenetic tree, the amino acid replacement matrix that gave the highest likelihood was found simultaneously with the optimal phylogeny and branch lengths. This ML approach avoids the problems associated with the counting methods by using all of the information available in the sequences across all levels of homology and by having a model that explicitly allows multiple changes to occur on a single branch at any site in an alignment. Unfortunately ML, while providing a more reliable estimate of a model of replacement than the counting methods, has a much greater computational burden associated with it. The time each individual likelihood calculation takes and the number of calculations required to numerically maximize the likelihood increase significantly with each sequence added to an analysis. This has meant that relatively few sequences, each consisting of a number of concatenated genes available for all of the organisms studied, have been included in previous analyses: Adachi and Hasegawa (1996)
analyzed 20 sequences, each of 3,357 residues; Yang, Nielsen, and Hasegawa (1998)
used 23 sequences of similar lengths; and Adachi et al. (2000)
used just 10 sequences, each of 9,957 residues. This may restrict the accuracy of the resulting models or the variety of proteins for which the models are subsequently found to be useful (see also P. Liò and N. Goldman, unpublished data
).
Here, we combine the best attributes of the likelihood and counting methods to estimate a model of amino acid replacement from a large database of different globular protein families using an approximation to ML. This model should provide a better estimate of the evolutionary process than existing models estimated using counting methods and be applicable to phylogenetic studies of a much broader range of protein sequences than existing models estimated using the likelihood approach.
| Models of Amino Acid Replacement |
|---|
|
|
|---|
The Amino Acid Replacement Matrix
All the models discussed in this paper assume that all amino acid sites in an alignment evolve independently and according to the same Markov process. The Markov process is assumed to be both stationary and homogeneous, so that the amino acid frequencies and the model of evolution are assumed constant through time and across all sites in an alignment. Additionally, the Markov process is assumed to be reversible, implying that to an observer it would appear the same going backwards in time as it would going forward. The probability of amino acid i being replaced by amino acid j over time T is Pij(T), where i and j take the values 1, 2, ... , 20, representing the 20 different amino acids. These probabilities can be written as a 20 x 20 matrix, P(T), which is calculated as P(T) = exp(TQ), where Q is the rate matrix, with off-diagonal elements Qij being the instantaneous rates of change of amino acid i to amino acid j and with diagonal elements Qii being fixed so that the row sums of Q equal 0. The off-diagonal elements of the matrix Q can be described by the off-diagonal elements of the matrix product
|
| (1) |
i. The variables sij represent the exchangeabilities of amino acid pairs (i, j). Time reversibility is imposed by placing the restriction that sij
sji (as above), resulting in 190 such parameters. Empirically derived models of amino acid replacement describe the evolutionary process by fixing these exchangeabilities to values that have been estimated from a large amount of data. When performing likelihood calculations on a tree, the matrix Q is scaled to provide meaningful branch lengths (fixing -
i
iqii = 1 means that evolutionary distances T are measured in units of expected numbers of replacements per site), and this effectively removes one parameter, leaving 189 free parameters describing relative amino acid exchangeability.
The
i values represent the equilibrium or stationary frequencies of the 20 amino acids. These frequencies may all be set to 1/20 or may be set to the values estimated from the original data used to estimate the sij values. These applications are now relatively rare in phylogenetics (and are not used in this paper), and the
i are more typically estimated as being equal to the proportions of the amino acids as observed in a data set under phylogenetic analysis (Cao et al. 1994
). When the frequencies are estimated from the data in this way, model names are generally given the suffix "+F"; e.g., JTT+F would use sij as estimated by Jones, Taylor, and Thornton (1992)
and the
i observed in the data set under analysis. The 20 amino acid frequencies can be described by 19 free parameters because of the constraint
i
i = 1 and, in effect, weight sij according to sequences' amino acid compositions.
All of the standard models of evolution used in this paper have previously been well documented (e.g., Swofford et al. 1996
; Liò and Goldman 1998
). The simplest model of amino acid evolution is the equiprobable (EQU) model, which assumes that all the exchangeability parameters sij are equal and sets all of the stationary frequencies to 1/20. The EQU+F form of this model allows the stationary frequencies to equal the proportions of the different amino acids observed in the data. The Dayhoff (Dayhoff, Schwartz, and Orcutt 1978
) and JTT (Jones, Taylor, and Thornton 1992
) models, which in their Dayhoff+F and JTT+F forms will be compared with our new models, have values of sij which have been estimated from large databases using counting methods. The mitochondrial- and chloroplast-specific models of Adachi and Hasegawa (1996)
, Yang, Nielsen, and Hasegawa (1998)
, and Adachi et al. (2000)
, which are not appropriate for direct comparison with our new model because of their sequence-specificity, have each had their 189 free sij parameters estimated by direct likelihood maximization for relatively few protein sequences and using a single evolutionary tree.
Assumptions Needed to Estimate a Model Using Multiple Phylogenies
The simultaneous use of many different protein families implies that an estimated model may be applicable to a wide range of proteins (as are the Dayhoff and JTT models), and the use of the likelihood approach to perform this estimation suggests that it may more accurately reflect the evolutionary process by avoiding systematic error and utilizing more of the available data. In order to estimate an empirical model of amino acid replacement simultaneously from many families of sequences, we have developed a new approximation to the likelihood approach, exploiting two observations about the ML estimation of parameters on phylogenetic trees. First, it has been shown that parameters describing the evolutionary process remain relatively constant across near-optimal tree topologies (e.g., Yang, Goldman, and Friday 1994, 1995
; Sullivan, Holsinger, and Simon 1996
; Yang, Nielsen, and Hasegawa 1998
; Adachi et al. 2000
). We exploit this by assuming it to be the case for the parameters used to describe amino acid replacement, in particular, assuming that the relative values of the amino acid exchangeability parameters sij stay approximately constant over near-optimal branch lengths and tree topologies. The implication of this assumption is that so long as branch lengths are close enough to optimal when estimating the new model, any changes in the branch lengths observed when they are reestimated under the new model would not influence the model estimated to any great extent.
The second observation relates to changes in individual branch lengths that occur when performing ML estimation under two different models of evolution for a single-tree topology. When the two models are quite different, the ML branch lengths they give can be quite different (demonstrated by comparing the statistics shown in table 3
for either the EQU or the EQU+F model of evolution with those for either the Dayhoff+F or the JTT+F model). When two models are alike in their abilities to describe the evolutionary process, however, there is often much less difference in the branch lengths (e.g., Yang, Nielsen, and Hasegawa 1998
; also illustrated by comparing the statistics shown in table 3
for the EQU and EQU+F models or those for the Dayhoff+F and JTT+F models). We exploit the relatively small changes in branch lengths under "similarly good" models of evolution by assuming that the JTT+F model is capable of providing near-optimal branch lengths for the best general model of evolution (yet to be estimated).
|
Calculating a Likelihood Using Multiple Phylogenies
In order to calculate a likelihood for a complete database of aligned protein families, each protein family was taken in turn, and all pairwise phylogenetic distances between the sequences were estimated using the Dayhoff+F model. These distances were used to estimate phylogenetic tree topologies using neighbor-joining (Saitou and Nei 1987
Rather than completely fixing these estimates of the branch lengths during the estimation of the amino acid replacement model, only the ratios of branch lengths were fixed, and a scaling factor
was introduced which allowed all branch lengths to increase or decrease linearly. This parameter makes some allowance for any unforeseen changes in branch lengths between the JTT+F model and the new model being estimated, which could occur if the assumptions discussed above regarding branch lengths were invalid. As we assume that the families' topologies and relative branch lengths are now fixed at near-optimal values, the overall log-likelihood for the database can be calculated as
|
| (2) |
i, and the scaling factor
. To find the ML model of evolution, we need only maximize log L over M in equation (3) while fixing the parameters associated with T, as our assumptions mean that the resulting model will be close to that obtained by maximizing equation (2)
over both M and T. This dramatically reduces the computational time required for large amounts of data because most of the parameters that would normally require optimization are located within T.
Application to Real Data
This method was used to estimate a general model of amino acid replacement from the BRKALN database of aligned protein sequence families (D. Jones, unpublished data). This database has previously been used to estimate amino acid replacement models specific to different protein secondary structures (e.g., Goldman, Thorne, and Jones 1996
), and we used 3,905 sequences split into 182 protein families, each containing no more than 100 sequences. The amino acid frequencies for the entire database are shown in table 1
.
|
As described so far, the methods above assume that a single set of stationary frequencies is sufficient to describe the evolution of all of the protein families in a database. Different protein families may, however, be expected to have different amino acid compositions due to a variety of biochemical factors, such as differing cellular environments or variable proportions of protein secondary-structure elements. Two different estimation methods were used to address this question. The first method used a single set of stationary frequencies (19 free parameters) for all protein families, estimated by counting the amino acids observed in the database (as in table 1 ), with the remaining parameters of M (189 free exchangeability parameters sij and the scaling factor
) estimated by ML. The resulting replacement model is called the WAG model, after the authors of this paper. The second method used a different set of stationary frequencies for each protein family (19 x 182 = 3,458 free parameters, estimated by counting the amino acids observed in each family), with the remaining parameters of M again estimated by ML. In this case, the resulting model is called the WAG* model.
Both the WAG and the WAG* models required only 190 parameters to be numerically optimized, compared with >7,500 (before even considering optimization over topologies) under the traditional likelihood approach. This optimization was still computationally slow: estimation of the WAG model took approximately 18 h on a Digital 600au Personal Workstation. To avoid local maxima, estimations of the WAG and WAG* models were each performed using two sets of starting values, those of the EQU and the JTT models; the same estimates were recovered in each case. We found it computationally impractical to estimate the stationary frequency parameters
i by ML (Yang and Roberts 1995
) simultaneously with the estimation of the sij. Given the large size of the BRKALN database, we expect that any differences in the estimated
i would be small and that any consequent differences in the estimated sij would be insignificant.
| Results |
|---|
|
|
|---|
Comparison of the Overall Performance of the New Models with Other Commonly Used Models
Log-likelihood values (eq. 3) for the two newly estimated models and other commonly used models of amino acid replacement were calculated for the complete database and are shown in table 2 . Each model was applied in both the +F form, with one set of amino acid frequencies estimated from the entire database and applied to the analysis of all protein families, and in a form denoted +mF (multiple frequencies), with a different set of amino acid frequencies estimated for each family. In all cases, models that used multiple sets of stationary frequencies were significantly better than the equivalent model using only a single set of stationary frequencies (likelihood ratio test; twice the log-likelihood difference compared with a
2 distribution with 3,458 - 19 = 3,439 dfsee Yang, Goldman, and Friday 1994
|
Most importantly, both of the new models (WAG and WAG*) have higher likelihoods than any of the other models. Statistical comparisons of these models against the JTT model can be made by comparing twice the log likelihood differences given in table 2 with a
2190 distribution, with the 190 df being derived from the 190 parameters sij and
estimated during the generation of the WAG* and WAG models. In all cases, (JTT+F vs. WAG*+F or WAG+F; JTT+mF vs. WAG*+mF or WAG+mF), the WAG* and WAG models give a much better fit to the data: using a normal approximation to the
2190 distribution (Lindgren 1976We also note that, even after making allowance for the estimation of 190 additional parameters, the improvement in likelihood of the WAG* or WAG model over the JTT model is greater than the improvement of the JTT model over the Dayhoff model. This suggests that the improvement achieved by estimating a model of evolution using our new method may be at least as great as the improvement obtained by using a larger database from which to estimate a model of evolution, which is the main detail in which the Dayhoff and JTT models differ.
When the WAG* and the WAG models are compared, neither appears clearly better than the other in examining the whole database of families. As expected, each performs best for the analysis conditions it was optimized for, with WAG giving a better likelihood when using a single set of stationary frequencies (+F option) and WAG* performing better for multiple sets of stationary frequencies (+mF). When the two models are compared using equivalent methods (both +F or both +mF) for calculating the stationary frequencies, their log-likelihood values are very similar (changing by approximately 0.01%), suggesting that there is little difference between the two models.
The branch length scaling factor
was estimated as 1.027 during the generation of the WAG model. This suggests that the likelihood maximization procedure was not trying to change the branch lengths dramatically and that the approximations used were valid. While we note that this scaling factor may not detect nonlinear changes in branch lengths (i.e., changes not proportional to the original lengths), results obtained by the reestimation of the branch lengths and a subsequently reestimated model give no indication of this occurring (see below).
Performance of New Models on Specific Phylogenies
The new models' performance when estimating individual phylogenies is of more practical relevance than their performance when estimating a likelihood for an entire database. To give an example of the improvement in fit to the data that might be achieved with our new models, an alignment of 18 Lepidopteran sequences of the phosphoenolpyruvate carboxykinase protein (Friedlander et al. 1996
; Goldman, Thorne, and Jones 1998
) was chosen as a typical example of data used to perform a phylogenetic analysis. Some statistics of the ML trees under different replacement models are shown in table 3
. From these statistics, it is clear that the use of the WAG+F or WAG*+F model results in a considerably higher likelihood value than any of the other models. The difference between the two new models is very small. There is some change in the branch length statistics between JTT and the new models, although it does not appear large enough to invalidate the assumptions used to estimate the WAG and WAG* models. Note that the phosphoenolpyruvate carboxykinase family is not represented in the BRKALN database, and so there is no possibility of the WAG and WAG* models having any unfair advantage. In this example, the WAG and WAG* models of sequence evolution are superior, and, in general, we expect the use of the models giving the best fit to the observed data to lead to more accurate phylogenetic estimation. Even small changes in branch lengths may lead to changes in optimal ML tree topology, resulting in an estimated tree being closer to the true evolutionary tree.
In order to demonstrate this improvement in performance for the whole BRKALN database, ML values were calculated for each protein family under the JTT+F, WAG+F, and WAG*+F models, this time fixing the evolutionary models and tree topologies but reestimating the branch lengths for each family's phylogeny. Figure 1A and B
shows that the majority of protein families (146 out of 182, or 80%) had higher likelihoods when analyzed under either the WAG+F or the WAG*+F model than under the JTT+F model. The protein families whose likelihoods were higher under the JTT+F model were examined in more detail and compared with the families whose likelihoods were higher under the WAG+F and WAG*+F models to see if there was any common feature defining which model was preferred. It was found that families for which the JTT+F model gave a higher likelihood had relatively shorter average branch lengths than those families for which the new models gave higher likelihoods (data not shown). This may be the result of Jones, Taylor, and Thornton's (1992)
counting method of replacement matrix estimation using only sequences that are >85% identical to estimate the JTT model and perhaps consequently overfitting the model to relatively short evolutionary distances.
|
Figure 1A and B shows that the increases in performance of the WAG+F and WAG*+F models compared with the JTT+F model are very similar. To demonstrate the relative performance of the two new models, the increase in likelihood of the WAG*+F model over the WAG+F model for each individual protein family is shown in figure 1C. It is apparent that the difference in likelihood between the two models is minimal for the majority of the families: log-likelihood differences between the WAG+F and WAG*+F models are <1 for 102 of the 182 families; 95 families lie above the x-axis in figure 1C, and 87 lie below. The few cases in which the likelihoods were clearly different between the two models are located toward the highest log-likelihood values (i.e., the left-hand side of fig. 1C ), with the WAG model clearly outperforming the WAG* model. These cases were investigated in more detail, and it was found that these families consisted of only two very similar sequences. This suggests that the largest differences in likelihood between the WAG and the WAG* models were caused by one or two differences in the amino acid replacement matrices of the two models coinciding with differences between two closely related sequences and can thus be attributed to chance effects. We conclude that the overall difference between the two models' performances is negligible, and the additional parameters (and computation time) used when estimating the WAG* model are not required for model estimation from these data.
Comparison of the Structure of the New Models with Those of Other Commonly Used Models
A comparison of the differences in the patterns of amino acid replacement between the empirically derived Dayhoff, JTT, WAG, and WAG* models of evolution is shown in figure 2
. From these graphs, it is clear that the overall structures of the four models are similar, which suggests that they are all modeling the same process. Closer examination shows no discernible pattern to the differences in the values of the parameters sij of the amino acid replacement matrices of the JTT and WAG models; this is illustrated in figure 3
. There is almost no difference between the values in the replacement matrices of the WAG and WAG* models. The exchangeability parameters sij defining the WAG and WAG* models are available via http://www.zoo.cam.ac.uk/zoostaff/goldman/WAG.
|
|
Testing the Adequacy of Approximations
The methodology presented here may be considered similar to a single round of optimization in the algorithms often used to maximize a likelihood for a single given phylogenetic tree, which involve alternating cycles of branch length optimization and model optimization. In our methodology, we first optimize branch lengths for multiple families under a fixed model and then find the optimal model using those branch lengths. It is therefore of interest to see whether further rounds of optimization in our methodology would provide any substantial increase in likelihood and, consequently, a better fit to the data. We performed a second round of optimization, involving a single reestimation of all of the branch lengths for each individual protein family using the WAG+F model followed by the reestimation of the replacement model using these branch lengths. This reestimated model was then used to examine the individual families in the BRKALN database. This resulted in only trivial changes to branch lengths, estimated parameter values, and likelihood values (e.g., an average increase in log-likelihood of only 0.026 per family), and from this we conclude that second and subsequent rounds of iteration are unnecessary to get a good estimate of the optimal evolutionary model.
| Discussion |
|---|
|
|
|---|
The methodology presented here allows the estimation of a model of amino acid replacement from large numbers of sequences from many different families. By doing so, it combines the best attributes of the counting methods of Dayhoff, Schwartz, and Orcutt (1978)
The newly estimated WAG and WAG* models both gave significantly higher likelihoods than any other commonly used models when used to assess phylogenies for all 182 protein families of the BRKALN database simultaneously and for a large majority of the individual phylogenies examined. It was unclear which of the two new models performed better, but we would tentatively suggest that in this case the methodology used to estimate the WAG model was preferable because it involved fewer parameters being estimated from the amino acid sequence database. We note some change in the estimated branch lengths under the new models of evolution. The better statistical fit of our new models to the data suggests that in many cases they may provide more accurate estimates of phylogenetic trees than existing models, although differences in branch length estimates do not appear so great as to invalidate the assumptions used in our method for estimating models.
We hope that the WAG and WAG* models of amino acid replacement will be of value in phylogenetic analyses of amino acid sequences as potentially superior alternatives to the Dayhoff and JTT models. Both our new methodology and models produced using it may have further applications outside of phylogenetics, in fields that rely on accurate descriptions of amino acid replacement, such as protein structure prediction, the detection of sequence homology (including database searching), and sequence alignment.
| Acknowledgements |
|---|
|
|
|---|
S.W. was supported by a BBSRC studentship. N.G. was supported by a Wellcome Trust Biodiversity Fellowship. Further details of algorithms used and source code are available on request from the authors, and the WAG and WAG* matrices are available in electronic form via http://www.zoo.cam.ac.uk/zoostaff/goldman/WAG. The WAG model is implemented in the PAML (Yang 1997
| Footnotes |
|---|
William Martin, Reviewing Editor
2 Address for correspondence and reprints: Simon Whelan, Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, United Kingdom. s.whelan{at}zoo.cam.ac.uk ![]()
| literature cited |
|---|
|
|
|---|
Adachi, J., and M. Hasegawa. 1996. Model of amino acid substitution in proteins encoded by mitochondrial DNA. J. Mol. Evol. 42:459468.[Web of Science][Medline]
Adachi, J., P. J. Waddell, W. Martin, and M. Hasegawa. 2000. Plastid genome phylogeny and a model of amino acid substitution for proteins encoded by chloroplast DNA. J. Mol. Evol. 50:348358.[Web of Science][Medline]
Cao, Y., J. Adachi, A. Janke, S. Pääbo, and M. Hasegawa. 1994. Phylogenetic relationships among eutherian orders estimated from inferred sequences of mitochondrial proteins: instability of a tree based on a single gene. J. Mol. Evol. 39:519527.[Web of Science][Medline]
Dayhoff, M. O., R. V. Eck, and C. M. Park. 1972. A model of evolutionary change in proteins. Pp. 8999 in M. O. Dayhoff, ed. Atlas of protein sequence and structure. Vol. 5. National Biomedical Research Foundation, Washington, D.C.
Dayhoff, M. O., R. M. Schwartz, and B. C. Orcutt. 1978. A model of evolutionary change in proteins. Pp. 345352 in M. O. Dayhoff, ed. Atlas of protein sequence and structure. Vol. 5, suppl. 3. National Biomedical Research Foundation, Washington, D.C.
Felsenstein, J. 1995. PHYLIP (phylogenetic inference package). Version 3.57. Distributed by the author, Department of Genetics, University of Washington, Seattle.
Friedlander, T. P., J. C. Regier, C. Mitter, and D. L. Wagner. 1996. A nuclear gene for higher-level phylogeneticsphosphoenolpyruvate carboxykinase tracks Mesozoic-age divergences within Lepidoptera (Insecta). Mol. Biol. Evol. 13:594604.[Abstract]
Goldman, N., J. L. Thorne, and D. T. Jones. 1996. Using evolutionary trees in protein secondary structure prediction and other comparative sequence analyses. J. Mol. Biol. 263:196208.[Web of Science][Medline]
. 1998. Assessing the impact of secondary structure and solvent accessibility on protein evolution. Genetics 149:445458.
Jones, D. T., W. R. Taylor, and J. M. Thornton. 1992. The rapid generation of mutation data matrices from protein sequences. CABIOS 8:275282.
Lindgren, B. W. 1976. Statistical theory. 3rd edition. Macmillan, New York.
Liò, P., and N. Goldman. 1998. Models of molecular evolution and phylogeny. Genome Res. 8:12331244.
Saitou, N., and M. Nei. 1987. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406425.[Abstract]
Strimmer, K., and A. von Haeseler. 1996. Quartet puzzling: a quartet maximum-likelihood method for reconstructing tree topologies. Mol. Biol. Evol. 13:964969.[Web of Science]
Sullivan, J., K. E. Holsinger, and C. Simon. 1996. The effect of topology on estimation of among site rate variation. J. Mol. Evol. 42:308312.[Web of Science][Medline]
Swofford, D. L., G. J. Olsen, P. J. Waddell, and D. M. Hillis. 1996. Phylogenetic inference. Pp. 407514 in D. M. Hillis, C. Moritz, and B. K. Mable, eds. Molecular systematics. Sinauer, Sunderland, Mass.
Yang, Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. CABIOS 13:555556.
Yang, Z., N. Goldman, and A. Friday. 1994. Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation. Mol. Biol. Evol. 11:316324.[Abstract]
. 1995. Maximum likelihood trees from DNA sequences: a peculiar statistical estimation problem. Syst. Biol. 44:384399.
Yang, Z., S. Kumar, and M. Nei. 1995. A new method of inference of ancestral nucleotide and amino acid sequences. Genetics 141:16411650.
Yang, Z., R. Nielsen, and M. Hasegawa. 1998. Models of amino acid substitution and applications to mitochondrial protein evolution. Mol. Biol. Evol. 15:16001611.[Abstract]
Yang, Z., and D. Roberts. 1995. On the use of nucleic acid sequences to infer early branchings in the tree of life. Mol. Biol. Evol. 12:451458.[Abstract]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Terrapon, O. Gascuel, E. Marechal, and L. Brehelin Detection of new protein domains using co-occurrence: application to Plasmodium falciparum Bioinformatics, December 1, 2009; 25(23): 3077 - 3083. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Ertas, B. M. von Reumont, J.-W. Wagele, B. Misof, and T. Burmester Hemocyanin Suggests a Close Relationship of Remipedia and Hexapoda Mol. Biol. Evol., December 1, 2009; 26(12): 2711 - 2718. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Simon, S. Strauss, A. von Haeseler, and H. Hadrys A Phylogenomic Approach to Resolve the Basal Pterygote Divergence Mol. Biol. Evol., December 1, 2009; 26(12): 2719 - 2730. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Carter and G. Drouin The Evolutionary Rates of Eukaryotic RNA Polymerases and of Their Transcription Factors Are Affected by the Level of Concerted Evolution of the Genes They Transcribe Mol. Biol. Evol., November 1, 2009; 26(11): 2515 - 2520. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Dunn, J. P. Bielawski, T. J. Ward, C. Urquhart, and H. Gu Reconciling Ecological and Genomic Divergence among Lineages of Listeria under an "Extended Mosaic Genome Concept" Mol. Biol. Evol., November 1, 2009; 26(11): 2605 - 2615. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Blasingame, T. Tuton-Blasingame, L. Larkin, A. M. Falick, L. Zhao, J. Fong, V. Vaidyanathan, A. Visperas, P. Geurts, X. Hu, et al. Pyriform Spidroin 1, a Novel Member of the Silk Gene Family That Anchors Dragline Silk Fibers in Attachment Discs of the Black Widow Spider, Latrodectus hesperus J. Biol. Chem., October 16, 2009; 284(42): 29097 - 29108. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Rastgou, M. K. Habibi, K. Izadpanah, V. Masenga, R. G. Milne, Y. I. Wolf, E. V. Koonin, and M. Turina Molecular characterization of the plant virus genus Ourmiavirus and evidence of inter-kingdom reassortment of viral genome segments as its possible route of origin J. Gen. Virol., October 1, 2009; 90(10): 2525 - 2535. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Cobbe, K. M. Marshall, S. G. Rao, C.-W. Chang, F. Di Cara, E. Duca, S. Vass, A. Kassan, and M. M. S. Heck The conserved metalloprotease invadolysin localizes to the surface of lipid droplets J. Cell Sci., September 15, 2009; 122(18): 3414 - 3423. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Lartillot, T. Lepage, and S. Blanquart PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating Bioinformatics, September 1, 2009; 25(17): 2286 - 2288. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. G. Foster, C. J. Cox, and T. M. Embley The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods Phil Trans R Soc B, August 12, 2009; 364(1527): 2197 - 2207. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Nahum, S. Goswami, and M. H. Serres Protein families reflect the metabolic diversity of organisms and provide support for functional prediction Physiol Genomics, August 7, 2009; 38(3): 250 - 260. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Heise, D. Singh, H. van der Wel, S. O Sassi, J. M Johnson, C. L Feasley, C. M Koeller, J. O Previato, L. Mendonca-Previato, and C. M West Molecular analysis of a UDP-GlcNAc:polypeptide {alpha}-N-acetylglucosaminyltransferase implicated in the initiation of mucin-type O-glycosylation in Trypanosoma cruzi Glycobiology, August 1, 2009; 19(8): 918 - 933. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M. Wade, A. Tollenaere, M. R. Hall, and B. M. Degnan Evolution of a Novel Carotenoid-Binding Protein Responsible for Crustacean Shell Color Mol. Biol. Evol., August 1, 2009; 26(8): 1851 - 1864. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Fletcher and Z. Yang INDELible: A Flexible Simulator of Biological Sequence Evolution Mol. Biol. Evol., August 1, 2009; 26(8): 1879 - 1888. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Mosquna, A. Katz, E. L. Decker, S. A. Rensing, R. Reski, and N. Ohad Regulation of stem cell maintenance by the Polycomb protein FIE has been conserved during land plant evolution Development, July 15, 2009; 136(14): 2433 - 2444. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, C. L. Kleinman, H. Philippe, and N. Lartillot Computational Methods for Evaluating Phylogenetic Models of Coding Sequence Evolution with Dependence between Codons Mol. Biol. Evol., July 1, 2009; 26(7): 1663 - 1676. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-K. Seo and H. Kishino Statistical Comparison of Nucleotide, Amino Acid, and Codon Substitution Models for Evolutionary Analysis of Protein-Coding Sequences Syst Biol, June 29, 2009; (2009) syp015v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. D. Nedialkova, R. Ulferts, E. van den Born, C. Lauber, A. E. Gorbalenya, J. Ziebuhr, and E. J. Snijder Biochemical Characterization of Arterivirus Nonstructural Protein 11 Reveals the Nidovirus-Wide Conservation of a Replicative Endoribonuclease J. Virol., June 1, 2009; 83(11): 5671 - 5682. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Weber, A. Gruber, and P. G. Kroth The Presence and Localization of Thioredoxins in Diatoms, Unicellular Algae of Secondary Endosymbiotic Origin Mol Plant, May 1, 2009; 2(3): 468 - 477. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Berger and I. T. Baldwin Silencing the Hydroxyproline-Rich Glycopeptide Systemin Precursor in Two Accessions of Nicotiana attenuata Alters Flower Morphology and Rates of Self-Pollination Plant Physiology, April 1, 2009; 149(4): 1690 - 1700. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hue, R. J. Gifford, D. Dunn, E. Fernhill, D. Pillay, and on Behalf of the UK Collaborative Group on HIV Dru Demonstration of Sustained Drug-Resistant Human Immunodeficiency Virus Type 1 Lineages Circulating among Treatment-Naive Individuals J. Virol., March 15, 2009; 83(6): 2645 - 2654. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Schulz, M. Gottschling, I. G. Bravo, U. Wittstatt, E. Stockfleth, and I. Nindl Genomic characterization of the first insectivoran papillomavirus reveals an unusually long, second non-coding region and indicates a close relationship to Betapapillomavirus J. Gen. Virol., March 1, 2009; 90(3): 626 - 633. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Alvarez-Ponce, M. Aguade, and J. Rozas Network-level molecular evolutionary analysis of the insulin/TOR signal transduction pathway across 12 Drosophila genomes Genome Res., February 1, 2009; 19(2): 234 - 242. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Kashiyama, T. Seki, H. Numata, and S. G. Goto Molecular Characterization of Visual Pigments in Branchiopoda and the Evolution of Opsins in Arthropoda Mol. Biol. Evol., February 1, 2009; 26(2): 299 - 311. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Wu, J. M. Comeron, H. S. Yoon, and D. Bhattacharya Unexpected Dynamic Gene Family Evolution in Algal Actins Mol. Biol. Evol., February 1, 2009; 26(2): 249 - 253. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Devier, G. Aguileta, M. E. Hood, and T. Giraud Ancient Trans-specific Polymorphism at Pheromone Receptor Genes in Basidiomycetes Genetics, January 1, 2009; 181(1): 209 - 223. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P Huelsenbeck, P. Joyce, C. Lakner, and F. Ronquist Bayesian analysis of amino acid substitution models Phil Trans R Soc B, December 27, 2008; 363(1512): 3941 - 3953. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Q. Le, N. Lartillot, and O. Gascuel Phylogenetic mixture models for proteins Phil Trans R Soc B, December 27, 2008; 363(1512): 3965 - 3976. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T Holder, D. J Zwickl, and C. Dessimoz Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes Phil Trans R Soc B, December 27, 2008; 363(1512): 4013 - 4021. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Kuo, J. P. Wares, and J. C. Kissinger The Apicomplexan Whole-Genome Phylogeny: An Analysis of Incongruence among Gene Trees Mol. Biol. Evol., December 1, 2008; 25(12): 2689 - 2698. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Soria-Carrasco and J. Castresana Estimation of Phylogenetic Inconsistencies in the Three Domains of Life Mol. Biol. Evol., November 1, 2008; 25(11): 2319 - 2329. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Havird, M. M. Miyamoto, K. P. Choe, and D. H. Evans Gene Duplications and Losses within the Cyclooxygenase Family of Teleosts and Other Chordates Mol. Biol. Evol., November 1, 2008; 25(11): 2349 - 2359. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Luque, M. L. Riera-Alberola, A. Andujar, and J. A. G. Ochoa de Alda Intraphylum Diversity and Complex Evolution of Cyanobacterial Aminoacyl-tRNA Synthetases Mol. Biol. Evol., November 1, 2008; 25(11): 2369 - 2389. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Arai, S. D. Ohdachi, M. Asakawa, H. J. Kang, G. Mocz, J. Arikawa, N. Okabe, and R. Yanagihara Molecular phylogeny of a newfound hantavirus in the Japanese shrew mole (Urotrichus talpoides) PNAS, October 21, 2008; 105(42): 16296 - 16301. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Si Quang, O. Gascuel, and N. Lartillot Empirical profile mixture models for phylogenetic reconstruction Bioinformatics, October 15, 2008; 24(20): 2317 - 2323. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Lanfear and L. Bromham Statistical Tests between Competing Hypotheses of Hox Cluster Evolution Syst Biol, October 1, 2008; 57(5): 708 - 718. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Fernandes and W. R. Atchley Site-specific evolutionary rates in proteins are better modeled as non-independent and strictly relative Bioinformatics, October 1, 2008; 24(19): 2177 - 2183. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Kosakovsky Pond, A. F.Y. Poon, A. J. Leigh Brown, and S. D.W. Frost A Maximum Likelihood Method for Detecting Directional Evolution in Protein Sequences and Its Application to Influenza A Virus Mol. Biol. Evol., September 1, 2008; 25(9): 1809 - 1824. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Cadel-Six, C. Dauga, A. M. Castets, R. Rippka, C. Bouchier, N. Tandeau de Marsac, and M. Welker Halogenase Genes in Nonribosomal Peptide Synthetase Gene Clusters of Microcystis (Cyanobacteria): Sporadic Distribution and Evolution Mol. Biol. Evol., September 1, 2008; 25(9): 2031 - 2041. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Aguileta, S. Marthey, H. Chiapello, M.-H. Lebrun, F. Rodolphe, E. Fournier, A. Gendrault-Jacquemard, and T. Giraud Assessing the Performance of Single-Copy Genes for Recovering Robust Phylogenies Syst Biol, August 1, 2008; 57(4): 613 - 627. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ueda, T. Nishikawa, M. Fujimoto, H. Takanashi, S.-i. Arimura, N. Tsutsumi, and K.-i. Kadowaki Substitution of the Gene for Chloroplast RPS16 Was Assisted by Generation of a Dual Targeting Signal Mol. Biol. Evol., August 1, 2008; 25(8): 1566 - 1575. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan Spatial and Temporal Heterogeneity in Nucleotide Sequence Evolution Mol. Biol. Evol., August 1, 2008; 25(8): 1683 - 1694. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Q. Le and O. Gascuel An Improved General Amino Acid Replacement Matrix Mol. Biol. Evol., July 1, 2008; 25(7): 1307 - 1320. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. R. Magie and M. Q. Martindale Cell-Cell Adhesion in the Cnidaria: Insights Into the Evolution of Tissue Morphogenesis Biol. Bull., June 1, 2008; 214(3): 218 - 232. [Abstract] [Full Text] [PDF] |
||||
![]() |
T.-K. Seo and H. Kishino Synonymous Substitutions Substantially Improve Evolutionary Inference from Highly Diverged Proteins Syst Biol, June 1, 2008; 57(3): 367 - 377. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Matsuzaki, H. Kuroiwa, T. Kuroiwa, K. Kita, and H. Nozaki A Cryptic Algal Group Unveiled: A Plastid Biosynthesis Pathway in the Oyster Parasite Perkinsus marinus Mol. Biol. Evol., June 1, 2008; 25(6): 1167 - 1179. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Kamikawa, Y. Inagaki, and Y. Sako Direct phylogenetic evidence for lateral transfer of elongation factor-like gene PNAS, May 13, 2008; 105(19): 6965 - 6969. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Blanquart and N. Lartillot A Site- and Time-Heterogeneous Model of Amino Acid Replacement Mol. Biol. Evol., May 1, 2008; 25(5): 842 - 858. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Lartillot and H. Philippe Improvement of molecular phylogenetic inference and the phylogeny of Bilateria Phil Trans R Soc B, April 27, 2008; 363(1496): 1463 - 1472. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Baguna, P. Martinez, J. Paps, and M. Riutort Back in time: a new systematic proposal for the Bilateria Phil Trans R Soc B, April 27, 2008; 363(1496): 1481 - 1491. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. A. Cymerman, I. Chung, B. M. Beckmann, J. M. Bujnicki, and G. Meiss EXOG, a novel paralog of Endonuclease G in higher eukaryotes Nucleic Acids Res., March 27, 2008; 36(4): 1369 - 1379. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Yuri, R. T. Kimball, E. L. Braun, and M. J. Braun Duplication of Accelerated Evolution and Growth Hormone Gene in Passerine Birds Mol. Biol. Evol., February 1, 2008; 25(2): 352 - 361. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Jastroch, K. W. Withers, S. Taudien, P. B. Frappell, M. Helwig, T. Fromme, V. Hirschberg, G. Heldmaier, B. M. McAllan, B. T. Firth, et al. Marsupial uncoupling protein 1 sheds light on the evolution of mammalian nonshivering thermogenesis Physiol Genomics, January 17, 2008; 32(2): 161 - 169. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Gostincar, M. Turk, T. Trbuha, T. Vaupotic, A. Plemenitas, and N. Gunde-Cimerman Expression of fatty-acid-modifying enzymes in the halotolerant black yeast Aureobasidium pullulans (de Bary) G. Arnaud under salt stress. Stud Mycol, January 1, 2008; 61: 51 - 59. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. El Sheikh, A. T. Poret-Peterson, and M. G. Klotz Characterization of Two New Genes, amoR and amoD, in the amo Operon of the Marine Ammonia Oxidizer Nitrosococcus oceani ATCC 19707 Appl. Envir. Microbiol., January 1, 2008; 74(1): 312 - 318. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Gogendeau, C. Klotz, O. Arnaiz, A. Malinowska, M. Dadlez, N. G. de Loubresse, F. Ruiz, F. Koll, and J. Beisson Functional diversification of centrins and cell morphological complexity J. Cell Sci., January 1, 2008; 121(1): 65 - 74. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Hamaji, P. J. Ferris, A. W. Coleman, S. Waffenschmidt, F. Takahashi, I. Nishii, and H. Nozaki Identification of the Minus-Dominance Gene Ortholog in the Mating-Type Locus of Gonium pectorale Genetics, January 1, 2008; 178(1): 283 - 294. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Takahashi, A. Kamiya, A. Ishiguro, A. C. Suzuki, N. Saitou, A. Toyoda, and J. Aruga Conservation and Diversification of Msx Protein in Metazoan Evolution Mol. Biol. Evol., January 1, 2008; 25(1): 69 - 82. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Hyndman and D. H. Evans Endothelin and endothelin converting enzyme-1 in the fish gill: evolutionary and physiological perspectives J. Exp. Biol., December 15, 2007; 210(24): 4286 - 4297. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Schmidt von Braun, A. Sabetti, P. J. Hanic-Joyce, J. Gu, E. Schleiff, and P. B. M. Joyce Dual targeting of the tRNA nucleotidyltransferase in plants: not just the signal J. Exp. Bot., December 1, 2007; 58(15-16): 4083 - 4093. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. T. Saunders and P. Green Insights from Modeling Protein Evolution with Context-Dependent Mutation and Asymmetric Amino Acid Selection Mol. Biol. Evol., December 1, 2007; 24(12): 2632 - 2647. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Schlegel, O. Mirus, A. von Haeseler, and E. Schleiff The Tetratricopeptide Repeats of Receptors Involved in Protein Translocation across Membranes Mol. Biol. Evol., December 1, 2007; 24(12): 2763 - 2774. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. V. Goldstone, H. M. H. Goldstone, A. M. Morrison, A. Tarrant, S. E. Kern, B. R. Woodin, and J. J. Stegeman Cytochrome P450 1 Genes in Early Deuterostomes (Tunicates and Sea Urchins) and Vertebrates (Chicken and Frog): Origin and Diversification of the CYP1 Gene Family Mol. Biol. Evol., December 1, 2007; 24(12): 2619 - 2631. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-B. Malik, M. A. Ramesh, A. M. Hulstrand, and J. M. Logsdon Jr. Protist Homologs of the Meiotic Spo11 Gene and Topoisomerase VI reveal an Evolutionary History of Gene Duplication and Lineage-Specific Loss Mol. Biol. Evol., December 1, 2007; 24(12): 2827 - 2841. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Moslavac, K. Nicolaisen, O. Mirus, F. Al Dehni, R. Pernil, E. Flores, I. Maldener, and E. Schleiff A TolC-Like Protein Is Required for Heterocyst Development in Anabaena sp. Strain PCC 7120 J. Bacteriol., November 1, 2007; 189(21): 7887 - 7895. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Xiong, C. E. Bauer, and A. Pancholy Insight into the haem d1 biosynthesis pathway in heliobacteria through bioinformatics analysis Microbiology, October 1, 2007; 153(10): 3548 - 3562. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. B. Thackray, C. E. Wobus, K. A. Chachu, B. Liu, E. R. Alegre, K. S. Henderson, S. T. Kelley, and H. W. Virgin IV Murine Noroviruses Comprising a Single Genogroup Exhibit Biological Diversity despite Limited Sequence Divergence J. Virol., October 1, 2007; 81(19): 10460 - 10473. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, H. Philippe, and N. Lartillot Exploring Fast Computational Strategies for Probabilistic Phylogenetic Analysis Syst Biol, October 1, 2007; 56(5): 711 - 726. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan New Approaches to Phylogenetic Tree Search and Their Application to Large Numbers of Protein Alignments Syst Biol, October 1, 2007; 56(5): 727 - 740. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Massingham and N. Goldman Statistics of the Log-Det Estimator Mol. Biol. Evol., October 1, 2007; 24(10): 2277 - 2285. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Macqueen, D. Robb, and I. A. Johnston Temperature influences the coordinated expression of myogenic regulatory factors during embryonic myogenesis in Atlantic salmon (Salmo salar L.) J. Exp. Biol., August 15, 2007; 210(16): 2781 - 2794. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kosiol, I. Holmes, and N. Goldman An Empirical Codon Model for Protein Sequence Evolution Mol. Biol. Evol., July 1, 2007; 24(7): 1464 - 1479. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Yebra, M. Zuniga, S. Beaufils, G. Perez-Martinez, J. Deutscher, and V. Monedero Identification of a Gene Cluster Enabling Lactobacillus casei BL23 To Utilize myo-Inositol Appl. Envir. Microbiol., June 15, 2007; 73(12): 3850 - 3858. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodriguez-Ezpeleta, H. Brinkmann, B. Roure, N. Lartillot, B. F. Lang, and H. Philippe Detecting and Overcoming Systematic Errors in Genome-Scale Phylogenies Syst Biol, June 1, 2007; 56(3): 389 - 399. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ruano-Rubio and M. A. Fares Testing the Neutral Fixation of Hetero-Oligomerism in the Archaeal Chaperonin CCT Mol. Biol. Evol., June 1, 2007; 24(6): 1384 - 1396. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. A. Gladyshev and I. R. Arkhipova From the Cover: Telomere-associated endonuclease-deficient Penelope-like retroelements in diverse eukaryotes PNAS, May 29, 2007; 104(22): 9352 - 9357. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gottschling, A. Stamatakis, I. Nindl, E. Stockfleth, A. Alonso, and I. G. Bravo Multiple Evolutionary Mechanisms Drive Papillomavirus Diversification Mol. Biol. Evol., May 1, 2007; 24(5): 1242 - 1258. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. G. Beiko and R. L. Charlebois A simulation test bed for hypotheses of genome evolution Bioinformatics, April 1, 2007; 23(7): 825 - 831. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. O. Sassi, E. L. Braun, and S. A. Benner The Evolution of Seminal Ribonuclease: Pseudogene Reactivation or Multiple Gene Inactivation Events? Mol. Biol. Evol., April 1, 2007; 24(4): 1012 - 1024. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Richardt, D. Lang, R. Reski, W. Frank, and S. A. Rensing PlanTAPDB, a Phylogeny-Based Resource of Plant Transcription-Associated Proteins Plant Physiology, April 1, 2007; 143(4): 1452 - 1466. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Fitzpatrick, Y. Hirai, H. Hirai, and K. F. Hoffmann Schistosome egg production is dependent upon the activities of two developmentally regulated tyrosinases FASEB J, March 1, 2007; 21(3): 823 - 835. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodriguez-Ezpeleta, H. Philippe, H. Brinkmann, B. Becker, and M. Melkonian Phylogenetic Analyses of Nuclear, Mitochondrial, and Plastid Multigene Data Sets Support the Placement of Mesostigma in the Streptophyta Mol. Biol. Evol., March 1, 2007; 24(3): 723 - 731. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Brindefalk, J. Viklund, D. Larsson, M. Thollesson, and S. G. E. Andersson Origin and Evolution of the Mitochondrial Aminoacyl-tRNA Synthetases Mol. Biol. Evol., March 1, 2007; 24(3): 743 - 756. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Sanderson-Smith, M. Dowton, M. Ranson, and M. J. Walker The Plasminogen-Binding Group A Streptococcal M Protein-Related Protein Prp Binds Plasminogen via Arginine and Histidine Residues J. Bacteriol., February 15, 2007; 189(4): 1435 - 1440. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Doron-Faigenboim and T. Pupko A Combined Empirical and Mechanistic Codon Model Mol. Biol. Evol., February 1, 2007; 24(2): 388 - 397. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Bredemeier, T. Schlegel, F. Ertel, A. Vojta, L. Borissenko, M. T. Bohnsack, M. Groll, A. von Haeseler, and E. Schleiff Functional and Phylogenetic Properties of the Pore-forming beta-Barrel Transporters of the Omp85 Family J. Biol. Chem., January 19, 2007; 282(3): 1882 - 1890. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Baurain, H. Brinkmann, and H. Philippe Lack of Resolution in the Animal Phylogeny: Closely Spaced Cladogeneses or Undetected Systematic Errors? Mol. Biol. Evol., January 1, 2007; 24(1): 6 - 9. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Kosakovsky Pond, F. V. Mannino, M. B. Gravenor, S. V. Muse, and S. D. W. Frost Evolutionary Model Selection with a Genetic Algorithm: A Case Study Using Stem RNA Mol. Biol. Evol., January 1, 2007; 24(1): 159 - 170. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Porter, T. W. Cronin, D. A. McClellan, and K. A. Crandall Molecular Characterization of Crustacean Visual Pigments and the Evolution of Pancrustacean Opsins Mol. Biol. Evol., January 1, 2007; 24(1): 253 - 268. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. H. Shafer, M. A. McCartney, and L. M. Faircloth Identifying exoskeleton proteins in the blue crab from an expressed sequence tag (EST) library Integr. Comp. Biol., December 1, 2006; 46(6): 978 - 990. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gray-Mitsumune, M. O'Brien, C. Bertrand, F. Tebbji, A. Nantel, and D. P. Matton Loss of ovule identity induced by overexpression of the fertilization-related kinase 2 (ScFRK2), a MAPKKK from Solanum chacoense J. Exp. Bot., December 1, 2006; 57(15): 4171 - 4187. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. B. Gould, M. S. Sommer, P. G. Kroth, G. H. Gile, P. J. Keeling, and U.-G. Maier Nucleus-to-Nucleus Gene Transfer and Protein Retargeting into a Remnant Cytoplasm of Cryptophytes and Diatoms Mol. Biol. Evol., December 1, 2006; 23(12): 2413 - 2422. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Savard, D. Tautz, S. Richards, G. M. Weinstock, R. A. Gibbs, J. H. Werren, H. Tettelin, and M. J. Lercher Phylogenomic analysis reveals bees and wasps (Hymenoptera) at the base of the radiation of Holometabolous insects Genome Res., November 1, 2006; 16(11): 1334 - 1338. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. E. Baker and K. Rogers Phylogenetic Analysis of Fungal Centromere H3 Proteins Genetics, November 1, 2006; 174(3): 1481 - 1492. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






























