Molecular Biology and Evolution, Vol 6, 649-668, Copyright © 1989 by Society for Molecular Biology and Evolution
J Hein
Among the fundamental problems in molecular evolution and in the analysis
of homologous sequences are alignment, phylogeny reconstruction, and the
reconstruction of ancestral sequences. This paper presents a fast, combined
solution to these problems. The new algorithm gives an approximation to the
minimal history in terms of a distance function on sequences. The distance
function on sequences is a minimal weighted path length constructed from
substitutions and insertions-deletions of segments of any length.
Substitutions are weighted with an arbitrary metric on the set of
nucleotides or amino acids, and indels are weighted with a gap penalty
function of the form gk = a + (bxk), where k is the length of the indel and
a and b are two positive numbers. A novel feature is the introduction of
the concept of sequence graphs and a generalization of the traditional
dynamic sequence comparison algorithm to the comparison of sequence graphs.
Sequence graphs ease several computational problems. They are used to
represent large sets of sequences that can then be compared simultaneously.
Furthermore, they allow the handling of multiple, equally good, alignments,
where previous methods were forced to make arbitrary choices. A program
written in C implemented this method; it was tested first on 22 5S RNA
sequences.
ORIGINAL ARTICLE
A new method that simultaneously aligns and reconstructs ancestral sequences for any number of homologous sequences, when the phylogeny is given
NIEHS, Research Triangle Park, North Carolina 27709.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Lindgreen, P. P. Gardner, and A. Krogh MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing Bioinformatics, December 15, 2007; 23(24): 3304 - 3311. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Zhang and T. Kahveci QOMA: quasi-optimal multiple alignment of protein sequences Bioinformatics, January 15, 2007; 23(2): 162 - 168. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Loytynoja and N. Goldman From The Cover: An algorithm for progressive multiple alignment of sequences with insertions PNAS, July 26, 2005; 102(30): 10557 - 10562. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Blanchette, E. D. Green, W. Miller, and D. Haussler Reconstructing large regions of an ancestral mammalian genome in silico Genome Res., December 1, 2004; 14(12): 2412 - 2423. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Blanchette, W. J. Kent, C. Riemer, L. Elnitski, A. F.A. Smit, K. M. Roskin, R. Baertsch, K. Rosenbloom, H. Clawson, E. D. Green, et al. Aligning Multiple Genomic Sequences With the Threaded Blockset Aligner Genome Res., April 1, 2004; 14(4): 708 - 715. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Nishizawa, E. Shimoda, and M. Kasahara Substrate Recognition Domain of the Gal2 Galactose Transporter in Yeast Saccharomyces cerevisiae as Revealed by Chimeric Galactose-Glucose Transporters J. Biol. Chem., February 10, 1995; 270(6): 2423 - 2426. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Gonnet, M. Cohen, and S. Benner Exhaustive matching of the entire protein sequence database Science, June 5, 1992; 256(5062): 1443 - 1445. [Abstract] [PDF] |
||||




