Molecular Biology and Evolution, Vol 1, 357-370, Copyright © 1984 by Society for Molecular Biology and Evolution
S Karlin, G Ghandour, DE Foulser and LJ Korn
A method is presented for the analysis and comparison of nucleic acid and
protein sequences utilizing all identity blocks (the term "identity block"
refers to a set of consecutive matches between two sequences) above a
prescribed length. Moreover, such identity blocks are determined for
various groupings of amino acids according to chemical, functional, charge,
and hydrophobic classifications. Alignment maps based on these
classifications and containing all statistically significant identity
blocks between two or more sequences are constructed. New theoretical
results for determining the expected length of the longest identity block
between sequences are also presented and are used, along with permutation
procedures, to ascertain the significance of sequence identity blocks. As
an example of the type of information that can be obtained, comparison has
been made of the complete DNA sequences and the E1, E2, L1, and L2 genes of
human and bovine papillomaviruses based on the classification schemes
described above.
ORIGINAL ARTICLE
Comparative analysis of human and bovine papillomaviruses
Department of Mathematics, Stanford University, California 94305.
![]()
CiteULike
Connotea
Del.icio.us What's this?