Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
1
pubmed:dateCreated
2001-9-5
pubmed:abstractText
Protein sequence alignment has become a widely used method in the study of newly sequenced proteins. Most sequence alignment methods use an affine gap penalty to assign scores to insertions and deletions. Although affine gap penalties represent the relative ease of extending a gap compared with initializing a gap, it is still an obvious oversimplification of the real processes that occur during sequence evolution. To improve the efficiency of sequence alignment methods and to obtain a better understanding of the process of sequence evolution, we wanted to find a more accurate model of insertions and deletions in homologous proteins. In this work, we extract the probability of a gap occurrence and the resulting gap length distribution in distantly related proteins (sequence identity < 25%) using alignments based on their common structures. We observe a distribution of gaps that can be fitted with a multiexponential with four distinct components. The results suggest new approaches to modeling insertions and deletions in sequence alignments.
pubmed:grant
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Oct
pubmed:issn
0887-3585
pubmed:author
pubmed:copyrightInfo
Copyright 2001 Wiley-Liss, Inc.
pubmed:issnType
Print
pubmed:day
1
pubmed:volume
45
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
102-4
pubmed:dateRevised
2007-11-14
pubmed:meshHeading
pubmed:year
2001
pubmed:articleTitle
Distribution of Indel lengths.
pubmed:affiliation
Biophysics Research Division, University of Michigan, Ann Arbor, USA.
pubmed:publicationType
Journal Article, Research Support, U.S. Gov't, P.H.S., Research Support, U.S. Gov't, Non-P.H.S., Research Support, Non-U.S. Gov't