Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
20
pubmed:dateCreated
2005-10-12
pubmed:abstractText
Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A long-standing problem has been the comparison of sequences with biased amino acid compositions, for which standard substitution matrices are not optimal. To address this problem, we have recently developed a general procedure for transforming a standard matrix into one appropriate for the comparison of two sequences with arbitrary, and possibly differing compositions. Such adjusted matrices yield, on average, improved alignments and alignment scores when applied to the comparison of proteins with markedly biased compositions. Here we review the application of compositionally adjusted matrices and consider whether they may also be applied fruitfully to general purpose protein sequence database searches, in which related sequence pairs do not necessarily have strong compositional biases. Although it is not advisable to apply compositional adjustment indiscriminately, we describe several simple criteria under which invoking such adjustment is on average beneficial. In a typical database search, at least one of these criteria is satisfied by over half the related sequence pairs. Compositional substitution matrix adjustment is now available in NCBI's protein-protein version of blast.
pubmed:grant
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-10642881, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-11108698, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-11139604, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-11382360, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-11452024, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-11473008, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-11752310, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-1438297, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-14663142, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-15509610, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-15531614, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-1604319, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-16218943, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-1633570, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-16718863, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-2051488, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-2231712, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-2315319, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-3221397, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-3357886, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-3461222, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-3570667, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-3580642, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-5167087, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-5420325, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-6100188, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-7166760, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-7265238, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-7549879, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-7723011, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-8234244, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-8483166, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-9254694, http://linkedlifedata.com/resource/pubmed/commentcorrection/16218944-9600919
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Oct
pubmed:issn
1742-464X
pubmed:author
pubmed:issnType
Print
pubmed:volume
272
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
5101-9
pubmed:dateRevised
2009-11-18
pubmed:meshHeading
pubmed:year
2005
pubmed:articleTitle
Protein database searches using compositionally adjusted substitution matrices.
pubmed:affiliation
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA. altschul@ncbi.nlm.nih.gov
pubmed:publicationType
Journal Article, Review