Statements in which the resource exists as a subject.
Predicate / Object
rdf:type
lifeskim:mentions
pubmed:issue
4
pubmed:dateCreated
1989-3-27
pubmed:abstractText
The ability to determine important features within DNA sequences from the sequences alone is becoming essential as large-scale sequencing projects are being undertaken. We present a method that can be applied to the problem of identifying the recognition pattern for a DNA-binding protein given only a collection of sequenced DNA fragments, each known to contain somewhere within it a binding site for that protein. Information about the position or orientation of the binding sites within those fragments is not needed. The method compares the "information content" of a large number of possible binding site alignments to arrive at a matrix representation of the binding site pattern. The specificity of the protein is represented as a matrix, rather than a consensus sequence, allowing patterns that are typical of regulatory protein-binding sites to be identified. The reliability of the method improves as the number of sequences increases, but the time required increases only linearly with the number of sequences. An example, using known cAMP receptor protein-binding sites, illustrates the method.
pubmed:grant
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-2898280, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3003533, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3293587, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3363347, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3379641, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3383004, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3474607, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3525846, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3550697, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3612791, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3806669, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-3908689, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-6283312, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-6316325, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-6344016, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-6364039, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-6364042, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-6372090, http://linkedlifedata.com/resource/pubmed/commentcorrection/2919167-7037748
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Feb
pubmed:issn
0027-8424
pubmed:author
pubmed:issnType
Print
pubmed:volume
86
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1183-7
pubmed:dateRevised
2009-11-18
pubmed:meshHeading
pubmed:year
1989
pubmed:articleTitle
Identifying protein-binding sites from unaligned DNA fragments.
pubmed:affiliation
Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder 80309.
pubmed:publicationType
Journal Article, Research Support, U.S. Gov't, P.H.S.
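
Note on the abstract: the record says the method compares the "information content" of candidate binding-site alignments and represents the result as a matrix. The sketch below shows one common way to score a single alignment of that kind, and it is not the authors' implementation. It assumes a uniform background of 0.25 per base and applies no small-sample correction. The function names `frequency_matrix` and `information_content` are hypothetical. The search over possible alignments described in the abstract is not reproduced here.

```python
import math
from collections import Counter

BASES = "ACGT"


def frequency_matrix(sites):
    """Return per-position base frequencies for equal-length aligned sites."""
    if not sites:
        raise ValueError("at least one site is required")
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    matrix = []
    for pos in range(length):
        counts = Counter(site[pos] for site in sites)
        matrix.append({base: counts[base] / len(sites) for base in BASES})
    return matrix


def information_content(sites, background=None):
    """Sum over positions of f * log2(f / p), in bits.

    Assumption: a uniform background (p = 0.25 per base) unless one is given.
    """
    if background is None:
        background = {base: 0.25 for base in BASES}
    total = 0.0
    for column in frequency_matrix(sites):
        for base in BASES:
            freq = column[base]
            if freq > 0:  # a zero frequency contributes nothing to the sum
                total += freq * math.log2(freq / background[base])
    return total
```

Under these assumptions, a fully conserved position contributes 2 bits and a position with all four bases equally frequent contributes 0 bits, so alignments with higher totals correspond to more specific candidate patterns.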