Statements in which the resource exists as a subject.
Predicate	Object
rdf:type
lifeskim:mentions
pubmed:issue
2
pubmed:dateCreated
2008-5-8
pubmed:abstractText
Characterizing the DNA-binding specificities of transcription factors is a key problem in computational biology that has been addressed by multiple algorithms. These usually take as input sequences that are putatively bound by the same factor and output one or more DNA motifs. A common practice is to apply several such algorithms simultaneously to improve coverage at the price of redundancy. In interpreting such results, two tasks are crucial: clustering of redundant motifs, and attributing the motifs to transcription factors by retrieval of similar motifs from previously characterized motif libraries. Both tasks inherently involve motif comparison. Here we present a novel method for comparing and merging motifs, based on Bayesian probabilistic principles. This method takes into account both the similarity in positional nucleotide distributions of the two motifs and their dissimilarity to the background distribution. We demonstrate the use of the new comparison method as a basis for motif clustering and retrieval procedures, and compare it to several commonly used alternatives. Our results show that the new method outperforms other available methods in accuracy and sensitivity. We incorporated the resulting motif clustering and retrieval procedures in a large-scale automated pipeline for analyzing DNA motifs. This pipeline integrates the results of various DNA motif discovery algorithms and automatically merges redundant motifs from multiple training sets into a coherent annotated library of motifs. Application of this pipeline to recent genome-wide transcription factor location data in S. cerevisiae successfully identified DNA motifs in a manner that is as good as semi-automated analysis reported in the literature. Moreover, we show how this analysis elucidates the mechanisms of condition-specific preferences of transcription factors.
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-10487868, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-10698627, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-10812473, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-11102521, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-11734641, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-11827492, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-11861919, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12015892, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12073323, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12101404, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12384591, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12410840, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12519945, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12520026, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12626717, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12732146, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-12748633, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-14668220, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-14681366, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-14681407, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-14985506, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15297295, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15343339, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15454407, 
http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-1549472, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15620355, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15620356, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15814553, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15905282, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-15980506, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16024809, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16024819, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16103898, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16246914, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16306045, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16522208, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16683017, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16782869, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-16884493, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-17324271, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-17397256, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-17478497, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-7584439, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-8871566, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-8902360, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-9036858, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-9381177, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-9672829, http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-9843569, 
http://linkedlifedata.com/resource/pubmed/commentcorrection/18463706-9843981
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Feb
pubmed:issn
1553-7358
pubmed:author
pubmed:issnType
Electronic
pubmed:volume
4
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
e1000010
pubmed:dateRevised
2011-7-5
pubmed:meshHeading
pubmed:year
2008
pubmed:articleTitle
A novel Bayesian DNA motif comparison method for clustering and retrieval.
pubmed:affiliation
School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel.
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't, Research Support, N.I.H., Extramural
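
The pubmed:abstractText value in this record describes a comparison score that rewards two motifs for having similar positional nucleotide distributions and for differing from the background distribution. A minimal Python sketch of that idea follows. It is not the authors' published formula: the uniform Dirichlet prior, the uniform background, the function names, and the restriction to pre-aligned motifs of equal length (no offsets or reverse complements) are all assumptions made for illustration.

```python
import math

# Assumed uniform background over A, C, G, T (an illustrative choice).
BACKGROUND = (0.25, 0.25, 0.25, 0.25)


def log_marginal(counts, alpha=(1.0, 1.0, 1.0, 1.0)):
    """Log marginal likelihood of observed nucleotide counts under a
    Dirichlet prior with pseudocounts `alpha` (assumed uniform here)."""
    total = math.lgamma(sum(alpha)) - math.lgamma(sum(alpha) + sum(counts))
    for c, a in zip(counts, alpha):
        total += math.lgamma(a + c) - math.lgamma(a)
    return total


def log_background(counts, background=BACKGROUND):
    """Log likelihood of the counts under a fixed background distribution."""
    return sum(c * math.log(p) for c, p in zip(counts, background))


def column_score(c1, c2):
    """Score one pair of aligned motif columns (count vectors over A, C, G, T).

    First term: common source versus two independent sources (similarity).
    Second term: common source versus background (informativeness).
    """
    merged = [a + b for a, b in zip(c1, c2)]
    common = log_marginal(merged)
    independent = log_marginal(c1) + log_marginal(c2)
    return (common - independent) + (common - log_background(merged))


def motif_score(m1, m2):
    """Sum column scores over two pre-aligned motifs of equal length."""
    if len(m1) != len(m2):
        raise ValueError("this sketch only handles motifs of equal length")
    return sum(column_score(a, b) for a, b in zip(m1, m2))


# Hypothetical count matrices: one row per position, columns are A, C, G, T.
motif_a = [[10, 0, 0, 0], [0, 10, 0, 0]]
motif_b = [[9, 1, 0, 0], [0, 9, 1, 0]]    # close to motif_a
motif_c = [[0, 0, 10, 0], [0, 0, 0, 10]]  # unlike motif_a

similar = motif_score(motif_a, motif_b)
dissimilar = motif_score(motif_a, motif_c)
```

In this sketch a pair of similar, informative motifs scores higher than a pair of dissimilar ones, so the score could serve as the similarity measure in a clustering or retrieval step of the kind the abstract mentions.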