Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
4
pubmed:dateCreated
2004-8-3
pubmed:abstractText
Optimal spaced seeds were developed as a method to increase sensitivity of local alignment programs similar to BLASTN. Such seeds have been used before in the program PatternHunter, and have given improved sensitivity and running time relative to BLASTN in genome-genome comparison. We study the problem of computing optimal spaced seeds for detecting homologous coding regions in unannotated genomic sequences. By using well-chosen seeds, we are able to improve the sensitivity of coding sequence alignment over that of TBLASTX, while keeping runtime comparable to BLASTN. We identify good seeds by first giving effective hidden Markov models of conservation in alignments of homologous coding regions. We give an efficient algorithm to compute the optimal spaced seed when conservation patterns are generated by these models. Our results offer the hope of improved gene finding due to fewer missed exons in DNA/DNA comparison, and more effective homology search in general, and may have applications outside of bioinformatics.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Jan
pubmed:issn
0219-7200
pubmed:author
pubmed:issnType
Print
pubmed:volume
1
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
595-610
pubmed:dateRevised
2006-11-15
pubmed:meshHeading
pubmed:year
2004
pubmed:articleTitle
Optimal spaced seeds for homologous coding regions.
pubmed:affiliation
School of Computer Science, University of Waterloo, 200 University Ave West, Waterloo, ON N2L3G1, Canada. bbrejova@math.uwaterloo.ca
pubmed:publicationType
Journal Article, Comparative Study, Research Support, Non-U.S. Gov't