Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
14
pubmed:dateCreated
2007-8-10
pubmed:abstractText
The sequencing of complete genomes has created a pressing need for automated annotation of gene function. Because domains are the basic units of protein function and evolution, a gene can be annotated from a domain database by aligning domains to the corresponding protein sequence. Ideally, complete domains are aligned to protein subsequences, in a 'semi-global alignment'. Local alignment, which aligns pieces of domains to subsequences, is common in high-throughput annotation applications, however. It is a mature technique, with the heuristics and accurate E-values required for screening large databases and evaluating the screening results. Hidden Markov models (HMMs) provide an alternative theoretical framework for semi-global alignment, but their use is limited because they lack heuristic acceleration and accurate E-values. Our new tool, GLOBAL, overcomes some limitations of previous semi-global HMMs: it has accurate E-values and the possibility of the heuristic acceleration required for high-throughput applications. Moreover, according to a standard of truth based on protein structure, two semi-global HMM alignment tools (GLOBAL and HMMer) had comparable performance in identifying complete domains, but distinctly outperformed two tools based on local alignment. When searching for complete protein domains, therefore, GLOBAL avoids disadvantages commonly associated with HMMs, yet maintains their superior retrieval performance.
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-10745990, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-10982878, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-11452024, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-11752314, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-11752315, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-12075022, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-12364612, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-12512721, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-12520028, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-12969510, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-14705025, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-15072685, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-15613392, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-16718863, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-16845028, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-1774068, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-1924347, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-2231712, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-2684350, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-2770630, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-2983426, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-3162770, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-3287615, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-6572363, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-7265238, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-8804824, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-8946368, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9146967, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9254694, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9283754, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9520501, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9600919, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9837738, http://linkedlifedata.com/resource/pubmed/commentcorrection/17596268-9927713
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:issn
1362-4962
pubmed:author
pubmed:issnType
Electronic
pubmed:volume
35
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
4678-85
pubmed:dateRevised
2009-11-18
pubmed:meshHeading
pubmed:year
2007
pubmed:articleTitle
The identification of complete domains within protein sequences using accurate E-values for semi-global alignment.
pubmed:affiliation
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Department of Health and Human Services, Bethesda, MD 20894, USA.
pubmed:publicationType
Journal Article, Comparative Study, Evaluation Studies, Research Support, N.I.H., Intramural