Statements in which the resource exists as a subject.
Predicate	Object
rdf:type
lifeskim:mentions
pubmed:issue
23
pubmed:dateCreated
2003-11-20
pubmed:databankReference
pubmed:abstractText
Gene annotation in viruses often relies upon similarity search methods. These methods possess high specificity but some genes may be missed, either those unique to a particular genome or those highly divergent from known homologs. To identify potentially missing viral genes we have analyzed all complete viral genomes currently available in GenBank with a specialized and augmented version of the gene finding program GeneMarkS. In particular, by implementing genome-specific self-training protocols we have better adjusted the GeneMarkS statistical models to sequences of viral genomes. Hundreds of new genes were identified, some in well studied viral genomes. For example, a new gene predicted in the genome of the Epstein-Barr virus was shown to encode a protein similar to alpha-herpesvirus minor tegument protein UL14 with heat shock functions. Convincing evidence of this similarity was obtained after only 12 PSI-BLAST iterations. In another example, several iterations of PSI-BLAST were required to demonstrate that a gene predicted in the genome of Alcelaphine herpesvirus 1 encodes a BALF1-like protein which is thought to be involved in apoptosis regulation and, potentially, carcinogenesis. New predictions were used to refine annotations of viral genomes in the RefSeq collection curated by the National Center for Biotechnology Information. Importantly, even in those cases where no sequence similarities were detected, GeneMarkS significantly reduced the number of primary targets for experimental characterization by identifying the most probable candidate genes. The new genome annotations were stored in VIOLIN, an interactive database which provides access to similarity search tools for up-to-date analysis of predicted viral proteins.
pubmed:grant
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-10481031, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-10487861, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-10871272, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-10906222, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11083803, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11125038, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11125070, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11139604, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11152491, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11292750, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11322826, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11410670, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11452024, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11553768, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11689662, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11752243, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11836425, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11878922, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11895953, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-11916376, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-12009880, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-12045222, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-12663918, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-12730501, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-2172928, 
http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-2849754, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-6221115, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-7984428, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-8211139, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-9298646, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-9461475, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-9636706, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-9705509, http://linkedlifedata.com/resource/pubmed/commentcorrection/14627837-9722640
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Dec
pubmed:issn
1362-4962
pubmed:author
pubmed:issnType
Electronic
pubmed:day
1
pubmed:volume
31
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
7041-55
pubmed:dateRevised
2010-09-20
pubmed:meshHeading
pubmed:year
2003
pubmed:articleTitle
Improving gene annotation of complete viral genomes.
pubmed:affiliation
School of Biology, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA.
pubmed:publicationType
Journal Article; Research Support, U.S. Gov't, P.H.S.; Research Support, Non-U.S. Gov't