Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
3
pubmed:dateCreated
2006-6-5
pubmed:abstractText
The identification and characterization of the complete ensemble of genes is a main goal of deciphering the digital information stored in the human genome. Many algorithms for computational gene prediction have been described, ultimately derived from two basic concepts: (1) modeling gene structure and (2) recognizing sequence similarity. Successful hybrid methods combining these two concepts have also been developed. We present a third orthogonal approach to gene prediction, based on detecting the genomic signatures of transcription, accumulated over evolutionary time. We discuss four algorithms based on this third concept: Greens and CHOWDER, which quantify mutational strand biases caused by transcription-coupled DNA repair, and ROAST and PASTA, which are based on strand-specific selection against polyadenylation signals. We combined these algorithms into an integrated method called FEAST, which we used to predict the location and orientation of thousands of putative transcription units not overlapping known genes. Many of the newly predicted transcriptional units do not appear to code for proteins. The new algorithms are particularly apt at detecting genes with long introns and lacking sequence conservation. They therefore complement existing gene prediction methods and will help identify functional transcripts within many apparent "genomic deserts."
pubmed:grant
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-10231571, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-10547847, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-10607616, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-10779482, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-10871271, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-10992426, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-11083920, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-11237011, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-11717405, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-11731486, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-11731496, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-12045153, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-12368240, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-12466850, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-12612582, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-12881777, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-12917688, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-14764925, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15057822, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15109775, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15152245, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15193305, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15489334, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15545716, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15790807, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15805498, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15964997, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-15965474, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-2315319, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-2474545, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-3010125, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-3861241, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-6571985, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-7702601, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-7723058, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-7877164, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-8388099, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-9149143, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-9403055, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-9721203, http://linkedlifedata.com/resource/pubmed/commentcorrection/16543943-9862982
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Mar
pubmed:issn
1553-7358
pubmed:author
pubmed:issnType
Electronic
pubmed:volume
2
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
e18
pubmed:dateRevised
2009-11-18
pubmed:meshHeading
pubmed:year
2006
pubmed:articleTitle
A third approach to gene prediction suggests thousands of additional human transcribed regions.
pubmed:affiliation
Institute for Systems Biology, Seattle, Washington, USA. Gustavo@SystemsBiology.org
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't, Research Support, N.I.H., Extramural