Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
24
pubmed:dateCreated
1992-2-7
pubmed:abstractText
Genes in higher eukaryotes may span tens or hundreds of kilobases with the protein-coding regions accounting for only a few percent of the total sequence. Identifying genes within large regions of uncharacterized DNA is a difficult undertaking and is currently the focus of many research efforts. We describe a reliable computational approach for locating protein-coding portions of genes in anonymous DNA sequence. Using a concept suggested by robotic environmental sensing, our method combines a set of sensor algorithms and a neural network to localize the coding regions. Several algorithms that report local characteristics of the DNA sequence, and therefore act as sensors, are also described. In its current configuration the "coding recognition module" identifies 90% of coding exons of length 100 bases or greater with less than one false positive coding exon indicated per five coding exons indicated. This is a significantly lower false positive rate than any method of which we are aware. This module demonstrates a method with general applicability to sequence-pattern recognition problems and is available for current research efforts.
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-11607061, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-16593901, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-1690334, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2114220, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2134734, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2216778, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2333226, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2395643, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2475911, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-2781285, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-3353225, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-3362858, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-6393058, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-6546423, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-6953413, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-7063399, http://linkedlifedata.com/resource/pubmed/commentcorrection/1763041-7145702
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Dec
pubmed:issn
0027-8424
pubmed:author
pubmed:issnType
Print
pubmed:day
15
pubmed:volume
88
pubmed:geneSymbol
HRAS1
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
11261-5
pubmed:dateRevised
2010-9-10
pubmed:meshHeading
pubmed:year
1991
pubmed:articleTitle
Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.
pubmed:affiliation
Biology Division, Oak Ridge National Laboratory, TN.
pubmed:publicationType
Journal Article, Research Support, U.S. Gov't, Non-P.H.S., Research Support, Non-U.S. Gov't