Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
6 Pt 1
pubmed:dateCreated
2005-10-24
pubmed:abstractText
The classification of human gene sequences into exons and introns is a difficult problem in DNA sequence analysis. In this paper, we define a set of features, called the simple Z (SZ) features, which is derived from the Z-curve features for the recognition of human exons and introns. The classification results show that SZ features, while fewer in numbers (three in total), can preserve the high recognition rate of the original nine Z-curve features. Since the size of SZ features is one-third of the Z-curve features, the dimensionality of the feature space is much smaller, and better recognition efficiency is achieved. If the stop codon feature is used together with the three SZ features, a recognition rate of up to 92% for short sequences of length <140 bp can be obtained.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Jun
pubmed:issn
1539-3755
pubmed:author
pubmed:issnType
Print
pubmed:volume
67
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
061916
pubmed:meshHeading
pubmed:year
2003
pubmed:articleTitle
Classification of short human exons and introns based on statistical features.
pubmed:affiliation
Department of Computer Engineering and Information Technology, City University of Hong Kong, Kowloon, Hong Kong. itwyh@cityu.edu.hk
pubmed:publicationType
Journal Article