Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:dateCreated
2007-10-22
pubmed:abstractText
To understand the regulation of the gene expression, the identification of transcription start sites (TSSs) is a primary and important step. With the aim to improve the computational prediction accuracy, we focus on the most challenging task, i.e., to identify the TSSs within 50 bp in non-CpG related promoter regions. Due to the diversity of non-CpG related promoters, a large number of features are extracted. Effective feature selection can minimize the noise, improve the prediction accuracy, and also to discover biologically meaningful intrinsic properties. In this paper, a newly proposed multi-objective simulated annealing based optimization method, Archive Multi-Objective Simulated Annealing (AMOSA), is integrated with Linear Discriminant Analysis (LDA) to yield a combined feature selection and classification system. This system is found to be comparable to, often better than, several existing methods in terms of different quantitative performance measures.
pubmed:grant
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:issn
1752-7791
pubmed:author
pubmed:issnType
Print
pubmed:volume
6
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
183-93
pubmed:dateRevised
2010-12-3
pubmed:meshHeading
pubmed:year
2007
pubmed:articleTitle
Prediction of transcription start sites based on feature selection using AMOSA.
pubmed:affiliation
Bioinformatics Division, TNLIST and Department of Automation, Tsinghua Univ., Beijing 100084, China.
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't, Research Support, N.I.H., Extramural