9783226

Source:http://linkedlifedata.com/resource/pubmed/id/9783226

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0009085, umls-concept:C0205554, umls-concept:C0600510, umls-concept:C0600644, umls-concept:C1337108, umls-concept:C1420459, umls-concept:C1706853, umls-concept:C1879748
pubmed:dateCreated	1998-12-18
pubmed:abstractText	The availability of large EST (Expressed Sequence Tag) databases has led to a revolution in the way new genes are cloned. Difficulties arise, however, due to high error rates and redundancy of raw EST data. For these reasons, one of the first tasks performed by a scientist investigating any EST of interest is to gather contiguous ESTs and assemble them into a larger virtual cDNA. The REX (Recursive EST eXtender) algorithm described in this paper completely automates this process by finding ESTs that can be clustered on the basis of overlapping bases, and then assembling the contigs into a consensus sequence. By combining the clustering and assembly steps, REX can quickly generate assemblies from EST databases that are frequently updated without having to preprocess the data. A consensus assembly method is used to correct miscalled bases and remove indel errors. A unique feature of this method is that it addresses the issues of splice variants and unspliced cDNA data. Since REX is a fast greedy algorithm, it can address the problem of generating a database of assembled sequences from very large collections of EST data. A procedure is described for creating and maintaining an Assembled Consensus EST database (ACE) that is useful for characterizing the large body of data that exists in EST databases.
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9509125
pubmed:citationSubset	IM
pubmed:chemical	http://linkedlifedata.com/resource/pubmed/chemical/DNA, Complementary, http://linkedlifedata.com/resource/pubmed/chemical/Growth Substances, http://linkedlifedata.com/resource/pubmed/chemical/INSL4 protein, human, http://linkedlifedata.com/resource/pubmed/chemical/Intercellular Signaling Peptides...
pubmed:status	MEDLINE
pubmed:issn	1553-0833
pubmed:author	pubmed-author:ConklinDD, pubmed-author:VikI LIL
pubmed:issnType	Print
pubmed:volume	6
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	203-11
pubmed:dateRevised	2004-11-17
pubmed:meshHeading	pubmed-meshheading:9783226-Algorithms, pubmed-meshheading:9783226-Artificial Intelligence, pubmed-meshheading:9783226-Base Sequence, pubmed-meshheading:9783226-Consensus Sequence, pubmed-meshheading:9783226-DNA, Complementary, pubmed-meshheading:9783226-Data Interpretation, Statistical, pubmed-meshheading:9783226-Databases, Factual, pubmed-meshheading:9783226-Expressed Sequence Tags, pubmed-meshheading:9783226-Growth Substances, pubmed-meshheading:9783226-Humans, pubmed-meshheading:9783226-Intercellular Signaling Peptides and Proteins, pubmed-meshheading:9783226-Molecular Sequence Data
pubmed:year	1998
pubmed:articleTitle	Automated clustering and assembly of large EST collections.
pubmed:affiliation	ZymoGenetics, Inc., Seattle, WA 98102, USA. yee,conklin@zgi.com
pubmed:publicationType	Journal Article