8891959

Source:http://linkedlifedata.com/resource/pubmed/id/8891959

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0020202, umls-concept:C1519249, umls-concept:C1522240, umls-concept:C1561491, umls-concept:C1705914
pubmed:issue	3
pubmed:dateCreated	1997-2-4
pubmed:abstractText	Sequencing by hybridization is a tool to determine a DNA sequence from the unordered list of all l-tuples contained in this sequence; typical numbers for l are l = 8, 10, 12. For theoretical purposes we assume that the multiset of all l-tuples is known. This multiset determines the DNA sequence uniquely if none of the so-called Ukkonen transformations are possible. These transformations require repeats of (l-1)-tuples in the sequence, with these repeats occurring in certain spatial patterns. We model DNA as an i.i.d. sequence. We first prove Poisson process approximations for the process of indicators of all leftmost long repeats allowing self-overlap and for the process of indicators of all left-most long repeats without self-overlap. Using the Chen-Stein method, we get bounds on the error of these approximations. As a corollary, we approximate the distribution of longest repeats. In the second step we analyze the spatial patterns of the repeats. Finally we combine these two steps to prove an approximation for the probability that a random sequence is uniquely recoverable from its list of l-tuples. For all our results we give some numerical examples including error bounds.
pubmed:grant	http://linkedlifedata.com/resource/pubmed/grant/GM 36230
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9433358
pubmed:citationSubset	IM
pubmed:status	MEDLINE
pubmed:issn	1066-5277
pubmed:author	pubmed-author:ArratiaRR, pubmed-author:MartinDD, pubmed-author:ReinertGG, pubmed-author:WatermanM SMS
pubmed:issnType	Print
pubmed:volume	3
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	425-63
pubmed:dateRevised	2007-11-14
pubmed:meshHeading	pubmed-meshheading:8891959-Algorithms, pubmed-meshheading:8891959-Human Genome Project, pubmed-meshheading:8891959-Humans, pubmed-meshheading:8891959-Nucleic Acid Hybridization, pubmed-meshheading:8891959-Poisson Distribution, pubmed-meshheading:8891959-Repetitive Sequences, Nucleic Acid, pubmed-meshheading:8891959-Sequence Analysis, DNA
pubmed:year	1996
pubmed:articleTitle	Poisson process approximation for sequence repeats, and sequencing by hybridization.
pubmed:affiliation	Department of Mathematics, University of Southern California, Los Angeles 90089-1113, USA.
pubmed:publicationType	Journal Article, Research Support, U.S. Gov't, P.H.S., Research Support, U.S. Gov't, Non-P.H.S.