Predicate | Object |
---|---|
rdf:type | |
lifeskim:mentions | |
pubmed:issue | 4 |
pubmed:dateCreated | 1994-3-29 |
pubmed:abstractText | A statistical analysis designed to deal with the problem of identifying rare or abundant "words" of arbitrary length in genomic fragments is presented. Our approach has the novelty of taking into account the statistical role of the presence of shorter words nested into longer ones and of introducing a Bayesian correction to minimize the effects of statistical fluctuations and of possible mistakes in genomic data. The method is successfully used in a thorough analysis of the abundance of short nucleotide sequences in the Escherichia coli genome. |
pubmed:language | eng |
pubmed:journal | |
pubmed:citationSubset | IM |
pubmed:status | MEDLINE |
pubmed:month | Dec |
pubmed:issn | 0022-5193 |
pubmed:author | |
pubmed:issnType | Print |
pubmed:day | 21 |
pubmed:volume | 165 |
pubmed:owner | NLM |
pubmed:authorsComplete | Y |
pubmed:pagination | 659-72 |
pubmed:dateRevised | 2006-11-15 |
pubmed:meshHeading | |
pubmed:year | 1993 |
pubmed:articleTitle | An improved method for detection of words with unusual occurrence frequency in nucleotide sequences. |
pubmed:affiliation | Dipartimento di Scienze Biochimiche, Università degli Studi di Roma, La Sapienza, Italia. |
pubmed:publicationType | Journal Article, Research Support, Non-U.S. Gov't |