19531739

Source:http://linkedlifedata.com/resource/pubmed/id/19531739

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0017428, umls-concept:C0034754, umls-concept:C0681842, umls-concept:C1444754, umls-concept:C1516240, umls-concept:C1554168, umls-concept:C1704711, umls-concept:C1706907
pubmed:issue	15
pubmed:dateCreated	2009-8-27
pubmed:abstractText	Ultra high-throughput sequencing is used to analyse the transcriptome or interactome at unprecedented depth on a genome-wide scale. These techniques yield short sequence reads that are then mapped on a genome sequence to predict putatively transcribed or protein-interacting regions. We argue that factors such as background distribution, sequence errors, and read length impact on the prediction capacity of sequence census experiments. Here we suggest a computational approach to measure these factors and analyse their influence on both transcriptomic and epigenomic assays. This investigation provides new clues on both methodological and biological issues. For instance, by analysing chromatin immunoprecipitation read sets, we estimate that 4.6% of reads are affected by SNPs. We show that, although the nucleotide error probability is low, it significantly increases with the position in the sequence. Choosing a read length above 19 bp practically eliminates the risk of finding irrelevant positions, while above 20 bp the number of uniquely mapped reads decreases. With our procedure, we obtain 0.6% false positives among genomic locations. Hence, even rare signatures should identify biologically relevant regions, if they are mapped on the genome. This indicates that digital transcriptomics may help to characterize the wealth of yet undiscovered, low-abundance transcripts.
pubmed:commentsCorrections	http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-11590101, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-11981567, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-12213207, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-14871862, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-15539566, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-15562001, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-15661355, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-15973418, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17122850, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17135571, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17504516, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17510325, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17512414, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17556586, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17558387, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17664943, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17687366, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-17709346, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-18178374, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-18243105, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-18599741, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-18660515, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-19029915, http://linkedlifedata.com/resource/pubmed/commentcorrection/19531739-19089307
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/0411011
pubmed:citationSubset	IM
pubmed:status	MEDLINE
pubmed:month	Aug
pubmed:issn	1362-4962
pubmed:author	pubmed-author:BoureuxAnthonyA, pubmed-author:BréhélinLaurentL, pubmed-author:CommesThérèseT, pubmed-author:PhilippeNicolasN, pubmed-author:RivalsEricE, pubmed-author:TarhioJormaJ
pubmed:issnType	Electronic
pubmed:volume	37
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	e104
pubmed:dateRevised	2009-11-18
pubmed:meshHeading	pubmed-meshheading:19531739-Chromatin Immunoprecipitation, pubmed-meshheading:19531739-Chromosome Mapping, pubmed-meshheading:19531739-Gene Expression Profiling, pubmed-meshheading:19531739-Genome, Human, pubmed-meshheading:19531739-Genomics, pubmed-meshheading:19531739-Humans, pubmed-meshheading:19531739-Polymorphism, Single Nucleotide, pubmed-meshheading:19531739-Sequence Analysis, DNA
pubmed:year	2009
pubmed:articleTitle	Using reads to annotate the genome: influence of length, background distribution, and sequence errors on prediction capacity.
pubmed:affiliation	Laboratoire d'Informatique, de Robotique et de Microélectronique, Université de Montpellier II, UMR 5506 CNRS, 34392 Montpellier, France.
pubmed:publicationType	Journal Article, Research Support, Non-U.S. Gov't