15621661

Source:http://linkedlifedata.com/resource/pubmed/id/15621661

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0017428, umls-concept:C0025914, umls-concept:C0026809, umls-concept:C0034869, umls-concept:C0162326, umls-concept:C0750572, umls-concept:C1514811, umls-concept:C1521828, umls-concept:C1554168, umls-concept:C1561577, umls-concept:C1706462
pubmed:issue	5-6
pubmed:dateCreated	2004-12-28
pubmed:abstractText	We estimate DNA sequence error rates in Genbank records containing protein-coding and non-coding DNA sequences by comparing sequences of the inbred mouse strain C57BL/6J, sequenced as part of the mouse genome project and independently by other laboratories. C57BL/6J was produced by more than 100 generations of brother-sister mating, and can be assumed to be virtually free of residual polymorphism and mutational variation, so differences between independent sequences can be attributed to error. The estimated single nucleotide error rate for coding DNA is 0.10% (SE 0.012%), which is substantially lower than previous estimates for error rates in Genbank accessions. The estimated single nucleotide error rate for intronic DNA sequences (0.22%; SE 0.051%) is significantly higher than the rate for coding DNA. Since error rates for the mouse genome sequence are very low, the vast majority of the errors we detected are likely to be in individual Genbank accessions. The frequency of insertion-deletion (indel) errors in non-coding DNA approaches that of single nucleotide errors in non-coding DNA, whereas indel errors are uncommon in coding sequences.
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9107800
pubmed:citationSubset	IM
pubmed:status	MEDLINE
pubmed:issn	1042-5179
pubmed:author	pubmed-author:GaffneyDaniel JDJ, pubmed-author:KeightleyPeter DPD, pubmed-author:WeschePhilipp LPL
pubmed:issnType	Print
pubmed:volume	15
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	362-4
pubmed:dateRevised	2006-11-15
pubmed:meshHeading	pubmed-meshheading:15621661-Animals, pubmed-meshheading:15621661-Base Sequence, pubmed-meshheading:15621661-Databases, Nucleic Acid, pubmed-meshheading:15621661-Mice, pubmed-meshheading:15621661-Mice, Inbred C57BL, pubmed-meshheading:15621661-Research Design, pubmed-meshheading:15621661-Sequence Analysis, DNA
pubmed:articleTitle	DNA sequence error rates in Genbank records estimated using the mouse genome as a reference.
pubmed:affiliation	University of Edinburgh, School of Biological Sciences, Ashworth Laboratories, UK.
pubmed:publicationType	Journal Article, Comparative Study