Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
11
pubmed:dateCreated
2002-11-8
pubmed:abstractText
MOTIVATION: During the process of high-throughput genome sequencing there are opportunities for mixups of reagents and data associated with particular projects. The sequencing templates or sequence data generated for an assembly may become contaminated with reagents or sequences from another project, resulting in poorer quality and inaccurate assemblies. RESULTS: We have developed a system to assess sequence assemblies and monitor for laboratory mixups. We describe several methods for testing the consistency of assemblies and resolving mixed ones. We use statistical tests to evaluate the distribution of sequencing reads from different plates into contigs, and a graph-based approach to resolve situations where data has been inappropriately combined. While these methods have been designed for use in a high-throughput DNA sequencing environment processing thousands of clones, they can be applied in any situation where distinct sequencing projects are performed at redundant coverage.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Nov
pubmed:issn
1367-4803
pubmed:author
pubmed:issnType
Print
pubmed:volume
18
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1418-26
pubmed:dateRevised
2006-11-15
pubmed:meshHeading
pubmed-meshheading:12424111-Algorithms, pubmed-meshheading:12424111-Artifacts, pubmed-meshheading:12424111-Computer Simulation, pubmed-meshheading:12424111-Contig Mapping, pubmed-meshheading:12424111-Documentation, pubmed-meshheading:12424111-Equipment Failure Analysis, pubmed-meshheading:12424111-Human Genome Project, pubmed-meshheading:12424111-Humans, pubmed-meshheading:12424111-Models, Biological, pubmed-meshheading:12424111-Models, Statistical, pubmed-meshheading:12424111-Oligonucleotide Array Sequence Analysis, pubmed-meshheading:12424111-Quality Control, pubmed-meshheading:12424111-Reproducibility of Results, pubmed-meshheading:12424111-Research Design, pubmed-meshheading:12424111-Sensitivity and Specificity, pubmed-meshheading:12424111-Sequence Alignment, pubmed-meshheading:12424111-Sequence Analysis, DNA
pubmed:year
2002
pubmed:articleTitle
Identification of mixups among DNA sequencing plates.
pubmed:affiliation
Whitehead Institute, Center for Genome Research, 320 Charles Street, Cambridge MA 02141, USA. nick@genome.wi.mit.edu
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't