Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
4
pubmed:dateCreated
2011-1-26
pubmed:abstractText
Massively parallel DNA sequencing technologies are revolutionizing genomics by making it possible to generate billions of relatively short (~100-base) sequence reads at very low cost. Whereas such data can be readily used for a wide range of biomedical applications, it has proven difficult to use them to generate high-quality de novo genome assemblies of large, repeat-rich vertebrate genomes. To date, the genome assemblies generated from such data have fallen far short of those obtained with the older (but much more expensive) capillary-based sequencing approach. Here, we report the development of an algorithm for genome assembly, ALLPATHS-LG, and its application to massively parallel DNA sequence data from the human and mouse genomes, generated on the Illumina platform. The resulting draft genome assemblies have good accuracy, short-range contiguity, long-range connectivity, and coverage of the genome. In particular, the base accuracy is high (?99.95%) and the scaffold sizes (N50 size = 11.5 Mb for human and 7.2 Mb for mouse) approach those obtained with capillary-based sequencing. The combination of improved sequencing technology and improved computational methods should now make it possible to increase dramatically the de novo sequencing of large genomes. The ALLPATHS-LG program is available at http://www.broadinstitute.org/science/programs/genome-biology/crd.
pubmed:grant
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-11504945, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-12466850, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-15496913, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-16341006, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-17803354, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-18340039, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-18349386, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-18464734, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-18500340, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-18987734, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-19212409, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-19251739, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-19287394, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-19468303, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-19796385, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-19890298, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-20010809, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-20019144, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-20164927, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-20386741, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-20981092, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-6093122, http://linkedlifedata.com/resource/pubmed/commentcorrection/21187386-9521922
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Jan
pubmed:issn
1091-6490
pubmed:author
pubmed:issnType
Electronic
pubmed:day
25
pubmed:volume
108
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1513-8
pubmed:dateRevised
2011-7-25
pubmed:meshHeading
pubmed:year
2011
pubmed:articleTitle
High-quality draft assemblies of mammalian genomes from massively parallel sequence data.
pubmed:affiliation
Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
pubmed:publicationType
Journal Article, Research Support, N.I.H., Extramural