Statements in which the resource exists as a subject.
Predicate Object
rdf:type
lifeskim:mentions
pubmed:issue
8
pubmed:dateCreated
2002-8-14
pubmed:abstractText
Repetitive sequences make up a major part of eukaryotic genomes. We have developed an approach for the de novo identification and classification of repeat sequence families that is based on extensions to the usual approach of single linkage clustering of local pairwise alignments between genomic sequences. Our extensions use multiple alignment information to define the boundaries of individual copies of the repeats and to distinguish homologous but distinct repeat element families. When tested on the human genome, our approach was able to properly identify and group known transposable elements. The program should be useful for first-pass automatic classification of repeats in newly sequenced genomes.
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-10222408, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-10592242, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-10702296, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-11237011, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-11408628, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-12176921, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-15739260, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-1909376, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-6245369, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-7366731, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-8019419, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-8808577, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-9545450, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-9582191, http://linkedlifedata.com/resource/pubmed/commentcorrection/12176934-9851916
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Aug
pubmed:issn
1088-9051
pubmed:author
pubmed:issnType
Print
pubmed:volume
12
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1269-76
pubmed:dateRevised
2010-11-18
pubmed:meshHeading
pubmed:year
2002
pubmed:articleTitle
Automated de novo identification of repeat sequence families in sequenced genomes.
pubmed:affiliation
Howard Hughes Medical Institute and Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA.
pubmed:publicationType
Journal Article; Research Support, U.S. Gov't, Non-P.H.S.; Research Support, Non-U.S. Gov't