Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
1
pubmed:dateCreated
2007-1-22
pubmed:abstractText
Repetitive sequences are a major constituent of many eukaryote genomes and play roles in gene regulation, chromosome inheritance, nuclear architecture, and genome stability. The identification of repetitive elements has traditionally relied on in-depth, manual curation and computational determination of close relatives based on DNA identity. However, the rapid divergence of repetitive sequence has made identification of repeats by DNA identity difficult even in closely related species. Hence, the presence of unidentified repeats in genome sequences affects the quality of gene annotations and annotation-dependent analyses (e.g. microarray analyses). We have developed an enhanced repeat identification pipeline using two approaches. First, the de novo repeat finding program PILER-DF was used to identify interspersed repetitive elements in several recently finished Dipteran genomes. Repeats were classified, when possible, according to their similarity to known elements described in Repbase and GenBank, and also screened against annotated genes as one means of eliminating false positives. Second, we used a new program called RepeatRunner, which integrates results from both RepeatMasker nucleotide searches and protein searches using BLASTX. Using RepeatRunner with PILER-DF predictions, we masked repeats in thirteen Dipteran genomes and conclude that combining PILER-DF and RepeatRunner greatly enhances repeat identification in both well-characterized and un-annotated genomes.
pubmed:grant
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-10471706, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-11116329, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-11172014, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-11237011, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-11447285, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-12097342, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-12364752, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-12466850, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-12925568, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-14527298, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-14638329, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-15016989, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-15099521, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-1542662, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-15520288, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-15632085, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-15961452, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16024654, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16093699, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16110336, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16354754, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16376497, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16443682, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16518452, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16625209, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-16737559, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-2231712, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-2469002, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-6320712, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-9149143, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-9207116, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-9278062, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-9807830, http://linkedlifedata.com/resource/pubmed/commentcorrection/17137733-9862982
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Mar
pubmed:issn
0378-1119
pubmed:author
pubmed:issnType
Print
pubmed:day
1
pubmed:volume
389
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1-9
pubmed:dateRevised
2011-6-7
pubmed:meshHeading
pubmed:year
2007
pubmed:articleTitle
Improved repeat identification and masking in Dipterans.
pubmed:affiliation
Department of Biology, San Francisco State University, San Francisco, CA, United States. cdsmith@fruitfly.org
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't, Research Support, N.I.H., Extramural