Statements in which the resource exists as a subject.
Predicate Object
rdf:type
lifeskim:mentions
pubmed:issue
10
pubmed:dateCreated
2007-5-29
pubmed:abstractText
Redundant protein sequences in biological databases hinder sequence similarity searches and make interpretation of search results difficult. Clustering of protein sequence space based on sequence similarity helps organize all sequences into manageable datasets and reduces sampling bias and overrepresentation of sequences.
pubmed:grant
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
May
pubmed:issn
1367-4811
pubmed:author
pubmed:issnType
Electronic
pubmed:day
15
pubmed:volume
23
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1282-8
pubmed:dateRevised
2009-11-4
pubmed:meshHeading
pubmed:year
2007
pubmed:articleTitle
UniRef: comprehensive and non-redundant UniProt reference clusters.
pubmed:affiliation
Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC 20007, USA. bes23@georgetown.edu
pubmed:publicationType
Journal Article; Research Support, N.I.H., Extramural