15211504

Source:http://linkedlifedata.com/resource/pubmed/id/15211504

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0036576, umls-concept:C0205554, umls-concept:C0684063, umls-concept:C1171366, umls-concept:C1521840
pubmed:issue	2
pubmed:dateCreated	2004-6-22
pubmed:abstractText	A central goal of structural genomics is to experimentally determine representative structures for all protein families. At least 14 structural genomics pilot projects are currently investigating the feasibility of high-throughput structure determination; the National Institutes of Health funded nine of these in the United States. Initiatives differ in the particular subset of "all families" on which they focus. At the NorthEast Structural Genomics consortium (NESG), we target eukaryotic protein domain families. The automatic target selection procedure has three aims: 1) identify all protein domain families from currently five entirely sequenced eukaryotic target organisms based on their sequence homology, 2) discard those families that can be modeled on the basis of structural information already present in the PDB, and 3) target representatives of the remaining families for structure determination. To guarantee that all members of one family share a common foldlike region, we had to begin by dissecting proteins into structural domain-like regions before clustering. Our hierarchical approach, CHOP, utilizing homology to PrISM, Pfam-A, and SWISS-PROT chopped the 103,796 eukaryotic proteins/ORFs into 247,222 fragments. Of these fragments, 122,999 appeared suitable targets that were grouped into >27,000 singletons and >18,000 multifragment clusters. Thus, our results suggested that it might be necessary to determine >40,000 structures to minimally cover the subset of five eukaryotic proteomes.
pubmed:grant	http://linkedlifedata.com/resource/pubmed/grant/P50 GM52413
pubmed:commentsCorrections	http://linkedlifedata.com/resource/pubmed/commentcorrection/15211504-15211503
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/8700181
pubmed:citationSubset	IM
pubmed:chemical	http://linkedlifedata.com/resource/pubmed/chemical/Arabidopsis Proteins, http://linkedlifedata.com/resource/pubmed/chemical/Caenorhabditis elegans Proteins, http://linkedlifedata.com/resource/pubmed/chemical/Drosophila Proteins, http://linkedlifedata.com/resource/pubmed/chemical/Peptide Fragments, http://linkedlifedata.com/resource/pubmed/chemical/Proteins, http://linkedlifedata.com/resource/pubmed/chemical/Saccharomyces cerevisiae Proteins
pubmed:status	MEDLINE
pubmed:month	Aug
pubmed:issn	1097-0134
pubmed:author	pubmed-author:ActonThomas BTB, pubmed-author:HegyiHediH, pubmed-author:LiuJinfengJ, pubmed-author:MontelioneGaetano TGT, pubmed-author:RostBurkhardB
pubmed:copyrightInfo	Copyright 2004 Wiley-Liss, Inc.
pubmed:issnType	Electronic
pubmed:day	1
pubmed:volume	56
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	188-200
pubmed:dateRevised	2007-11-14
pubmed:meshHeading	pubmed-meshheading:15211504-Algorithms, pubmed-meshheading:15211504-Arabidopsis Proteins, pubmed-meshheading:15211504-Caenorhabditis elegans Proteins, pubmed-meshheading:15211504-Databases, Protein, pubmed-meshheading:15211504-Drosophila Proteins, pubmed-meshheading:15211504-Feasibility Studies, pubmed-meshheading:15211504-Genomics, pubmed-meshheading:15211504-Humans, pubmed-meshheading:15211504-Internet, pubmed-meshheading:15211504-Multigene Family, pubmed-meshheading:15211504-National Institutes of Health (U.S.), pubmed-meshheading:15211504-Peptide Fragments, pubmed-meshheading:15211504-Pilot Projects, pubmed-meshheading:15211504-Protein Conformation, pubmed-meshheading:15211504-Protein Folding, pubmed-meshheading:15211504-Protein Structure, Tertiary, pubmed-meshheading:15211504-Proteins, pubmed-meshheading:15211504-Proteomics, pubmed-meshheading:15211504-Saccharomyces cerevisiae Proteins, pubmed-meshheading:15211504-Sequence Homology, pubmed-meshheading:15211504-United States
pubmed:year	2004
pubmed:articleTitle	Automatic target selection for structural genomics on eukaryotes.
pubmed:affiliation	CUBIC, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York 10032, USA.
pubmed:publicationType	Journal Article, Research Support, U.S. Gov't, P.H.S.