Switch to
Predicate | Object |
---|---|
rdf:type | |
lifeskim:mentions | |
pubmed:issue |
5
|
pubmed:dateCreated |
1997-12-17
|
pubmed:abstractText |
We present some new ideas for characterizing and comparing large chemical databases. The comparison of the contents of large databases is not trivial since it implies pairwise comparison of hundreds of thousands of compounds. We have developed methods for categorizing compounds into groups or series based on their ring-system content, using precalculated structure-based hashcodes. Two large databases can then be compared by simply comparing their hashcode tables. Furthermore, the number of distinct ring-system combinations can be used as an indicator of database diversity. We also present an independent technique for diversity assessment called the saturation diversity approach. This method is based on picking as many mutually dissimilar compounds as possible from a database or a subset thereof. We show that both methods yield similar results. Since the two methods measure very different properties, this probably says more about the properties of the databases studied than about the methods.
|
pubmed:language |
eng
|
pubmed:journal | |
pubmed:citationSubset |
IM
|
pubmed:chemical | |
pubmed:status |
MEDLINE
|
pubmed:month |
Sep
|
pubmed:issn |
0920-654X
|
pubmed:author | |
pubmed:issnType |
Print
|
pubmed:volume |
11
|
pubmed:owner |
NLM
|
pubmed:authorsComplete |
Y
|
pubmed:pagination |
447-52
|
pubmed:dateRevised |
2000-12-18
|
pubmed:meshHeading | |
pubmed:year |
1997
|
pubmed:articleTitle |
Database diversity assessment: new ideas, concepts, and tools.
|
pubmed:affiliation |
Wyeth-Ayerst Research, Pearl River, NY 10965, USA.
|
pubmed:publicationType |
Journal Article
|