rdf:type |
|
lifeskim:mentions |
|
pubmed:dateCreated |
2006-9-5
|
pubmed:abstractText |
Document classification is a wide-spread problem with many applications, from organizing search engine snippets to spam filtering. We previously described Textpresso, a text-mining system for biological literature, which marks up full text according to a shallow ontology that includes terms of biological interest. This project investigates document classification in the context of biological literature, making use of the Textpresso markup of a corpus of Caenorhabditis elegans literature.
|
pubmed:grant |
|
pubmed:commentsCorrections |
|
pubmed:language |
eng
|
pubmed:journal |
|
pubmed:citationSubset |
IM
|
pubmed:status |
MEDLINE
|
pubmed:issn |
1471-2105
|
pubmed:author |
|
pubmed:issnType |
Electronic
|
pubmed:volume |
7
|
pubmed:owner |
NLM
|
pubmed:authorsComplete |
Y
|
pubmed:pagination |
370
|
pubmed:dateRevised |
2009-11-18
|
pubmed:meshHeading |
|
pubmed:year |
2006
|
pubmed:articleTitle |
Automatic document classification of biological literature.
|
pubmed:affiliation |
Division of Biology and Howard Hughes Medical Institute, California Institute of Technology, Pasadena, California, USA. davidc@caltech.edu
|
pubmed:publicationType |
Journal Article,
Research Support, Non-U.S. Gov't,
Research Support, N.I.H., Extramural
|