rdf:type |
|
lifeskim:mentions |
|
pubmed:dateCreated |
2008-3-4
|
pubmed:abstractText |
A biomedical entity mention in articles and other free texts is often ambiguous. For example, 13% of the gene names (aliases) might refer to more than one gene. The task of Gene Symbol Disambiguation (GSD) - a special case of Word Sense Disambiguation (WSD) - is to assign a unique gene identifier for all identified gene name aliases in biology-related articles. Supervised and unsupervised machine learning WSD techniques have been applied in the biomedical field with promising results. We examine here the utilisation potential of the fact - one of the special features of biological articles - that the authors of the documents are known through graph-based semi-supervised methods for the GSD task.
|
pubmed:commentsCorrections |
|
pubmed:language |
eng
|
pubmed:journal |
|
pubmed:citationSubset |
IM
|
pubmed:status |
MEDLINE
|
pubmed:issn |
1471-2105
|
pubmed:author |
|
pubmed:issnType |
Electronic
|
pubmed:volume |
9
|
pubmed:owner |
NLM
|
pubmed:authorsComplete |
Y
|
pubmed:pagination |
69
|
pubmed:dateRevised |
2009-11-18
|
pubmed:meshHeading |
|
pubmed:year |
2008
|
pubmed:articleTitle |
The strength of co-authorship in gene name disambiguation.
|
pubmed:affiliation |
Hungarian Academy of Science, Research Group on Artificial Intelligence, Aradi vertanuk tere, Szeged, Hungary. rfarkas@inf.u-szeged.hu
|
pubmed:publicationType |
Journal Article
|