Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-29272
Kappeler, T; Kaljurand, K; Rinaldi, F (2009). TX Task: Automatic detection of focus organisms in biomedical publications. In: Workshop on BioNLP, Boulder, June 2009 - June 2009, 80-88.
In biomedical information extraction (IE), a central problem is the disambiguation of ambiguous names for domain specific entities, such as proteins, genes, etc. One important dimension of ambiguity is the organism to which the entities belong: in order to disambiguate an ambiguous entity name (e.g. a protein), it is often necessary to identify the specific organism to which it refers. In this paper we present an approach to the detection and disambiguation of the focus organism(s), i.e. the organism(s) which are the subject of the research described in scientific papers, which can then be used for the disambiguation of other entities. The results are evaluated against a gold standard derived from IntAct annotations. The evaluation suggests that the results may already be useful within a curation environment and are certainly a baseline for more complex approaches.
57 downloads since deposited on 02 Feb 2010
27 downloads since 12 months
|Item Type:||Conference or Workshop Item (Paper), refereed, original work|
|Communities & Collections:||06 Faculty of Arts > Institute of Computational Linguistics|
|Dewey Decimal Classification:||000 Computer science, knowledge & systems
|Event End Date:||June 2009|
|Deposited On:||02 Feb 2010 15:58|
|Last Modified:||09 Jul 2012 04:11|
Users (please log in): suggest update or correction for this item
Repository Staff Only: item control page