Quick Search:

uzh logo
Browse by:
bullet
bullet
bullet
bullet

Zurich Open Repository and ArchiveĀ 

Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-46738

Rinaldi, F; Clematide, S; Schneider, G; Romacker, M; Vachon, Th (2010). ODIN: an advanced interface for the curation of biomedical literature. In: Biocuration 2010, Tokyo, Japan, 11 October 2010 - 14 October 2010.

[img]
Preview
PDF
1253Kb

Abstract

We present ODIN (Ontogene Document INspector): a system for interactive curation of biomedical
literature, developed within the scope of the SASEBio project (Semi-Automated Semantic Enrichment of the Biomedical Literature), as a collaboration between the OntoGene group at the University of Zurich and the NITAS/TMS group of Novartis Pharma AG. The purpose of the system is to allow a human annotator/curator to leverage upon the results of an advanced text mining system in order to enhance the speed and effectiveness of the annotation process.

The OntoGene system takes as input a document (e.g a full paper from PubMed Central) and processes it with a custom NLP pipeline, which includes Named Entity recognition and relation extraction. Entities which are currently supported include proteins, genes, experimental methods, cell lines, species. Entities detected in the input document are disambiguated with respect to a reference database (UniProt, EntrezGene, NCBI taxonomy, PSI-MI ontology). The annotated documents are handed back to the ODIN interface, which allows multiple display modalities. The curator/annotator can view the whole document with in-line annotations highlighted, or can browse the extracted entities and be pointed back to the mentions of the entities within the original document. All entity mentions are entirely editable: the curator can easily add or delete any of them, and also change their extent (i.e. add/remove words to its right or left) with a simple click of the mouse. Different entity views are supported, with sorting capabilities according to different criteria (entity type, entity mention, confidence score, etc.). Selective highlighting of text units (e.g. sentences containing desired entities) is supported. Additionally, extensive logging functionalities are provided. All documents and entities are fully interlinked to reference databases, for the purpose of simplified inspection. Entities can be grouped in classes (e.g. by species) and actions can be applied to whole classes, for selective editing or removal.

Item Type:Conference or Workshop Item (Other), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
DDC:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:14 October 2010
Deposited On:24 Feb 2011 14:21
Last Modified:20 Oct 2012 07:23
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:10.1038/npre.2010.5169.1
Related URLs:http://hinv.jp/biocuration2010/ (Organisation)

Users (please log in): suggest update or correction for this item

Repository Staff Only: item control page