Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-46738
Rinaldi, F; Clematide, S; Schneider, G; Romacker, M; Vachon, Th (2010). ODIN: an advanced interface for the curation of biomedical literature. In: Biocuration 2010, Tokyo, Japan, 11 October 2010 - 14 October 2010.
We present ODIN (Ontogene Document INspector): a system for interactive curation of biomedical
literature, developed within the scope of the SASEBio project (Semi-Automated Semantic Enrichment of the Biomedical Literature), as a collaboration between the OntoGene group at the University of Zurich and the NITAS/TMS group of Novartis Pharma AG. The purpose of the system is to allow a human annotator/curator to leverage upon the results of an advanced text mining system in order to enhance the speed and effectiveness of the annotation process.
The OntoGene system takes as input a document (e.g a full paper from PubMed Central) and processes it with a custom NLP pipeline, which includes Named Entity recognition and relation extraction. Entities which are currently supported include proteins, genes, experimental methods, cell lines, species. Entities detected in the input document are disambiguated with respect to a reference database (UniProt, EntrezGene, NCBI taxonomy, PSI-MI ontology). The annotated documents are handed back to the ODIN interface, which allows multiple display modalities. The curator/annotator can view the whole document with in-line annotations highlighted, or can browse the extracted entities and be pointed back to the mentions of the entities within the original document. All entity mentions are entirely editable: the curator can easily add or delete any of them, and also change their extent (i.e. add/remove words to its right or left) with a simple click of the mouse. Different entity views are supported, with sorting capabilities according to different criteria (entity type, entity mention, confidence score, etc.). Selective highlighting of text units (e.g. sentences containing desired entities) is supported. Additionally, extensive logging functionalities are provided. All documents and entities are fully interlinked to reference databases, for the purpose of simplified inspection. Entities can be grouped in classes (e.g. by species) and actions can be applied to whole classes, for selective editing or removal.
|Item Type:||Conference or Workshop Item (Other), refereed, original work|
|Communities & Collections:||06 Faculty of Arts > Institute of Computational Linguistics|
|DDC:||000 Computer science, knowledge & systems|
|Event End Date:||14 October 2010|
|Deposited On:||24 Feb 2011 14:21|
|Last Modified:||20 Oct 2012 07:23|
|Free access at:||Publisher DOI. An embargo period may apply.|
|Related URLs:||http://hinv.jp/biocuration2010/ (Organisation)|
Users (please log in): suggest update or correction for this item
Repository Staff Only: item control page