OntoPDF: using a text mining pipeline to generate enriched PDF versions of scientific papers


Zhu, Yi; Rinaldi, Fabio (2014). OntoPDF: using a text mining pipeline to generate enriched PDF versions of scientific papers. In: 6th International Symposium on Semantic Mining in Biomedicine, Aveiro, Portugal, 6 October 2014 - 7 October 2014, 85-89.

Abstract

In this poster we present a recent extension of the OntoGene text mining utilities that generates annotated PDF versions of the original articles. A text-based view (in XML or HTML) allows a more flexible presentation of the results of a text mining pipeline, but for some applications, notably assisted curation, it may be preferable to present the annotations in the context of the original PDF document.

Downloads

44 downloads since deposited on 04 Dec 2014
28 downloads in the last 12 months

Additional indexing

Item Type: Conference or Workshop Item (Paper), not refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Language: English
Event End Date: 7 October 2014
Deposited On: 04 Dec 2014 17:04
Last Modified: 05 Apr 2016 18:35
Publisher: s.n.
Permanent URL: https://doi.org/10.5167/uzh-101881

Download

Content: Published Version
Language: English
Filetype: PDF
Size: 931kB
