
Combining parallel treebanks and geo-tagging


Volk, M; Goehring, A; Marek, T (2010). Combining parallel treebanks and geo-tagging. In: Fourth Linguistic Annotation Workshop (LAW IV), Uppsala, 15 July 2010 - 16 July 2010.

Abstract

This paper describes a new kind of semantic annotation in parallel treebanks. We build French-German parallel treebanks of mountaineering reports, a text genre that abounds with geographical names, which we classify and ground with reference to a large gazetteer of Swiss toponyms. We discuss the challenges in obtaining high recall and precision in automatic grounding, and sketch how we represent the grounding information in our treebank.

Additional indexing

Item Type: Conference or Workshop Item (Paper), refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Language: English
Event End Date: 16 July 2010
Deposited On: 31 May 2010 07:24
Last Modified: 05 Apr 2016 14:05
Permanent URL: http://doi.org/10.5167/uzh-33378
