Zurich Open Repository and Archive

Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-64980

Clematide, Simon; Gindl, Stefan; Klenner, Manfred; Petrakis, Stefanos; Remus, Robert; Ruppenhofer, Josef; Waltinger, Ulli; Wiegand, Michael (2012). MLSA - A Multi-layered Reference Corpus for German Sentiment Analysis. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, 23 May 2012 - 25 May 2012.

Abstract

In this paper, we describe MLSA, a publicly available multi-layered reference corpus for German-language sentiment analysis. The construction of the corpus is based on the manual annotation of 270 German-language sentences considering three different layers of granularity. The sentence-layer annotation, as the most coarse-grained annotation, focuses on aspects of objectivity, subjectivity and the overall polarity of the respective sentences. Layer 2 is concerned with polarity on the word- and phrase-level, annotating both subjective and factual language. The annotations on Layer 3 focus on the expression-level, denoting frames of private states such as objective and direct speech events. These three layers and their respective annotations are intended to be fully independent of each other. At the same time, it should also be possible to explore and discover interactions that may exist between different layers. The reliability of the respective annotations was assessed using the average pairwise agreement and Fleiss' multi-rater measures. We believe that MLSA is a beneficial resource for sentiment analysis research, algorithms and applications that focus on the German language.
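
The two reliability measures named in the abstract can be illustrated with a small sketch. This is not the authors' implementation; the function names, the label set and the toy data are assumptions made for illustration only. It assumes every sentence was labelled by the same number of annotators and uses the standard Fleiss' kappa formula.

```python
from itertools import combinations


def average_pairwise_agreement(ratings):
    """Mean share of items on which two annotators agree, averaged over all annotator pairs.

    ratings: list of items, each a list with one label per annotator (same order everywhere).
    """
    n_annotators = len(ratings[0])
    pairs = list(combinations(range(n_annotators), 2))
    total = 0.0
    for a, b in pairs:
        matches = sum(1 for item in ratings if item[a] == item[b])
        total += matches / len(ratings)
    return total / len(pairs)


def fleiss_kappa(ratings):
    """Fleiss' kappa for a fixed number of raters per item (standard formula)."""
    n_items = len(ratings)
    n_raters = len(ratings[0])
    categories = sorted({label for item in ratings for label in item})

    # counts[i][j] = number of raters who assigned category j to item i
    counts = [[item.count(c) for c in categories] for item in ratings]

    # Observed agreement: mean over items of the per-item agreement P_i
    p_items = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(p_items) / n_items

    # Expected agreement from the overall category proportions
    p_categories = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(len(categories))
    ]
    p_expected = sum(p * p for p in p_categories)

    if p_expected == 1.0:
        # Only one category was ever used; kappa is undefined in that case
        return float("nan")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical toy data: 4 sentences, 3 annotators, sentence-level polarity labels
toy_ratings = [
    ["pos", "pos", "pos"],
    ["neg", "neg", "pos"],
    ["neu", "neu", "neu"],
    ["neg", "neg", "neg"],
]
print(average_pairwise_agreement(toy_ratings))
print(fleiss_kappa(toy_ratings))
```

Pairwise agreement reports raw overlap, while Fleiss' kappa corrects for the agreement expected by chance, which is one reason to report both.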

Downloads

70 downloads since deposited on 05 Oct 2012
38 downloads in the past 12 months

Additional indexing

Item Type: Conference or Workshop Item (Paper), refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
DDC: 410 Linguistics; 000 Computer science, knowledge & systems
Language: English
Event End Date: 25 May 2012
Deposited On: 05 Oct 2012 06:45
Last Modified: 18 Oct 2012 20:32
Free access at: Official URL. An embargo period may apply.
Official URL: http://www.lrec-conf.org/proceedings/lrec2012/index.html
