Header

UZH-Logo

Maintenance Infos

X-Stance: A Multilingual Multi-Target Dataset for Stance Detection


Vamvas, Jannis; Sennrich, Rico (2020). X-Stance: A Multilingual Multi-Target Dataset for Stance Detection. In: 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), Zurich, 23 June 2020 - 25 June 2020.

Abstract

We extract a large-scale stance detection dataset from comments written by candidates of elections in Switzerland. The dataset consists of German, French and Italian text, allowing for a cross-lingual evaluation of stance detection. It contains 67 000 comments on more than 150 political issues (targets). Unlike stance detection models that have specific target issues, we use the dataset to train a single model on all the issues. To make learning across targets possible, we prepend to each instance a natural question that represents the target (e.g. «Do you support X?»). Baseline results from multilingual BERT show that zero-shot cross-lingual and cross-target transfer of stance detection is moderately successful with this approach.

Abstract

We extract a large-scale stance detection dataset from comments written by candidates of elections in Switzerland. The dataset consists of German, French and Italian text, allowing for a cross-lingual evaluation of stance detection. It contains 67 000 comments on more than 150 political issues (targets). Unlike stance detection models that have specific target issues, we use the dataset to train a single model on all the issues. To make learning across targets possible, we prepend to each instance a natural question that represents the target (e.g. «Do you support X?»). Baseline results from multilingual BERT show that zero-shot cross-lingual and cross-target transfer of stance detection is moderately successful with this approach.

Statistics

Downloads

8 downloads since deposited on 22 Jun 2020
8 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:25 June 2020
Deposited On:22 Jun 2020 09:53
Last Modified:23 Jun 2020 11:00
Publisher:CEUR Workshop Proceedings
OA Status:Green
Free access at:Official URL. An embargo period may apply.
Official URL:http://ceur-ws.org/Vol-2624/paper9.pdf
Project Information:

Download

Green Open Access

Download PDF  'X-Stance: A Multilingual Multi-Target Dataset for Stance Detection'.
Preview
Content: Published Version
Language: English
Filetype: PDF
Size: 606kB
Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)