Navigation auf zora.uzh.ch

Search

ZORA (Zurich Open Repository and Archive)

Detecting Code-Switching in a Multilingual Alpine Heritage Corpus

Volk, Martin; Clematide, Simon (2014). Detecting Code-Switching in a Multilingual Alpine Heritage Corpus. In: Proceedings of the First Workshop on Computational Approaches to Code Switching, Doha, Qatar, 25 October 2014. Association for Computational Linguistics, 24-33.

Abstract

This paper describes experiments in detecting and annotating code-switching in a large multilingual diachronic corpus of Swiss Alpine texts. The texts are in English, French, German, Italian, Romansh and Swiss German. Because of the multilingual authors (mountaineers, scientists) and the assumed multilingual readers, the texts contain numerous code-switching elements. When building and annotating the corpus, we faced issues of language identification on the sentence and sub-sentential level. We present our strategy for language identification and for the annotation of foreign language fragments within sentences. We report 78% precision on detecting a subset of code-switches with correct language labels and 92% unlabeled precision.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
08 Research Priority Programs > Language and Space
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Scopus Subject Areas:Physical Sciences > Computational Theory and Mathematics
Physical Sciences > Computer Vision and Pattern Recognition
Physical Sciences > Information Systems
Language:English
Event End Date:25 October 2014
Deposited On:11 Nov 2014 15:11
Last Modified:27 Jan 2022 08:04
Publisher:Association for Computational Linguistics
ISBN:978-1-937284-96-1
Funders:Swiss National Science Foundation grant CRSII2_147653/1: MODERN: Modelling discourse entities and relations for coherent machine translation
OA Status:Green
Free access at:Official URL. An embargo period may apply.
Publisher DOI:https://doi.org/10.3115/v1/W14-3903
Official URL:http://www.aclweb.org/anthology/W14-39
Project Information:
  • Funder: SNSF
  • Grant ID:
  • Project Title: Swiss National Science Foundation grant CRSII2_147653/1: MODERN: Modelling discourse entities and relations for coherent machine translation
Download PDF  'Detecting Code-Switching in a Multilingual Alpine Heritage Corpus'.
Preview
  • Content: Accepted Version
  • Language: English

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

378 downloads since deposited on 11 Nov 2014
31 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications