Header

UZH-Logo

Maintenance Infos

Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland


Lemmenmeier-Batinić, Dolores; Batinić, Josip; Escher, Anastasia (2023). Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland. Language Resources and Evaluation, 57(4):1607-1644.

Abstract

In this paper, we present a corpus for heritage Bosnian/Croatian/Montenegrin/Serbian (BCMS) spoken in German-speaking Switzerland. The corpus consists of elicited conversations between 29 second-generation speakers originating from different regions of former Yugoslavia. In total, the corpus contains 30 turn-aligned transcripts with an average length of 6 min. It is enriched with extensive speakers’ metadata, annotations, and pre-calculated corpus counts. The corpus can be accessed through an interactive corpus platform that allows for browsing, querying, and filtering, but also for creating and sharing custom annotations. Principal user groups we address with this corpus are researchers of heritage BCMS, as well as students and teachers of BCMS living in diaspora. In addition to introducing the corpus platform and the workflows we adopted to create it, we also present a case study on BCMS spoken by a pair of siblings who participated in the map task, and discuss advantages and challenges of using this corpus platform for linguistic research.

Abstract

In this paper, we present a corpus for heritage Bosnian/Croatian/Montenegrin/Serbian (BCMS) spoken in German-speaking Switzerland. The corpus consists of elicited conversations between 29 second-generation speakers originating from different regions of former Yugoslavia. In total, the corpus contains 30 turn-aligned transcripts with an average length of 6 min. It is enriched with extensive speakers’ metadata, annotations, and pre-calculated corpus counts. The corpus can be accessed through an interactive corpus platform that allows for browsing, querying, and filtering, but also for creating and sharing custom annotations. Principal user groups we address with this corpus are researchers of heritage BCMS, as well as students and teachers of BCMS living in diaspora. In addition to introducing the corpus platform and the workflows we adopted to create it, we also present a case study on BCMS spoken by a pair of siblings who participated in the map task, and discuss advantages and challenges of using this corpus platform for linguistic research.

Statistics

Citations

Altmetrics

Downloads

11 downloads since deposited on 30 May 2023
9 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Slavonic Studies
Dewey Decimal Classification:490 Other languages
410 Linguistics
Scopus Subject Areas:Social Sciences & Humanities > Language and Linguistics
Social Sciences & Humanities > Education
Social Sciences & Humanities > Linguistics and Language
Social Sciences & Humanities > Library and Information Sciences
Uncontrolled Keywords:Interactive corpus platform, Spoken language, Heritage speakers, Bosnian/Croatian/Montenegrin/Serbian (BCMS)
Language:English
Date:1 December 2023
Deposited On:30 May 2023 13:35
Last Modified:28 Jun 2024 01:44
Publisher:Springer
ISSN:1574-020X
OA Status:Hybrid
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.1007/s10579-023-09634-7
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)