Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Migration von ZORA auf die Software DSpace

ZORA will change to a new software on 8th September 2025. Please note: deadline for new submissions is 21th July 2025!

Information & dates for training courses can be found here: Information on Software Migration.

JWSign: A Highly Multilingual Corpus of Bible Translations for more Diversity in Sign Language Processing

Gueuwou, Shester; Siake, Sophie; Leong, Colin; Müller, Mathias (2023). JWSign: A Highly Multilingual Corpus of Bible Translations for more Diversity in Sign Language Processing. In: Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 2023. Association for Computational Linguistics, 9907-9927.

Abstract

Advancements in sign language processing have been hindered by a lack of sufficient data, impeding progress in recognition, translation, and production tasks. The absence of comprehensive sign language datasets across the world's sign languages has widened the gap in this field, resulting in a few sign languages being studied more than others, making this research area extremely skewed mostly towards sign languages from high-income countries. In this work we introduce a new large and highly multilingual dataset for sign language translation: JWSign. The dataset consists of 2,530 hours of Bible translations in 98 sign languages, featuring more than 1,500 individual signers. On this dataset, we report neural machine translation experiments. Apart from bilingual baseline systems, we also train multilingual systems, including some that take into account the typological relatedness of signed or spoken languages. Our experiments highlight that multilingual systems are superior to bilingual baselines, and that in higher-resource scenarios, clustering language pairs that are related improves translation quality.

Additional indexing

Item Type:Conference or Workshop Item (Paper), original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
08 Research Priority Programs > Digital Society Initiative
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:December 2023
Deposited On:20 Jan 2024 14:56
Last Modified:26 Mar 2024 03:40
Publisher:Association for Computational Linguistics
OA Status:Hybrid
Publisher DOI:https://doi.org/10.18653/v1/2023.findings-emnlp.664
Official URL:https://aclanthology.org/2023.findings-emnlp.664
Download PDF  'JWSign: A Highly Multilingual Corpus of Bible Translations for more Diversity in Sign Language Processing'.
Preview
  • Content: Published Version
  • Language: English

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

9 downloads since deposited on 20 Jan 2024
6 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications