Header

UZH-Logo

Maintenance Infos

Machine Translation of 16th Century Letters from Latin to German


Fischer, Lukas; Scheurer, Patricia; Schwitter, Raphael; Volk, Martin (2022). Machine Translation of 16th Century Letters from Latin to German. In: Second Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2022), Marseille, 25 June 2022. LREC, 43-50.

Abstract

This paper outlines our work in collecting training data for and developing a Latin–German Neural Machine Translation (NMT) system, for translating 16th century letters. While Latin–German is a low-resource language pair in terms of NMT, the domain of 16th century epistolary Latin is even more limited in this regard. Through our efforts in data collection and data generation, we are able to train a NMT model that provides good translations for short to medium sentences, and outperforms GoogleTranslate overall. We focus on the correspondence of the Swiss reformer Heinrich Bullinger, but our parallel corpus and our NMT system will be of use for many other texts of the time.

Abstract

This paper outlines our work in collecting training data for and developing a Latin–German Neural Machine Translation (NMT) system, for translating 16th century letters. While Latin–German is a low-resource language pair in terms of NMT, the domain of 16th century epistolary Latin is even more limited in this regard. Through our efforts in data collection and data generation, we are able to train a NMT model that provides good translations for short to medium sentences, and outperforms GoogleTranslate overall. We focus on the correspondence of the Swiss reformer Heinrich Bullinger, but our parallel corpus and our NMT system will be of use for many other texts of the time.

Statistics

Downloads

61 downloads since deposited on 06 Jul 2022
19 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Speech), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
06 Faculty of Arts > Zurich Center for Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:25 June 2022
Deposited On:06 Jul 2022 12:35
Last Modified:30 May 2024 11:38
Publisher:LREC
OA Status:Green
Official URL:http://www.lrec-conf.org/proceedings/lrec2022/workshops/LT4HALA/pdf/2022.lt4hala2022-1.7.pdf
Related URLs:https://circse.github.io/LT4HALA/2022/ (Organisation)
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)