Publication:

Name Consistency in LLM-based Machine Translation of Historical Texts

Date

Date

Date
2025
Conference or Workshop Item
Epub ahead of print
cris.virtual.orcidhttps://orcid.org/0000-0002-2063-4516
cris.virtualsource.orcid8fbbe5f4-ab2a-4bbe-a533-e6a4112e86d8
dc.date.accessioned2025-07-22T08:28:16Z
dc.date.available2025-07-22T08:28:16Z
dc.date.issued2025-06-27
dc.description.abstract

Large Language Models (LLMs) excel at translating 16th-century letters from Latin and Early New High German to modern English and German. While they perform well at translating well-known historical city names (e.g., Lutetia --> Paris), their ability to handle person names (e.g., Theodor Bibliander) or lesser-known toponyms (e.g., Augusta Vindelicorum --> Augsburg) remains unclear. This study investigates LLM-based translations of person and place names across various frequency bands in a corpus of 16th-century letters. Our results show that LLMs struggle with person names, achieving accuracies around 60%, but perform better with place names, reaching accuracies around 90%. We further demonstrate that including a translation suggestion for the proper noun in the prompt substantially boosts accuracy, yielding highly reliable results.

dc.identifier.isbn978-2-9701897-0-1
dc.identifier.urihttps://www.zora.uzh.ch/handle/20.500.14742/232137
dc.language.isoeng
dc.subject.ddc410 Linguistics
dc.subject.ddc000 Computer science, knowledge & systems
dc.subject.ddc400 Language
dc.title

Name Consistency in LLM-based Machine Translation of Historical Texts

dc.typeconference_item
dcterms.accessRightsinfo:eu-repo/semantics/openAccess
dcterms.bibliographicCitation.originalpublishernameAssociation for Computational Linguistics
dspace.entity.typePublicationen
oairecerif.event.endDate2025-06-27
oairecerif.event.placeGenève
oairecerif.event.startDate2025-06-23
uzh.contributor.authorFischer, Dominic P
uzh.contributor.authorVolk, Martin
uzh.contributor.correspondenceYes
uzh.contributor.correspondenceNo
uzh.document.availabilitypostprint
uzh.eprint.datestamp2025-07-22 08:28:16
uzh.eprint.lastmod2025-07-22 08:28:16
uzh.eprint.statusChange2025-07-22 08:28:16
uzh.event.presentationTypepaper
uzh.event.title20th Machine Translation Summit
uzh.event.typeconference
uzh.funder.nameUZH Foundation
uzh.funder.projectTitleBullinger Digital
uzh.funder.projectURIhttps://www.bullinger-digital.ch
uzh.harvester.ethYes
uzh.harvester.nbNo
uzh.identifier.doi10.5167/uzh-279357
uzh.oastatus.zoraGreen
uzh.publication.citationFischer, D. P., & Volk, M. (2025). Name Consistency in LLM-based Machine Translation of Historical Texts. Proceedings of the Machine Translation Summit. Presented at the 20th Machine Translation Summit, Association for Computational Linguistics.
uzh.publication.citationFischer, Dominic P; Volk, Martin (2025). Name Consistency in LLM-based Machine Translation of Historical Texts. In: 20th Machine Translation Summit, Genève, 23 Juni 2025 - 27 Juni 2025, Association for Computational Linguistics.
uzh.publication.freeAccessAtUNSPECIFIED
uzh.publication.originalworkoriginal
uzh.publication.publishedStatusfirstelectronic
uzh.publication.seriesTitleProceedings of the Machine Translation Summit
uzh.workflow.eprintid279357
uzh.workflow.fulltextStatuspublic
uzh.workflow.revisions15
uzh.workflow.rightsCheckkeininfo
uzh.workflow.statusarchive
Files

Original bundle

Name:
Name_Consistency_in_LLM_based_MT_of_Historical_Texts_FINAL.pdf
Size:
314 KB
Format:
Adobe Portable Document Format
Publication available in collections: