Publication: Name Consistency in LLM-based Machine Translation of Historical Texts
| cris.virtual.orcid | https://orcid.org/0000-0002-2063-4516 | |
| cris.virtualsource.orcid | 8fbbe5f4-ab2a-4bbe-a533-e6a4112e86d8 | |
| dc.date.accessioned | 2025-07-22T08:28:16Z | |
| dc.date.available | 2025-07-22T08:28:16Z | |
| dc.date.issued | 2025-06-27 | |
| dc.description.abstract | Large Language Models (LLMs) excel at translating 16th-century letters from Latin and Early New High German to modern English and German. While they perform well at translating well-known historical city names (e.g., Lutetia --> Paris), their ability to handle person names (e.g., Theodor Bibliander) or lesser-known toponyms (e.g., Augusta Vindelicorum --> Augsburg) remains unclear. This study investigates LLM-based translations of person and place names across various frequency bands in a corpus of 16th-century letters. Our results show that LLMs struggle with person names, achieving accuracies around 60%, but perform better with place names, reaching accuracies around 90%. We further demonstrate that including a translation suggestion for the proper noun in the prompt substantially boosts accuracy, yielding highly reliable results. | |
| dc.identifier.isbn | 978-2-9701897-0-1 | |
| dc.identifier.uri | https://www.zora.uzh.ch/handle/20.500.14742/232137 | |
| dc.language.iso | eng | |
| dc.subject.ddc | 410 Linguistics | |
| dc.subject.ddc | 000 Computer science, knowledge & systems | |
| dc.subject.ddc | 400 Language | |
| dc.title | Name Consistency in LLM-based Machine Translation of Historical Texts | |
| dc.type | conference_item | |
| dcterms.accessRights | info:eu-repo/semantics/openAccess | |
| dcterms.bibliographicCitation.originalpublishername | Association for Computational Linguistics | |
| dspace.entity.type | Publication | en |
| oairecerif.event.endDate | 2025-06-27 | |
| oairecerif.event.place | Genève | |
| oairecerif.event.startDate | 2025-06-23 | |
| uzh.contributor.author | Fischer, Dominic P | |
| uzh.contributor.author | Volk, Martin | |
| uzh.contributor.correspondence | Yes | |
| uzh.contributor.correspondence | No | |
| uzh.document.availability | postprint | |
| uzh.eprint.datestamp | 2025-07-22 08:28:16 | |
| uzh.eprint.lastmod | 2025-07-22 08:28:16 | |
| uzh.eprint.statusChange | 2025-07-22 08:28:16 | |
| uzh.event.presentationType | paper | |
| uzh.event.title | 20th Machine Translation Summit | |
| uzh.event.type | conference | |
| uzh.funder.name | UZH Foundation | |
| uzh.funder.projectTitle | Bullinger Digital | |
| uzh.funder.projectURI | https://www.bullinger-digital.ch | |
| uzh.harvester.eth | Yes | |
| uzh.harvester.nb | No | |
| uzh.identifier.doi | 10.5167/uzh-279357 | |
| uzh.oastatus.zora | Green | |
| uzh.publication.citation | Fischer, D. P., & Volk, M. (2025). Name Consistency in LLM-based Machine Translation of Historical Texts. Proceedings of the Machine Translation Summit. Presented at the 20th Machine Translation Summit, Association for Computational Linguistics. | |
| uzh.publication.citation | Fischer, Dominic P; Volk, Martin (2025). Name Consistency in LLM-based Machine Translation of Historical Texts. In: 20th Machine Translation Summit, Genève, 23 June 2025 - 27 June 2025, Association for Computational Linguistics. | |
| uzh.publication.freeAccessAt | UNSPECIFIED | |
| uzh.publication.originalwork | original | |
| uzh.publication.publishedStatus | firstelectronic | |
| uzh.publication.seriesTitle | Proceedings of the Machine Translation Summit | |
| uzh.workflow.eprintid | 279357 | |
| uzh.workflow.fulltextStatus | public | |
| uzh.workflow.revisions | 15 | |
| uzh.workflow.rightsCheck | keininfo | |
| uzh.workflow.status | archive | |