Publication: Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding
Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding
Date
Date
Date
| cris.lastimport.scopus | 2025-06-26T03:40:39Z | |
| cris.lastimport.wos | 2025-07-30T01:31:30Z | |
| cris.virtual.orcid | https://orcid.org/0000-0002-1438-4741 | |
| cris.virtualsource.orcid | ac7b092b-8c4b-4590-b002-eff6c71c35d0 | |
| dc.contributor.institution | University of Zurich | |
| dc.date.accessioned | 2024-08-22T07:17:15Z | |
| dc.date.available | 2024-08-22T07:17:15Z | |
| dc.date.issued | 2024-03 | |
| dc.description.abstract | Hallucinations and off-target translation remain unsolved problems in MT, especially for low-resource languages and massively multilingual models. In this paper, we introduce two related methods to mitigate these failure cases with a modified decoding objective, without either requiring retraining or external models. In source-contrastive decoding, we search for a translation that is probable given the correct input, but improbable given a random input segment. In language-contrastive decoding, we search for a translation that is probable, but improbable given the wrong language indicator token. Experiments on the massively multilingual models M2M-100 (418M) and SMaLL-100 show that these methods suppress hallucinations and off-target translations, reducing the number of translations with segment-level chrF2 below 10 by 67-83% on average across 57 tested translation directions. In a proof of concept on out-of-English translation, we also show that we can suppress off-target translations with large language models. We release code upon acceptance. | |
| dc.identifier.scopus | 2-s2.0-85189888420 | |
| dc.identifier.uri | https://www.zora.uzh.ch/handle/20.500.14742/220515 | |
| dc.identifier.wos | 001356733300004 | |
| dc.language.iso | eng | |
| dc.subject.ddc | 410 Linguistics | |
| dc.subject.ddc | 000 Computer science, knowledge & systems | |
| dc.title | Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding | |
| dc.type | conference_item | |
| dcterms.accessRights | info:eu-repo/semantics/openAccess | |
| dcterms.bibliographicCitation.originalpublishername | Association for Computational Linguistics | |
| dcterms.bibliographicCitation.originalpublisherplace | St. Julian's, Malta | |
| dcterms.bibliographicCitation.pageend | 33 | |
| dcterms.bibliographicCitation.pagestart | 21 | |
| dcterms.bibliographicCitation.url | https://aclanthology.org/2024.eacl-short.4 | |
| dspace.entity.type | Publication | en |
| oairecerif.event.country | Malta | |
| oairecerif.event.endDate | 2024-03 | |
| oairecerif.event.place | St. Julian’s | |
| oairecerif.event.startDate | 2024-03 | |
| uzh.contributor.affiliation | University of Zurich, University of Edinburgh | |
| uzh.contributor.affiliation | University of Zurich | |
| uzh.contributor.affiliation | University of Zurich, EPFL | |
| uzh.contributor.author | Sennrich, Rico | |
| uzh.contributor.author | Vamvas, Jannis | |
| uzh.contributor.author | Mohammadshahi, Alireza | |
| uzh.contributor.correspondence | Yes | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.document.availability | published_version | |
| uzh.eprint.datestamp | 2024-08-22 07:17:15 | |
| uzh.eprint.lastmod | 2025-01-31 02:37:10 | |
| uzh.eprint.statusChange | 2024-08-22 07:17:15 | |
| uzh.event.presentationType | paper | |
| uzh.event.title | Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers) | |
| uzh.event.type | conference | |
| uzh.harvester.eth | Yes | |
| uzh.harvester.nb | No | |
| uzh.identifier.doi | 10.5167/uzh-261193 | |
| uzh.oastatus.zora | Green | |
| uzh.publication.citation | Sennrich, Rico; Vamvas, Jannis; Mohammadshahi, Alireza (2024). Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding. In: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers), St. Julian’s, Malta, March 2024. Association for Computational Linguistics, 21-33. | |
| uzh.publication.freeAccessAt | officialurl | |
| uzh.publication.originalwork | original | |
| uzh.publication.publishedStatus | final | |
| uzh.scopus.impact | 5 | |
| uzh.scopus.subjects | Language and Linguistics | |
| uzh.scopus.subjects | Linguistics and Language | |
| uzh.workflow.eprintid | 261193 | |
| uzh.workflow.fulltextStatus | public | |
| uzh.workflow.revisions | 19 | |
| uzh.workflow.rightsCheck | offen | |
| uzh.workflow.status | archive | |
| uzh.wos.impact | 3 | |
| Files | ||
| Publication available in collections: |