Publication: On the Limits of Minimal Pairs in Contrastive Evaluation
On the Limits of Minimal Pairs in Contrastive Evaluation
Date
Date
Date
| cris.lastimport.scopus | 2025-06-10T03:44:17Z | |
| cris.virtual.orcid | https://orcid.org/0000-0002-1438-4741 | |
| cris.virtualsource.orcid | ac7b092b-8c4b-4590-b002-eff6c71c35d0 | |
| dc.contributor.institution | University of Zurich | |
| dc.date.accessioned | 2021-09-17T05:34:36Z | |
| dc.date.available | 2021-09-17T05:34:36Z | |
| dc.date.issued | 2021-11-11 | |
| dc.description.abstract | Minimal sentence pairs are frequently used to analyze the behavior of language models. It is often assumed that model behavior on contrastive pairs is predictive of model behavior at large. We argue that two conditions are necessary for this assumption to hold: First, a tested hypothesis should be well-motivated, since experiments show that contrastive evaluation can lead to false positives. Secondly, test data should be chosen such as to minimize distributional discrepancy between evaluation time and deployment time. For a good approximation of deployment-time decoding, we recommend that minimal pairs are created based on machine-generated text, as opposed to human-written references. We present a contrastive evaluation suite for English–German MT that implements this recommendation. | |
| dc.identifier.scopus | 2-s2.0-85127226928 | |
| dc.identifier.uri | https://www.zora.uzh.ch/handle/20.500.14742/185764 | |
| dc.language.iso | eng | |
| dc.subject.ddc | 000 Computer science, knowledge & systems | |
| dc.subject.ddc | 410 Linguistics | |
| dc.title | On the Limits of Minimal Pairs in Contrastive Evaluation | |
| dc.type | conference_item | |
| dcterms.accessRights | info:eu-repo/semantics/openAccess | |
| dcterms.bibliographicCitation.originalpublishername | ACL Anthology | |
| dcterms.bibliographicCitation.url | https://aclanthology.org/2021.blackboxnlp-1.5/ | |
| dspace.entity.type | Publication | en |
| oairecerif.event.country | Dominican Republic | |
| oairecerif.event.endDate | 2021-11-11 | |
| oairecerif.event.place | Online and in Punta Cana | |
| oairecerif.event.startDate | 2021-11-11 | |
| uzh.contributor.author | Vamvas, Jannis | |
| uzh.contributor.author | Sennrich, Rico | |
| uzh.contributor.correspondence | Yes | |
| uzh.contributor.correspondence | No | |
| uzh.document.availability | postprint | |
| uzh.eprint.datestamp | 2021-09-17 05:34:36 | |
| uzh.eprint.lastmod | 2022-04-27 07:35:27 | |
| uzh.eprint.statusChange | 2021-09-17 05:34:36 | |
| uzh.event.presentationType | lecture | |
| uzh.event.title | Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP | |
| uzh.event.type | workshop | |
| uzh.funder.name | SNSF | |
| uzh.funder.projectNumber | PP00P1_176727 | |
| uzh.funder.projectTitle | Multi-Task Learning with Multilingual Resources for Better Natural Language Understanding | |
| uzh.harvester.eth | Yes | |
| uzh.harvester.nb | No | |
| uzh.identifier.doi | 10.5167/uzh-206607 | |
| uzh.oastatus.zora | Green | |
| uzh.publication.citation | Vamvas, J., & Sennrich, R. (2021, November 11). On the Limits of Minimal Pairs in Contrastive Evaluation. Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, Online and in Punta Cana. https://aclanthology.org/2021.blackboxnlp-1.5/ | |
| uzh.publication.freeAccessAt | UNSPECIFIED | |
| uzh.publication.originalwork | original | |
| uzh.publication.publishedStatus | final | |
| uzh.relatedUrl.type | researchdata | |
| uzh.relatedUrl.url | https://github.com/ZurichNLP/distil-lingeval | |
| uzh.scopus.impact | 11 | |
| uzh.workflow.eprintid | 206607 | |
| uzh.workflow.fulltextStatus | public | |
| uzh.workflow.revisions | 22 | |
| uzh.workflow.rightsCheck | offen | |
| uzh.workflow.status | archive | |
| Files | ||
| Publication available in collections: |