Publication:

On the Limits of Minimal Pairs in Contrastive Evaluation

Date

Date

Date
2021
Conference or Workshop Item
Published version
cris.lastimport.scopus2025-06-10T03:44:17Z
cris.virtual.orcidhttps://orcid.org/0000-0002-1438-4741
cris.virtualsource.orcidac7b092b-8c4b-4590-b002-eff6c71c35d0
dc.contributor.institutionUniversity of Zurich
dc.date.accessioned2021-09-17T05:34:36Z
dc.date.available2021-09-17T05:34:36Z
dc.date.issued2021-11-11
dc.description.abstract

Minimal sentence pairs are frequently used to analyze the behavior of language models. It is often assumed that model behavior on contrastive pairs is predictive of model behavior at large. We argue that two conditions are necessary for this assumption to hold: First, a tested hypothesis should be well-motivated, since experiments show that contrastive evaluation can lead to false positives. Secondly, test data should be chosen such as to minimize distributional discrepancy between evaluation time and deployment time. For a good approximation of deployment-time decoding, we recommend that minimal pairs are created based on machine-generated text, as opposed to human-written references. We present a contrastive evaluation suite for English–German MT that implements this recommendation.

dc.identifier.scopus2-s2.0-85127226928
dc.identifier.urihttps://www.zora.uzh.ch/handle/20.500.14742/185764
dc.language.isoeng
dc.subject.ddc000 Computer science, knowledge & systems
dc.subject.ddc410 Linguistics
dc.title

On the Limits of Minimal Pairs in Contrastive Evaluation

dc.typeconference_item
dcterms.accessRightsinfo:eu-repo/semantics/openAccess
dcterms.bibliographicCitation.originalpublishernameACL Anthology
dcterms.bibliographicCitation.urlhttps://aclanthology.org/2021.blackboxnlp-1.5/
dspace.entity.typePublicationen
oairecerif.event.countryDominican Republic
oairecerif.event.endDate2021-11-11
oairecerif.event.placeOnline and in Punta Cana
oairecerif.event.startDate2021-11-11
uzh.contributor.authorVamvas, Jannis
uzh.contributor.authorSennrich, Rico
uzh.contributor.correspondenceYes
uzh.contributor.correspondenceNo
uzh.document.availabilitypostprint
uzh.eprint.datestamp2021-09-17 05:34:36
uzh.eprint.lastmod2022-04-27 07:35:27
uzh.eprint.statusChange2021-09-17 05:34:36
uzh.event.presentationTypelecture
uzh.event.titleFourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP
uzh.event.typeworkshop
uzh.funder.nameSNSF
uzh.funder.projectNumberPP00P1_176727
uzh.funder.projectTitleMulti-Task Learning with Multilingual Resources for Better Natural Language Understanding
uzh.harvester.ethYes
uzh.harvester.nbNo
uzh.identifier.doi10.5167/uzh-206607
uzh.oastatus.zoraGreen
uzh.publication.citationVamvas, J., & Sennrich, R. (2021, November 11). On the Limits of Minimal Pairs in Contrastive Evaluation. Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, Online and in Punta Cana. https://aclanthology.org/2021.blackboxnlp-1.5/
uzh.publication.freeAccessAtUNSPECIFIED
uzh.publication.originalworkoriginal
uzh.publication.publishedStatusfinal
uzh.relatedUrl.typeresearchdata
uzh.relatedUrl.urlhttps://github.com/ZurichNLP/distil-lingeval
uzh.scopus.impact11
uzh.workflow.eprintid206607
uzh.workflow.fulltextStatuspublic
uzh.workflow.revisions22
uzh.workflow.rightsCheckoffen
uzh.workflow.statusarchive
Files

Original bundle

Name:
emnlp2021.pdf
Size:
233.57 KB
Format:
Adobe Portable Document Format
Publication available in collections: