Publication:

Revisiting the Uniform Information Density Hypothesis

Date

Date

Date
2021
Conference or Workshop Item
Published version
cris.lastimport.scopus2025-06-11T03:39:30Z
cris.lastimport.wos2025-07-25T01:30:57Z
cris.virtual.orcidhttps://orcid.org/0000-0002-8968-7587
cris.virtualsource.orcid491375be-cddb-40d4-86bd-7632ef225cd2
dc.contributor.institutionUniversity of Zurich
dc.date.accessioned2021-10-25T05:59:13Z
dc.date.available2021-10-25T05:59:13Z
dc.date.issued2021-11-11
dc.description.abstract

The uniform information density (UID) hypothesis posits a preference among language users for utterances structured such that information is distributed uniformly across a signal. While its implications on language production have been well explored, the hypothesis potentially makes predictions about language comprehension and linguistic acceptability as well. Further, it is unclear how uniformity in a linguistic signal -- or lack thereof -- should be measured, and over which linguistic unit, e.g., the sentence or language level, this uniformity should hold. Here we investigate these facets of the UID hypothesis using reading time and acceptability data. While our reading time results are generally consistent with previous work, they are also consistent with a weakly super-linear effect of surprisal, which would be compatible with UID's predictions. For acceptability judgments, we find clearer evidence that non-uniformity in information density is predictive of lower acceptability. We then explore multiple operationalizations of UID, motivated by different interpretations of the original hypothesis, and analyze the scope over which the pressure towards uniformity is exerted. The explanatory power of a subset of the proposed operationalizations suggests that the strongest trend may be a regression towards a mean surprisal across the language, rather than the phrase, sentence, or document -- a finding that supports a typical interpretation of UID, namely that it is the byproduct of language users maximizing the use of a (hypothetical) communication channel.

dc.identifier.scopus2-s2.0-85117652251
dc.identifier.urihttps://www.zora.uzh.ch/handle/20.500.14742/186955
dc.identifier.wos000855966301007
dc.language.isoeng
dc.subject.ddc000 Computer science, knowledge & systems
dc.subject.ddc410 Linguistics
dc.title

Revisiting the Uniform Information Density Hypothesis

dc.typeconference_item
dcterms.accessRightsinfo:eu-repo/semantics/openAccess
dcterms.bibliographicCitation.number11635v1
dcterms.bibliographicCitation.originalpublishernamearXiv
dcterms.bibliographicCitation.urlhttps://aclanthology.org/2021.emnlp-main.74.pdf
dspace.entity.typePublicationen
oairecerif.event.countryDominican Republic
oairecerif.event.endDate2021-11-11
oairecerif.event.placeOnline and Punta Cana
oairecerif.event.startDate2021-11-07
uzh.contributor.authorMeister, Clara
uzh.contributor.authorPimentel, Tiago
uzh.contributor.authorHaller, Patrick
uzh.contributor.authorJäger, Lena Ann
uzh.contributor.authorCotterell, Ryan
uzh.contributor.authorLevy, Roger
uzh.contributor.correspondenceYes
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.document.availabilitypublished_version
uzh.eprint.datestamp2021-10-25 05:59:13
uzh.eprint.lastmod2022-04-28 07:07:50
uzh.eprint.statusChange2021-10-25 05:59:13
uzh.event.presentationTypepaper
uzh.event.titleEmpirical Methods in Natural Language Processing 2021
uzh.event.typeconference
uzh.harvester.ethYes
uzh.harvester.nbNo
uzh.identifier.doi10.5167/uzh-207982
uzh.note.publicOnline and Punta Cana, Dominican Republic - Findings of the Association for Computational Linguistics: EMNLP 2021 - 7-11.11.2021
uzh.oastatus.zoraGreen
uzh.publication.citationMeister, Clara; Pimentel, Tiago; Haller, Patrick; Jäger, Lena Ann; Cotterell, Ryan; Levy, Roger (2021). Revisiting the Uniform Information Density Hypothesis. In: Empirical Methods in Natural Language Processing 2021, Online and Punta Cana, Dominican Republic, 7 November 2021 - 11 November 2021, arXiv.
uzh.publication.freeAccessAtUNSPECIFIED
uzh.publication.originalworkoriginal
uzh.publication.publishedStatusfinal
uzh.relatedUrl.urlhttps://aclanthology.org/2021.emnlp-main.74.pdf
uzh.scopus.impact54
uzh.workflow.eprintid207982
uzh.workflow.fulltextStatuspublic
uzh.workflow.revisions49
uzh.workflow.rightsCheckoffen
uzh.workflow.statusarchive
uzh.wos.impact28
Files

Original bundle

Name:
emnlp2021_revisiting.pdf
Size:
5.72 MB
Format:
Adobe Portable Document Format
Publication available in collections: