Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Revisiting the Uniform Information Density Hypothesis

Meister, Clara; Pimentel, Tiago; Haller, Patrick; Jäger, Lena Ann; Cotterell, Ryan; Levy, Roger (2021). Revisiting the Uniform Information Density Hypothesis. In: Empirical Methods in Natural Language Processing 2021, Online and Punta Cana, Dominican Republic, 7 November 2021 - 11 November 2021, arXiv.

Abstract

The uniform information density (UID) hypothesis posits a preference among language users for utterances structured such that information is distributed uniformly across a signal. While its implications on language production have been well explored, the hypothesis potentially makes predictions about language comprehension and linguistic acceptability as well. Further, it is unclear how uniformity in a linguistic signal -- or lack thereof -- should be measured, and over which linguistic unit, e.g., the sentence or language level, this uniformity should hold. Here we investigate these facets of the UID hypothesis using reading time and acceptability data. While our reading time results are generally consistent with previous work, they are also consistent with a weakly super-linear effect of surprisal, which would be compatible with UID's predictions. For acceptability judgments, we find clearer evidence that non-uniformity in information density is predictive of lower acceptability. We then explore multiple operationalizations of UID, motivated by different interpretations of the original hypothesis, and analyze the scope over which the pressure towards uniformity is exerted. The explanatory power of a subset of the proposed operationalizations suggests that the strongest trend may be a regression towards a mean surprisal across the language, rather than the phrase, sentence, or document -- a finding that supports a typical interpretation of UID, namely that it is the byproduct of language users maximizing the use of a (hypothetical) communication channel.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
06 Faculty of Arts > Zurich Center for Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:11 November 2021
Deposited On:25 Oct 2021 05:59
Last Modified:28 Apr 2022 07:07
Publisher:arXiv
Number:11635v1
Additional Information:Online and Punta Cana, Dominican Republic - Findings of the Association for Computational Linguistics: EMNLP 2021 - 7-11.11.2021
OA Status:Green
Official URL:https://aclanthology.org/2021.emnlp-main.74.pdf
Related URLs:https://aclanthology.org/2021.emnlp-main.74.pdf
Download PDF  'Revisiting the Uniform Information Density Hypothesis'.
Preview
  • Content: Published Version

Metadata Export

Statistics

Citations

17 citations in Web of Science®
41 citations in Scopus®
Google Scholar™

Downloads

33 downloads since deposited on 25 Oct 2021
15 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications