Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Assessing Large Language Models on climate information

Bulian, Jannis; Schäfer, Mike S; Amini, Afra; Lam, Heidi; Ciaramita, Massimiliano; Gaiarin, Ben; Chen Hübscher, Michelle; Buck, Christian; Mede, Niels G; Leippold, Markus; Strauß, Nadine (2024). Assessing Large Language Models on climate information. In: 41st International Conference on Machine Learning, Vienna, 21 July 2024 - 27 July 2024, MLResearch Press.

Abstract

As Large Language Models (LLMs) rise in popularity, it is necessary to assess their capability in critically relevant domains. We present a comprehensive evaluation framework, grounded in science communication research, to assess LLM responses to questions about climate change. Our framework emphasizes both presentational and epistemological adequacy, offering a fine-grained analysis of LLM generations spanning 8 dimensions and 30 issues. Our evaluation task is a real-world example of a growing number of challenging problems where AI can complement and lift human performance. We introduce a novel protocol for scalable oversight that relies on AI Assistance and raters with relevant education. We evaluate several recent LLMs on a set of diverse climate questions. Our results point to a significant gap between surface and epistemological qualities of LLMs in the realm of climate communication.

Additional indexing

Item Type:Conference or Workshop Item (Paper), not_refereed, original work
Communities & Collections:06 Faculty of Arts > Department of Communication and Media Research
Dewey Decimal Classification:070 News media, journalism & publishing
Language:English
Event End Date:27 July 2024
Deposited On:10 Jun 2024 15:07
Last Modified:08 Aug 2024 14:43
Publisher:MLResearch Press
Series Name:Proceedings of Machine Learning Research (PMLR)
ISSN:2640-3498
OA Status:Green
Publisher DOI:https://doi.org/10.48550/arXiv.2310.02932
Download PDF  'Assessing Large Language Models on climate information'.
Preview
  • Content: Accepted Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

43 downloads since deposited on 10 Jun 2024
43 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications