Publication:

ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures

Date

Date

Date
2024
Conference or Workshop Item
Published version

Citations

Citation copied

Schimanski, T., Ni, J., Martín, R. S., Ranger, N., & Leippold, M. (2024). ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures. Proceedings of the Conference on Empirical Methods in Natural Language Processing, 17509–17524. https://doi.org/10.18653/v1/2024.emnlp-main.969

Abstract

Abstract

Abstract

To handle the vast amounts of qualitative data produced in corporate climate communication, stakeholders increasingly rely on Retrieval Augmented Generation (RAG) systems. However, a significant gap remains in evaluating domain-specific information retrieval – the basis for answer generation. To address this challenge, this work simulates the typical tasks of a sustainability analyst by examining 30 sustainability reports with 16 detailed climate-related questions. As a result, we obtain a dataset with over 8.5K unique question-source

Metrics

Citations

Additional indexing

Creators (Authors)

Event Title

Event Title

Event Title
The 2024 Conference on Empirical Methods in Natural Language Processing

Event Location

Event Location

Event Location
Miami

Event Country

Event Country

Event Country
Florida, USA

Event Start Date

Event Start Date

Event Start Date
2024-11-12

Event End Date

Event End Date

Event End Date
2024-11-16

Page range/Item number

Page range/Item number

Page range/Item number
17509

Page end

Page end

Page end
17524

Item Type

Item Type

Item Type
Conference or Workshop Item

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Scope

Scope

Scope
Discipline-based scholarship (basic research)

Language

Language

Language
English

Date available

Date available

Date available
2025-02-03

Series Name

Series Name

Series Name
Proceedings of the Conference on Empirical Methods in Natural Language Processing

OA Status

OA Status

OA Status
Gold

Free Access at

Free Access at

Free Access at
DOI

Metrics

Citations

Citations

Citation copied

Schimanski, T., Ni, J., Martín, R. S., Ranger, N., & Leippold, M. (2024). ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures. Proceedings of the Conference on Empirical Methods in Natural Language Processing, 17509–17524. https://doi.org/10.18653/v1/2024.emnlp-main.969

Gold Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image