ZORA (Zurich Open Repository and Archive)

Overview of HIPE-2022: Named Entity Recognition and Linking in Multilingual Historical Documents

Ehrmann, Maud; Romanello, Matteo; Najem-Meyer, Sven; Doucet, Antoine; Clematide, Simon (2022). Overview of HIPE-2022: Named Entity Recognition and Linking in Multilingual Historical Documents. In: Barrón-Cedeño, Alberto; Da San Martino, Giovanni; Degli Esposti, Mirko; Sebastiani, Fabrizio; Macdonald, Craig; Pasini, Gabriella; Hanbury, Allan; Potthast, Martin; Faggioli, Guglielmo; Ferro, Nicola. Experimental IR Meets Multilinguality, Multimodality, and Interaction. Cham: Springer, 423-446.

Abstract

This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places and other Entities), a shared task on named entity recognition and linking in multilingual historical documents. Following the success of the first CLEF-HIPE-2020 evaluation lab, HIPE-2022 confronts systems with the challenges of dealing with more languages, learning domain-specific entities, and adapting to diverse annotation tag sets. This shared task is part of the ongoing efforts of the natural language processing and digital humanities communities to adapt and develop appropriate technologies to efficiently retrieve and explore information from historical texts. On such material, however, named entity processing techniques face the challenges of domain heterogeneity, input noisiness, dynamics of language, and lack of resources. In this context, the main objective of HIPE-2022, run as an evaluation lab of the CLEF 2022 conference, is to gain new insights into the transferability of named entity processing approaches across languages, time periods, document types, and annotation tag sets. Tasks, corpora, and results of participating teams are presented.

Additional indexing

Item Type: Book Section, refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Scopus Subject Areas: Physical Sciences > Theoretical Computer Science; Physical Sciences > General Computer Science
Uncontrolled Keywords: Named entity recognition and classification; Entity linking; Historical texts; Information extraction; Digitised newspapers; Digital humanities
Language: English
Date: 25 August 2022
Deposited On: 18 Feb 2023 16:13
Last Modified: 23 Mar 2025 04:33
Publisher: Springer
Series Name: Lecture Notes in Computer Science
Number: 13390
ISSN: 0302-9743
ISBN: 9783031136429
OA Status: Closed
Publisher DOI: https://doi.org/10.1007/978-3-031-13643-6_26
Full text not available from this repository.

Statistics

Citations

6 citations in Web of Science®
16 citations in Scopus®