ZORA (Zurich Open Repository and Archive)

Extended Overview of HIPE-2022: Named Entity Recognition and Linking in Multilingual Historical Documents

Ehrmann, Maud; Romanello, Matteo; Najem-Meyer, Sven; Doucet, Antoine; Clematide, Simon (2022). Extended Overview of HIPE-2022: Named Entity Recognition and Linking in Multilingual Historical Documents. In: Faggioli, Guglielmo; Ferro, Nicola; Hanbury, Alan; Potthast, Martin. Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum (CLEF). Aachen: CEUR-WS, 1038-1063.

Abstract

This paper presents an overview of the second edition of HIPE (Identifying Historical People, Places and other Entities), a shared task on named entity recognition and linking in multilingual historical documents. Following the success of the first CLEF-HIPE-2020 evaluation lab, HIPE-2022 confronts systems with the challenges of dealing with more languages, learning domain-specific entities, and adapting to diverse annotation tag sets. This shared task is part of the ongoing efforts of the natural language processing and digital humanities communities to adapt and develop appropriate technologies to efficiently retrieve and explore information from historical texts. On such material, however, named entity processing techniques face the challenges of domain heterogeneity, input noisiness, dynamics of language, and lack of resources. In this context, the main objective of HIPE-2022, run as an evaluation lab of the CLEF 2022 conference, is to gain new insights into the transferability of named entity processing approaches across languages, time periods, document types, and annotation tag sets. Tasks, corpora, and results of participating teams are presented. Compared to the condensed overview [1], this paper contains more refined statistics on the datasets, a breakdown of the results per type of entity, and a discussion of the ‘challenges’ proposed in the shared task.

Additional indexing

Item Type: Book Section, refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Scopus Subject Areas: Physical Sciences > General Computer Science
Language: English
Date: 5 September 2022
Deposited On: 18 Feb 2023 16:15
Last Modified: 23 Mar 2025 04:33
Publisher: CEUR-WS
Series Name: CEUR Workshop Proceedings
Number: 3180
ISSN: 1613-0073
OA Status: Green
Free access at: Publisher DOI. An embargo period may apply.
Official URL: http://ceur-ws.org/Vol-3180/#paper-83
Full text (PDF):
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)
