Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Historical Newspaper Content Mining: Revisiting the impresso Project's Challenges in Text and Image Processing, Design and Historical Scholarship

Ehrmann, Maud; Bunout, Estelle; Clematide, Simon; Düring, Marten; Fickers, Andreas; Kalyakin, Roman; Kaplan, Frédéric; Romanello, Matteo; Schroeder, Paul; Ströbel, Phillip Benjamin; van Beek, Thijs; Volk, Martin; Wieneke, Lars (2020). Historical Newspaper Content Mining: Revisiting the impresso Project's Challenges in Text and Image Processing, Design and Historical Scholarship. In: Digital Humanities 2020, Ottawa, 22 July 2020 - 2020.

Abstract

impresso. Media Monitoring of the Past is an interdisciplinary research project in which a team of computational linguists, designers and historians collaborate on the datafication of a multilingual corpus of digitised historical newspapers. The primary goals of the project are to improve text mining tools for historical text, to enrich historical newspapers with (semi-) automatically generated data and to integrate such data into historical research workflows by means of a newly developed user interface. In this paper we discuss our efforts to overcome inherent challenges and to integrate text mining and data visualisation applications in general historical research practices which are characterised by search operations as well as the need to create topical collections.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, further contribution
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:2020
Deposited On:30 Aug 2023 08:39
Last Modified:09 Feb 2024 00:58
OA Status:Green
Related URLs:https://dh2020.adho.org/wp-content/uploads/2020/07/537_HistoricalNewspaperContentMiningRevisitingtheimpressoProjectsChallengesinTextandImageProcessingDesignandHistoricalScholarship.html (Organisation)
https://zenodo.org/record/4641894
Project Information:
Download PDF  'Historical Newspaper Content Mining: Revisiting the impresso Project's Challenges in Text and Image Processing, Design and Historical Scholarship'.
Preview
  • Content: Accepted Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Downloads

10 downloads since deposited on 30 Aug 2023
7 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications