Navigation auf zora.uzh.ch

Search

ZORA (Zurich Open Repository and Archive)

Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging

Sugisaki, Kyoko; Wiedmer, Nicolas; Hausendorf, Heiko (2018). Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging. In: 11th edition of the Language Resources and Evaluation Conference, Miyazaki, Japan, 7 May 2018 - 12 May 2018, The LREC.

Abstract

In this paper, we present a corpus of over 11,000 holiday picture postcards written in German and Swiss German. We discuss the processes of digitalization, transcription, manual annotation and the development of the automatic text segmentation and part-of-speech tagging.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of German Studies
06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:12 May 2018
Deposited On:20 Feb 2018 16:25
Last Modified:14 Aug 2022 07:36
Publisher:The LREC
Funders:SNF
OA Status:Green
Related URLs:http://lrec2018.lrec-conf.org/en/
Project Information:
  • Funder: SNSF
  • Grant ID:
  • Project Title: SNF
Download PDF  'Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging'.
Preview
  • Content: Published Version
  • Language: English

Metadata Export

Statistics

Citations

Downloads

109 downloads since deposited on 20 Feb 2018
13 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications