Header

UZH-Logo

Maintenance Infos

Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging


Sugisaki, Kyoko; Wiedmer, Nicolas; Hausendorf, Heiko (2018). Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging. In: 11th edition of the Language Resources and Evaluation Conference, Miyazaki, Japan, 7 May 2018 - 12 May 2018, The LREC.

Abstract

In this paper, we present a corpus of over 11,000 holiday picture postcards written in German and Swiss German. We discuss the processes of digitalization, transcription, manual annotation and the development of the automatic text segmentation and part-of-speech tagging.

Abstract

In this paper, we present a corpus of over 11,000 holiday picture postcards written in German and Swiss German. We discuss the processes of digitalization, transcription, manual annotation and the development of the automatic text segmentation and part-of-speech tagging.

Statistics

Citations

Downloads

101 downloads since deposited on 20 Feb 2018
9 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of German Studies
06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:12 May 2018
Deposited On:20 Feb 2018 16:25
Last Modified:14 Aug 2022 07:36
Publisher:The LREC
Funders:SNF
OA Status:Green
Related URLs:http://lrec2018.lrec-conf.org/en/
Project Information:
  • : FunderSNSF
  • : Grant ID
  • : Project TitleSNF
  • Content: Published Version
  • Language: English