Publication: Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus
Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus
Date
Date
Date
Citations
Clematide, S., Furrer, L., & Volk, M. (2016). Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus. 975–982. http://www.lrec-conf.org/proceedings/lrec2016/pdf/917_Paper.pdf
Abstract
Abstract
Abstract
Crowdsourcing approaches for post-correction of OCR output (Optical Character Recognition) have been successfully applied to several historic text collections. We report on our crowd-correction platform Kokos, which we built to improve the OCR quality of the digitized yearbooks of the Swiss Alpine Club (SAC) from the 19th century. This multilingual heritage corpus consists of Alpine texts mainly written in German and French, all typeset in Antiqua font. Finding and engaging volunteers for correcting large amounts of pages into high qu
Metrics
Downloads
Views
Additional indexing
Creators (Authors)
Event Title
Event Title
Event Title
Event Location
Event Location
Event Location
Event Country
Event Country
Event Country
Event Start Date
Event Start Date
Event Start Date
Event End Date
Event End Date
Event End Date
Publisher
Publisher
Publisher
Page range/Item number
Page range/Item number
Page range/Item number
Page end
Page end
Page end
Item Type
Item Type
Item Type
In collections
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Language
Language
Language
Date available
Date available
Date available
ISBN or e-ISBN
ISBN or e-ISBN
ISBN or e-ISBN
OA Status
OA Status
OA Status
Free Access at
Free Access at
Free Access at
Official URL
Official URL
Official URL
Metrics
Downloads
Views
Citations
Clematide, S., Furrer, L., & Volk, M. (2016). Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus. 975–982. http://www.lrec-conf.org/proceedings/lrec2016/pdf/917_Paper.pdf