Publication: Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER
Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER
Date
Date
Date
Citations
Schneider, G., Hundt, M., & Oppliger, R. (2016, September 21). Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER. KONVENS 2016, Bochum.
Abstract
Abstract
Abstract
Tagger accuracy deteriorates when applied to texts different from the training corpus, e.g. with respect to register or time period. On historical data, accuracy can drop to and below 90%. We are tagging and parsing ARCHER, a historical corpus sampled from British and American texts from 1600-1999. We improve tagging accuracy by (1) using a version of the corpus that has been automatically mapped to PDE spelling with VARD, (2) by combining several part-of-speech taggers in an ensemble system – which improves tagging by about 1% over C
Metrics
Downloads
Views
Additional indexing
Creators (Authors)
Event Title
Event Title
Event Title
Event Location
Event Location
Event Location
Event Start Date
Event Start Date
Event Start Date
Event End Date
Event End Date
Event End Date
Item Type
Item Type
Item Type
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Language
Language
Language
Date available
Date available
Date available
OA Status
OA Status
OA Status
Free Access at
Free Access at
Free Access at
Metrics
Downloads
Views
Citations
Schneider, G., Hundt, M., & Oppliger, R. (2016, September 21). Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER. KONVENS 2016, Bochum.