Publication:

Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER

Date

Date

Date
2016
Conference or Workshop Item
Published version

Citations

Citation copied

Schneider, G., Hundt, M., & Oppliger, R. (2016, September 21). Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER. KONVENS 2016, Bochum.

Abstract

Abstract

Abstract

Tagger accuracy deteriorates when applied to texts different from the training corpus, e.g. with respect to register or time period. On historical data, accuracy can drop to and below 90%. We are tagging and parsing ARCHER, a historical corpus sampled from British and American texts from 1600-1999. We improve tagging accuracy by (1) using a version of the corpus that has been automatically mapped to PDE spelling with VARD, (2) by combining several part-of-speech taggers in an ensemble system – which improves tagging by about 1% over C

Metrics

Downloads

176 since deposited on 2017-02-16
Acq. date: 2025-11-12

Views

337 since deposited on 2017-02-16
Acq. date: 2025-11-12

Additional indexing

Creators (Authors)

  • Schneider, Gerold
  • Hundt, Marianne
  • Oppliger, Rahel

Event Title

Event Title

Event Title
KONVENS 2016

Event Location

Event Location

Event Location
Bochum

Event Start Date

Event Start Date

Event Start Date
2016-09-19

Event End Date

Event End Date

Event End Date
2016-09-21

Publisher

Publisher

Publisher

Item Type

Item Type

Item Type
Conference or Workshop Item

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Language

Language

Language
English

Date available

Date available

Date available
2017-02-16

OA Status

OA Status

OA Status
Green

Free Access at

Free Access at

Free Access at
Unspecified

Metrics

Downloads

176 since deposited on 2017-02-16
Acq. date: 2025-11-12

Views

337 since deposited on 2017-02-16
Acq. date: 2025-11-12

Citations

Citation copied

Schneider, G., Hundt, M., & Oppliger, R. (2016, September 21). Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER. KONVENS 2016, Bochum.

Green Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image