Publication: Parsing early and late modern English corpora
Parsing early and late modern English corpora
Date
Date
Date
Citations
Schneider, G., Lehmann, H. M., & Schneider, P. (2015). Parsing early and late modern English corpora. Literary and Linguistic Computing, 30(3), 423–439. https://doi.org/10.1093/llc/fqu001
Abstract
Abstract
Abstract
We describe, evaluate, and improve the automatic annotation of diachronic corpora at the levels of word-class, lemma, chunks, and dependency syntax. As corpora we use the ARCHER corpus (texts from 1,600 to 2,000) and the ZEN corpus (texts from 1,660 to 1,800). Performance on Modern English is considerably lower than on Present Day English (PDE). We present several methods that improve performance. First we use the spelling normalization tool VARD to map spelling variants to their PDE equivalent, which improves tagging. We investigate
Metrics
Downloads
Views
Additional indexing
Creators (Authors)
Volume
Volume
Volume
Number
Number
Number
Page range/Item number
Page range/Item number
Page range/Item number
Page end
Page end
Page end
Item Type
Item Type
Item Type
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Dewey Decimal Classifikation
Language
Language
Language
Publication date
Publication date
Publication date
Date available
Date available
Date available
ISSN or e-ISSN
ISSN or e-ISSN
ISSN or e-ISSN
Additional Information
Additional Information
Additional Information
OA Status
OA Status
OA Status
Free Access at
Free Access at
Free Access at
Publisher DOI
Metrics
Downloads
Views
Citations
Schneider, G., Lehmann, H. M., & Schneider, P. (2015). Parsing early and late modern English corpora. Literary and Linguistic Computing, 30(3), 423–439. https://doi.org/10.1093/llc/fqu001