Publication:

Spelling normalisation of Late Modern English: comparison and combination of VARD and character-based statistical machine translation

Date

Date

Date
2020
Book Section
Published version

Citations

Citation copied

Schneider, G. (2020). Spelling normalisation of Late Modern English: comparison and combination of VARD and character-based statistical machine translation. In M. Kytö & E. Smitterberg (Eds.), Late Modern English: novel encounters (No. 214; Issue 214, pp. 243–268). John Benjamins Publishing. https://doi.org/10.1075/slcs.214.11sch

Abstract

Abstract

Abstract

To be able to profit from natural language processing (NLP) tools for analysing historical text, an important step is spelling normalisation. We first compare and second combine two different approaches: on the one hand VARD, a rule-based system which is based on dictionary lookup and rules with non-probabilistic but trainable weights; on the other hand a language-independent approach to spelling normalisation based on statistical machine translation (SMT) techniques. The rule-based system reaches the best accuracy, up to 94% precisio

Metrics

Downloads

4 since deposited on 2020-02-17
Acq. date: 2025-11-14

Views

190 since deposited on 2020-02-17
Acq. date: 2025-11-14

Citations

Additional indexing

Creators (Authors)

  • Schneider, Gerold

Editors

  • Kytö, Merja
  • Smitterberg, Eric

Title of Book

Title of Book

Title of Book
Late Modern English: novel encounters

Place of Publication

Place of Publication

Place of Publication
Amsterdam

Publisher

Publisher

Publisher

Page range/Item number

Page range/Item number

Page range/Item number
243

Page end

Page end

Page end
268

Item Type

Item Type

Item Type
Book Section

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Keywords

Late Modern English, Spelling Normalisation, VARD, Ensemble Learning, Character-based Machine Translation

Language

Language

Language
English

Publication date

Publication date

Publication date
2020-03

Date available

Date available

Date available
2020-02-17

Series Name

Series Name

Series Name
Studies in language companion series

ISSN or e-ISSN

ISSN or e-ISSN

ISSN or e-ISSN
0165-7763

ISBN or e-ISBN

ISBN or e-ISBN

ISBN or e-ISBN
9789027261434

OA Status

OA Status

OA Status
Closed

Free Access at

Free Access at

Free Access at
Unspecified

Related URLs

Related URLs

Related URLs

Metrics

Downloads

4 since deposited on 2020-02-17
Acq. date: 2025-11-14

Views

190 since deposited on 2020-02-17
Acq. date: 2025-11-14

Citations

Citations

Citation copied

Schneider, G. (2020). Spelling normalisation of Late Modern English: comparison and combination of VARD and character-based statistical machine translation. In M. Kytö & E. Smitterberg (Eds.), Late Modern English: novel encounters (No. 214; Issue 214, pp. 243–268). John Benjamins Publishing. https://doi.org/10.1075/slcs.214.11sch

Closed
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image