Publication: Comparing Rule-based and SMT-based Spelling Normalisation for English Historical Texts
Date
Citations
Schneider, G., Pettersson, E., & Percillier, M. (2017). Comparing Rule-based and SMT-based Spelling Normalisation for English Historical Texts (No. 133). 133, 40–46. http://www.ep.liu.se/ecp/article.asp?issue=133&article=008&volume=#
Abstract
To be able to use existing natural language processing tools for analysing historical text, an important preprocessing step is spelling normalisation, converting the original spelling to present-day spelling, before applying tools such as taggers and parsers. In this paper, we compare a probabilistic, language-independent approach to spelling normalisation based on statistical machine translation (SMT) techniques to a rule-based system combining dictionary lookup with rules and non-probabilistic weights. The rule-based system reaches
Additional indexing
Creators (Authors)
Event Title
Event Location
Event Start Date
Event End Date
Publisher
Page range/Item number
Page end
Item Type
Dewey Decimal Classification
Language
Date available
Number
OA Status
Free Access at
Official URL