Publication: A Corpus for Automatic Readability Assessment and Text Simplification of German
Citations
Battisti, A., Pfütze, D., Säuberli, A., Kostrzewa, M., & Ebling, S. (2020, May 16). A Corpus for Automatic Readability Assessment and Text Simplification of German. 12th Language Resources and Evaluation Conference (LREC 2020), Marseille. https://www.aclweb.org/anthology/2020.lrec-1.404.pdf
Abstract
In this paper, we present a corpus for use in automatic readability assessment and automatic text simplification for German. The corpus is compiled from web sources and consists of parallel as well as monolingual-only (simplified German) data amounting to approximately 6,200 documents (nearly 211,000 sentences). As a unique feature, the corpus contains information on text structure (e.g., paragraphs, lines), typography (e.g., font type, font style), and images (content, position, and dimensions). While the importance of considering su