Publication:

A Corpus for Automatic Readability Assessment and Text Simplification of German

Date

Date

Date
2020
Conference or Workshop Item
Published version

Citations

Citation copied

Battisti, A., Pfütze, D., Säuberli, A., Kostrzewa, M., & Ebling, S. (2020, May 16). A Corpus for Automatic Readability Assessment and Text Simplification of German. 12th Edition of its Language Resources and Evaluation Conference, Marseille. https://www.aclweb.org/anthology/2020.lrec-1.404.pdf

Abstract

Abstract

Abstract

In this paper, we present a corpus for use in automatic readability assessment and automatic text simplification for German. The corpus is compiled from web sources and consists of parallel as well as monolingual-only (simplified German) data amounting to approximately 6,200 documents (nearly 211,000 sentences). As a unique feature, the corpus contains information on text structure (e.g., paragraphs, lines), typography (e.g., font type, font style), and images (content, position, and dimensions). While the importance of considering su

Metrics

Downloads

3 since deposited on 2020-12-01
Acq. date: 2025-11-12

Views

1 since deposited on 2020-12-01
Acq. date: 2025-11-12

Additional indexing

Creators (Authors)

Event Title

Event Title

Event Title
12th Edition of its Language Resources and Evaluation Conference

Event Location

Event Location

Event Location
Marseille

Event Start Date

Event Start Date

Event Start Date
2020-05-11

Event End Date

Event End Date

Event End Date
2020-05-16

Publisher

Publisher

Publisher
European Language Resources Associatio

Item Type

Item Type

Item Type
Conference or Workshop Item

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Language

Language

Language
English

Date available

Date available

Date available
2020-12-01

OA Status

OA Status

OA Status
Green

Free Access at

Free Access at

Free Access at
Official URL

Official URL

Official URL

Official URL

Metrics

Downloads

3 since deposited on 2020-12-01
Acq. date: 2025-11-12

Views

1 since deposited on 2020-12-01
Acq. date: 2025-11-12

Citations

Citation copied

Battisti, A., Pfütze, D., Säuberli, A., Kostrzewa, M., & Ebling, S. (2020, May 16). A Corpus for Automatic Readability Assessment and Text Simplification of German. 12th Edition of its Language Resources and Evaluation Conference, Marseille. https://www.aclweb.org/anthology/2020.lrec-1.404.pdf

Green Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image