Header

UZH-Logo

Maintenance Infos

Zero-Shot Crosslingual Sentence Simplification


Mallinson, Jonathan; Sennrich, Rico; Lapata, Mirella (2020). Zero-Shot Crosslingual Sentence Simplification. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, 16 November 2020 - 20 November 2020, 5109-5126.

Abstract

Sentence simplification aims to make sentences easier to read and understand. Recent approaches have shown promising results with encoder-decoder models trained on large amounts of parallel data which often only exists in English. We propose a zero-shot modeling framework which transfers simplification knowledge from English to another language (for which no parallel simplification corpus exists) while generalizing across languages and tasks. A shared transformer encoder constructs language-agnostic representations, with a combination of task-specific encoder layers added on top (e.g., for translation and simplification). Empirical results using both human and automatic metrics show that our approach produces better simplifications than unsupervised and pivot-based methods.

Abstract

Sentence simplification aims to make sentences easier to read and understand. Recent approaches have shown promising results with encoder-decoder models trained on large amounts of parallel data which often only exists in English. We propose a zero-shot modeling framework which transfers simplification knowledge from English to another language (for which no parallel simplification corpus exists) while generalizing across languages and tasks. A shared transformer encoder constructs language-agnostic representations, with a combination of task-specific encoder layers added on top (e.g., for translation and simplification). Empirical results using both human and automatic metrics show that our approach produces better simplifications than unsupervised and pivot-based methods.

Statistics

Downloads

108 downloads since deposited on 10 Nov 2020
108 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:20 November 2020
Deposited On:10 Nov 2020 10:52
Last Modified:27 Nov 2020 07:34
Publisher:Association for Computational Linguistics
OA Status:Green
Official URL:https://www.aclweb.org/anthology/2020.emnlp-main.415
Project Information:
  • : FunderH2020
  • : Grant ID681760
  • : Project TitleTranslating from Multiple Modalities into Text
  • : FunderSNSF
  • : Grant IDPP00P1_176727
  • : Project TitleMulti-Task Learning with Multilingual Resources for Better Natural Language Understanding

Download

Green Open Access

Download PDF  'Zero-Shot Crosslingual Sentence Simplification'.
Preview
Content: Published Version
Language: English
Filetype: PDF
Size: 483kB
Licence: Creative Commons: Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)