Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts

Basu, Chandrayee; Kottekulam Vasu, Rosni; Yasunaga, Michihiro; Yang, Qiang (2023). Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts. In: AAAI Conference on Artificial Intelligence, Washington DC, USA, 7 February 2023 - 14 February 2023. AAAI Press, 14093-14101.

Abstract

Automatic medical text simplification can assist providers with patient-friendly communication and make medical texts more accessible, thereby improving health literacy. But curating a quality corpus for this task requires the supervision of medical experts. In this work, we present Med-EASi (Medical dataset for Elaborative and Abstractive Simplification), a uniquely crowdsourced and finely annotated dataset for supervised simplification of short medical texts. Its expert-layman-AI collaborative annotations facilitate controllability over text simplification by marking four kinds of textual transformations: elaboration, replacement, deletion, and insertion. To learn medical text simplification, we fine-tune T5-large with four different styles of input-output combinations, leading to two control-free and two controllable versions of the model. We add two types of controllability into text simplification, by using a multi-angle training approach: position-aware, which uses in-place annotated inputs and outputs, and position-agnostic, where the model only knows the contents to be edited, but not their positions. Our results show that our fine-grained annotations improve learning compared to the unannotated baseline. Furthermore, our position-aware control enhances the model's ability to generate better simplification than the position-agnostic version. The data and code are available at https://github.com/Chandrayee/CTRL-SIMP.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Scopus Subject Areas:Physical Sciences > Artificial Intelligence
Scope:Discipline-based scholarship (basic research)
Language:English
Event End Date:14 February 2023
Deposited On:07 Oct 2024 15:06
Last Modified:09 Oct 2024 12:07
Publisher:AAAI Press
Series Name:Proceedings of the AAAI Conference on Artificial Intelligence
Number:12
ISSN:2159-5399
OA Status:Green
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.1609/aaai.v37i12.26649
Download PDF  'Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts'.
Preview
  • Content: Published Version
  • Language: English
  • Publisher License

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

16 downloads since deposited on 07 Oct 2024
16 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications