ZORA (Zurich Open Repository and Archive)

Tracing Linguistic Footprints of ChatGPT Across Tasks, Domains and Personas in English and German

Shaitarova, Anastassia; Bauer, Nikolaj; Vamvas, Jannis; Volk, Martin (2024). Tracing Linguistic Footprints of ChatGPT Across Tasks, Domains and Personas in English and German. In: The 9th edition of the Swiss Text Analytics Conference, Chur, Switzerland, 10 June 2024 - 11 June 2024. Association for Computational Linguistics, 102-112.

Abstract

Large language models like ChatGPT can be used to generate seemingly human-like text. However, it is still not well understood how their output differs from text written by humans, and to what degree prompting influences their linguistic profile. In our paper, we instruct ChatGPT to complete, explain, and create texts in English and German across journalistic, scientific, and clinical domains. Within each task, we assign corpus-specific personas through the system setting of the prompt. We extract a large number of linguistic features and perform statistical and qualitative comparisons across text pairs. Our results show that prompting has a larger impact on English output than on German output. The most basic features, such as mean word length, distinctly set human and generated texts apart. Readability metrics indicate that ChatGPT overcomplicates English texts, particularly in the clinical domain, while generated German texts suffer from excessive morpho-syntactic standardization coupled with lexical simplification.
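The kind of pairwise feature comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: this record does not specify their feature set, tokenizer, or statistical tests. The regex tokenizer, the function names (`tokenize`, `mean_word_length`, `type_token_ratio`, `compare_pair`), and the choice of type-token ratio as a second feature are all assumptions made for illustration; only mean word length is named in the abstract.

```python
import re
from statistics import mean

# Assumed tokenizer: runs of Unicode letters, optionally joined by a
# hyphen or apostrophe, so German words such as "Über-Ich" stay whole.
WORD_RE = re.compile(r"[^\W\d_]+(?:[-'][^\W\d_]+)*")


def tokenize(text):
    """Return the word tokens found in text."""
    return WORD_RE.findall(text)


def mean_word_length(text):
    """Average number of characters per word (0.0 for empty input)."""
    words = tokenize(text)
    return mean(len(w) for w in words) if words else 0.0


def type_token_ratio(text):
    """Distinct lowercased words divided by total words (0.0 if empty)."""
    words = [w.lower() for w in tokenize(text)]
    return len(set(words)) / len(words) if words else 0.0


def compare_pair(human, generated):
    """Compute each feature for both texts plus generated-minus-human."""
    features = {
        "mean_word_length": mean_word_length,
        "type_token_ratio": type_token_ratio,
    }
    report = {}
    for name, func in features.items():
        h, g = func(human), func(generated)
        report[name] = {"human": h, "generated": g, "diff": g - h}
    return report
```

Calling `compare_pair(human_text, generated_text)` on one text pair returns a nested dictionary with one entry per feature. A study of the kind described would aggregate such values over many pairs before applying any statistical test.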

Additional indexing

Item Type: Conference or Workshop Item (Paper), refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Language: English
Event End Date: 11 June 2024
Deposited On: 07 Feb 2025 18:02
Last Modified: 07 Feb 2025 18:02
Publisher: Association for Computational Linguistics
OA Status: Green
Free access at: Official URL. An embargo period may apply.
Official URL: https://aclanthology.org/2024.swisstext-1.9/
Full text: Published Version (PDF, English)
