Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Addressing the Blind Spots in Spoken Language Processing

Moryossef, Amit (2023). Addressing the Blind Spots in Spoken Language Processing. ArXiv.org 2309.06572, Cornell University.

Abstract

This paper explores the critical but often overlooked role of non-verbal cues, including co-speech gestures and facial expressions, in human communication and their implications for Natural Language Processing (NLP). We argue that understanding human communication requires a more holistic approach that goes beyond textual or spoken words to include non-verbal elements. Borrowing from advances in sign language processing, we propose the development of universal automatic gesture segmentation and transcription models to transcribe these non-verbal cues into textual form. Such a methodology aims to bridge the blind spots in spoken language understanding, enhancing the scope and applicability of NLP models. Through motivating examples, we demonstrate the limitations of relying solely on text-based models. We propose a computationally efficient and flexible approach for incorporating non-verbal cues, which can seamlessly integrate with existing NLP pipelines. We conclude by calling upon the research community to contribute to the development of universal transcription methods and to validate their effectiveness in capturing the complexities of real-world, multi-modal interactions.

Additional indexing

Item Type:Working Paper
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Date:September 2023
Deposited On:12 Oct 2023 07:54
Last Modified:28 May 2024 13:44
Series Name:ArXiv.org
ISSN:2331-8422
OA Status:Green
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.48550/arXiv.2309.06572
Download PDF  'Addressing the Blind Spots in Spoken Language Processing'.
Preview
  • Content: Draft Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

4 downloads since deposited on 12 Oct 2023
2 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications