Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents

Vamvas, Jannis; Sennrich, Rico (2023). Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents. In: 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore, 6 December 2023 - 10 December 2023. Association for Computational Linguistics, 13543-13552.

Abstract

Automatically highlighting words that cause semantic differences between two documents could be useful for a wide range of applications. We formulate recognizing semantic differences (RSD) as a token-level regression task and study three unsupervised approaches that rely on a masked language model. To assess the approaches, we begin with basic English sentences and gradually move to more complex, cross-lingual document pairs. Our results show that an approach based on word alignment and sentence-level contrastive learning has a robust correlation to gold labels. However, all unsupervised approaches still leave a large margin of improvement.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
06 Faculty of Arts > Zurich Center for Linguistics
Dewey Decimal Classification:410 Linguistics
000 Computer science, knowledge & systems
Language:English
Event End Date:10 December 2023
Deposited On:13 Dec 2023 11:24
Last Modified:31 Mar 2024 03:36
Publisher:Association for Computational Linguistics
Series Name:Proceedings of the Conference on Empirical Methods in Natural Language Processing
OA Status:Gold
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.18653/v1/2023.emnlp-main.835
Official URL:https://aclanthology.org/2023.emnlp-main.835/
Related URLs:https://arxiv.org/abs/2305.13303 (Author)
Project Information:
Download PDF  'Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents'.
Preview
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

7 downloads since deposited on 13 Dec 2023
3 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications