Publication:

Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents

Date

Date

Date
2023
Conference or Workshop Item
Published version

Citations

Citation copied

Vamvas, J., & Sennrich, R. (2023). Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents. Proceedings of the Conference on Empirical Methods in Natural Language Processing, 13543–13552. https://doi.org/10.18653/v1/2023.emnlp-main.835

Abstract

Abstract

Abstract

Automatically highlighting words that cause semantic differences between two documents could be useful for a wide range of applications. We formulate recognizing semantic differences (RSD) as a token-level regression task and study three unsupervised approaches that rely on a masked language model. To assess the approaches, we begin with basic English sentences and gradually move to more complex, cross-lingual document pairs. Our results show that an approach based on word alignment and sentence-level contrastive learning has a robust

Metrics

Citations

Additional indexing

Creators (Authors)

Event Title

Event Title

Event Title
2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Event Location

Event Location

Event Location
Singapore

Event Start Date

Event Start Date

Event Start Date
2023-12-06

Event End Date

Event End Date

Event End Date
2023-12-10

Page range/Item number

Page range/Item number

Page range/Item number
13543

Page end

Page end

Page end
13552

Item Type

Item Type

Item Type
Conference or Workshop Item

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Language

Language

Language
English

Date available

Date available

Date available
2023-12-13

Series Name

Series Name

Series Name
Proceedings of the Conference on Empirical Methods in Natural Language Processing

OA Status

OA Status

OA Status
Gold

Free Access at

Free Access at

Free Access at
DOI

Official URL

Official URL

Official URL

Related URLs

Related URLs

Related URLs

Metrics

Citations

Citations

Citation copied

Vamvas, J., & Sennrich, R. (2023). Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents. Proceedings of the Conference on Empirical Methods in Natural Language Processing, 13543–13552. https://doi.org/10.18653/v1/2023.emnlp-main.835

Gold Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image