Publication:

Using Multilingual Word Embeddings for Similarity-Based Word Alignments in a Zero-Shot Setting: Tested on the Case of German–Romansh

Date

Date

Date
2022
Master's Thesis

Citations

Citation copied

Dolev, E. L. (2022). Using Multilingual Word Embeddings for Similarity-Based Word Alignments in a Zero-Shot Setting: Tested on the Case of German–Romansh. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-233699

Abstract

Abstract

Abstract

Using multilingual word embeddings for computing word alignments has been shown to be competetive with statistical word alignment methods. However, the languages on which the experiments were made on were all “seen” languages, i.e., they were part of the training data for the embeddings. In this thesis I show that multilingual word embeddings taken from mBERT can be used for computing word alignments for the “unseen” language Romansh, aligned against German. The performance is on par with a baseline statistical model (fast_align). I a

Metrics

Downloads

55 since deposited on 2023-06-22
Acq. date: 2025-11-12

Views

155 since deposited on 2023-06-22
Acq. date: 2025-11-12

Citations

Additional indexing

Creators (Authors)

Institution

Institution

Institution

Faculty

Faculty

Faculty
Faculty of Arts

Item Type

Item Type

Item Type
Master's Thesis

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Keywords

NLP, Romansh, word alignment, corpus, multi-lingual corpus, Graubünden

Language

Language

Language
English

Publication date

Publication date

Publication date
2022-08-15

Date available

Date available

Date available
2023-06-22

Number of pages

Number of pages

Number of pages
99

OA Status

OA Status

OA Status
Green

Metrics

Downloads

55 since deposited on 2023-06-22
Acq. date: 2025-11-12

Views

155 since deposited on 2023-06-22
Acq. date: 2025-11-12

Citations

Citations

Citation copied

Dolev, E. L. (2022). Using Multilingual Word Embeddings for Similarity-Based Word Alignments in a Zero-Shot Setting: Tested on the Case of German–Romansh. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-233699

Green Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image