Navigation auf zora.uzh.ch

Search

ZORA (Zurich Open Repository and Archive)

Ab Antiquo: Neural Proto-language Reconstruction

Meloni, Carlo; Ravfogel, Shauli; Goldberg, Yoav (2021). Ab Antiquo: Neural Proto-language Reconstruction. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, 6 June 2021 - 11 June 2021. ACL Anthology, 4460-4473.

Abstract

Historical linguists have identified regularities in the process of historic sound change. The comparative method utilizes those regularities to reconstruct proto-words based on observed forms in daughter languages. Can this process be efficiently automated? We address the task of proto-word reconstruction, in which the model is exposed to cognates in contemporary daughter languages, and has to predict the proto word in the ancestor language. We provide a novel dataset for this task, encompassing over 8,000 comparative entries, and show that neural sequence models outperform conventional methods applied to this task so far. Error analysis reveals a variability in the ability of neural model to capture different phonological changes, correlating with the complexity of the changes. Analysis of learned embeddings reveals the models learn phonologically meaningful generalizations, corresponding to well-attested phonological shifts documented by historical linguistics.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Department of Comparative Language Science
Dewey Decimal Classification:490 Other languages
890 Other literatures
410 Linguistics
Language:English
Event End Date:11 June 2021
Deposited On:24 Jan 2022 17:31
Last Modified:20 Feb 2023 14:56
Publisher:ACL Anthology
OA Status:Hybrid
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.18653/v1/2021.naacl-main.353
Download PDF  'Ab Antiquo: Neural Proto-language Reconstruction'.
Preview
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics
9 citations in Web of Science®
19 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

35 downloads since deposited on 24 Jan 2022
28 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications