Header

UZH-Logo

Maintenance Infos

Degrees of non-standardness. Feature-based analysis of variation in a Torlak dialect corpus


Vuković, Teodora; Escher, Anastasia; Sonnenhauser, Barbara (2022). Degrees of non-standardness. Feature-based analysis of variation in a Torlak dialect corpus. International Journal of Corpus Linguistics, 27(2):220-247.

Abstract

A corpus-based method for assessing a range of dialect-standard variation is presented for identifying samples exhibiting the highest prevalence of dialect features. This method provides insight into areal and inter-speaker variation and allows the extraction of maximally non-standard manifestations of the dialect, which may then be sampled and used for the study of language change and variation. The focus is on a non-standard Torlak variety, which has undergone considerable change under the influence of standard Serbian. The degree of variation is assessed by measuring the frequencies of five distinguishing linguistic features: accent position, dative reflexive si, auxiliary omission in the compound perfect, the post-positive article, and analytic case marking in the indirect object and possessive. Locations subject to the greatest and least influence of the standard are revealed using hierarchical clustering. A positive correlation between the frequencies of occurrence reveals which non-standard feature is the best predictor of the others.

Abstract

A corpus-based method for assessing a range of dialect-standard variation is presented for identifying samples exhibiting the highest prevalence of dialect features. This method provides insight into areal and inter-speaker variation and allows the extraction of maximally non-standard manifestations of the dialect, which may then be sampled and used for the study of language change and variation. The focus is on a non-standard Torlak variety, which has undergone considerable change under the influence of standard Serbian. The degree of variation is assessed by measuring the frequencies of five distinguishing linguistic features: accent position, dative reflexive si, auxiliary omission in the compound perfect, the post-positive article, and analytic case marking in the indirect object and possessive. Locations subject to the greatest and least influence of the standard are revealed using hierarchical clustering. A positive correlation between the frequencies of occurrence reveals which non-standard feature is the best predictor of the others.

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

50 downloads since deposited on 22 May 2022
42 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Slavonic Studies
06 Faculty of Arts > Zurich Center for Linguistics
08 Research Priority Programs > Language and Space
Dewey Decimal Classification:490 Other languages
410 Linguistics
Uncontrolled Keywords:Linguistics, Language contact, morphosyntax
Language:English
Date:8 June 2022
Deposited On:22 May 2022 09:59
Last Modified:28 Nov 2023 02:42
Publisher:John Benjamins Publishing
ISSN:1384-6655
OA Status:Green
Publisher DOI:https://doi.org/10.1075/ijcl.20014.vuk
Project Information:
  • : FunderSNSF
  • : Grant ID100015_176378
  • : Project Title‘Ill-bred sons’, family and friends: tracing the multiple affiliations of Balkan Slavic
  • : FunderSNSF
  • : Grant IDIZRPZ0_177557
  • : Project Title(Dis-)entangling traditions on the Central Balkans: Performance and perception (TraCeBa)