Detecting innovations in a parsed corpus of learner English


Schneider, Gerold; Gilquin, Gaëtanelle (2018). Detecting innovations in a parsed corpus of learner English. In: Deshors, Sandra C.; Götz, Sandra; Laporte, Samantha. Rethinking linguistic creativity in non-native Englishes. Amsterdam: John Benjamins Publishing, 47-74.

Abstract

In research on L2 English, recent corpus-based studies indicate that some non-standard forms are shared by indigenized (ESL) and foreign (EFL) varieties of English, which challenges the idea of a clear dichotomy between innovation and error. We present a data-driven large-scale method to detect innovations, test it on verb + preposition structures (including phrasal verbs) and adjective + preposition structures, and describe similarities and differences between EFL and ESL. We use a dependency-parsed version of the International Corpus of Learner English to automatically extract potential innovations, defined as patterns of overuse compared to the British National Corpus as reference corpus. We measure overuse by means of collocation measures like O/E or T-score, and compare our results with similar results for ESL. In both quantitative and qualitative analyses, we detect similarities between the two varieties (e.g. discuss about) and dissimilarities (e.g. accuse for, only distinctive for EFL). We report more verb/adjective + preposition combinations than previous studies and discuss the roles of analogy and transfer.
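The two collocation measures named in the abstract can be illustrated with a minimal sketch. It assumes the common textbook definitions (expected frequency E = f(head) × f(preposition) / N, O/E as the ratio of observed to expected frequency, and T-score = (O − E) / √O); the chapter itself may use a different formulation. The function names and the counts in the usage lines are hypothetical and are not taken from the study, and the sketch scores a pair within a single corpus only, leaving out the comparison against the reference corpus.

```python
import math


def expected_frequency(f_head: int, f_prep: int, n_pairs: int) -> float:
    """Expected co-occurrence count if head and preposition were independent."""
    if n_pairs <= 0:
        raise ValueError("n_pairs must be positive")
    return f_head * f_prep / n_pairs


def o_over_e(observed: int, f_head: int, f_prep: int, n_pairs: int) -> float:
    """Observed/expected ratio; values above 1 mean the pair occurs
    more often than independence would predict."""
    return observed / expected_frequency(f_head, f_prep, n_pairs)


def t_score(observed: int, f_head: int, f_prep: int, n_pairs: int) -> float:
    """T-score in its usual collocation form: (O - E) / sqrt(O)."""
    if observed <= 0:
        raise ValueError("t-score is undefined for unobserved pairs")
    e = expected_frequency(f_head, f_prep, n_pairs)
    return (observed - e) / math.sqrt(observed)


# Hypothetical counts, invented purely for illustration:
# a verb seen 100 times, a preposition seen 400 times,
# 20 joint occurrences among 10,000 extracted head + preposition pairs.
ratio = o_over_e(20, 100, 400, 10_000)
score = t_score(20, 100, 400, 10_000)
print(f"O/E = {ratio:.2f}, T-score = {score:.2f}")
```

O/E tends to favour rare pairs, whereas the T-score rewards pairs with more evidence behind them, which is one reason the two measures are often reported side by side.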

Additional indexing

Item Type: Book Section, refereed, original work
Communities & Collections: 06 Faculty of Arts > English Department; 06 Faculty of Arts > Institute of Computational Linguistics; 06 Faculty of Arts > Center for Linguistics
Dewey Decimal Classification: 400 Language; 410 Linguistics
Language: English
Date: 19 July 2018
Deposited On: 23 Jan 2019 13:21
Last Modified: 13 Mar 2019 16:31
Publisher: John Benjamins Publishing
Series Name: Benjamins Current Topics
Number: 98
ISSN: 1874-0081
ISBN: 9789027201461
OA Status: Green
Publisher DOI: https://doi.org/10.1075/bct.98.03sch
Related URLs: https://benjamins.com/catalog/bct.98.03sch (Publisher); https://www.recherche-portal.ch/primo-explore/fulldisplay?docid=ebi01_prod011340476&context=L&vid=ZAD&search_scope=default_scope&tab=default_tab&lang=de_DE (Library Catalogue)

Download

Content: Accepted Version
Language: English
Filetype: PDF
Size: 1MB