Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-52963
Schneider, G (2011). Using automatically parsed corpora to discover lexico-grammatical features of English varieties. In: 30th International Conference on Lexis and Grammar, Nicosia, Cyprus, 5 October 2011 - 8 October 2011, 251-258.
|Accepted Version (English)|
We employ syntactic parsing to describe and discover lexico-grammatical features of regional varieties of English. In the absence of suitable treebanks, automatically parsed corpora (tree jungles) can be used instead. As an example, we focus on Indian English, using the International Corpus of English (ICE) and the British National Corpus (BNC). Our method is largely corpus-driven. The frequencies of syntactic relations differ little between the corpora, but considerable differences emerge once the intricate relations between grammar and lexis are taken into account. We describe differences in the use of zero articles, verb-preposition constructions, and ditransitive verbs. We show that relatively small corpora can be used to discover subtle lexico-grammatical differences.
|Item Type:||Conference or Workshop Item (Other), refereed, original work|
|Communities & Collections:||06 Faculty of Arts > English Department; 06 Faculty of Arts > Institute of Computational Linguistics|
|DDC:||820 English & Old English literatures; 000 Computer science, knowledge & systems|
|Uncontrolled Keywords:||lexico-grammar syntactic parsing language variation Indian English corpus-driven|
|Event End Date:||8 October 2011|
|Deposited On:||05 Mar 2012 13:23|
|Last Modified:||24 Oct 2012 11:28|
|Publisher:||University of Cyprus, Department of French Studies and Modern Languages|