Publication:

Using automatically parsed corpora to discover lexico-grammatical features of English varieties

Date

2011
Conference or Workshop Item
Published version
dc.contributor.institution: University of Zurich
dc.date.accessioned: 2012-03-05T13:23:33Z
dc.date.available: 2012-03-05T13:23:33Z
dc.date.issued: 2011-10-08
dc.description.abstract:

We employ syntactic parsing to describe and to discover lexico-grammatical features of English regional varieties. In the absence of suitable treebanks, automatically parsed corpora (tree jungles) can be used. As an example, we focus on Indian English, using the International Corpus of English (ICE) and the British National Corpus (BNC). We use a largely corpus-driven method. Frequencies of syntactic relations differ little between the corpora, but considerable differences emerge once the intricate relations between grammar and lexis are taken into account. We describe differences in the use of zero articles, verb-preposition constructions, and ditransitive verbs. We show that relatively small corpora can be used to discover subtle lexico-grammatical differences.

dc.identifier.uri: https://www.zora.uzh.ch/handle/20.500.14742/65147
dc.language.iso: eng
dc.subject: lexico-grammar
dc.subject: syntactic parsing
dc.subject: language variation
dc.subject: Indian English
dc.subject: corpus-driven
dc.subject.ddc: 000 Computer science, knowledge & systems
dc.subject.ddc: 820 English & Old English literatures
dc.subject.ddc: 410 Linguistics
dc.title:

Using automatically parsed corpora to discover lexico-grammatical features of English varieties

dc.type: conference_item
dcterms.accessRights: info:eu-repo/semantics/openAccess
dcterms.bibliographicCitation.originalpublishername: University of Cyprus, Department of French Studies and Modern Languages
dcterms.bibliographicCitation.pageend: 258
dcterms.bibliographicCitation.pagestart: 251
dcterms.bibliographicCitation.url: http://infolingu.univ-mlv.fr/Colloques/lgc/index.php?year=2011&lang=en&page=1
dspace.entity.type: Publication
oairecerif.event.country: Cyprus
oairecerif.event.endDate: 2011-10-08
oairecerif.event.place: Nicosia
oairecerif.event.startDate: 2011-10-05
uzh.contributor.author: Schneider, Gerold
uzh.contributor.correspondence: Yes
uzh.document.availability: postprint
uzh.eprint.datestamp: 2012-03-05 13:23:33
uzh.eprint.lastmod: 2020-11-27 07:15:35
uzh.eprint.statusChange: 2012-03-05 13:23:33
uzh.event.presentationType: other
uzh.event.title: 30th International Conference on Lexis and Grammar
uzh.event.type: conference
uzh.harvester.eth: Yes
uzh.harvester.nb: No
uzh.identifier.doi: 10.5167/uzh-52963
uzh.oastatus.zora: Green
uzh.publication.citation: Schneider, Gerold (2011). Using automatically parsed corpora to discover lexico-grammatical features of English varieties. In: 30th International Conference on Lexis and Grammar, Nicosia, Cyprus, 5 October 2011 - 8 October 2011, 251-258.
uzh.publication.originalwork: original
uzh.publication.publishedStatus: final
uzh.workflow.eprintid: 52963
uzh.workflow.fulltextStatus: public
uzh.workflow.revisions: 123
uzh.workflow.rightsCheck: offen
uzh.workflow.status: archive
Files

Original bundle

Name:
ICE_lexicogrammar_cyprus2011.pdf
Size:
187.46 KB
Format:
Adobe Portable Document Format
Publication available in collections: