Publication: The LiRI Corpus Platform
The LiRI Corpus Platform
Date
Date
Date
| cris.virtual.orcid | https://orcid.org/0000-0002-5780-5665 | |
| cris.virtual.orcid | https://orcid.org/0000-0002-0459-5086 | |
| cris.virtual.orcid | https://orcid.org/0000-0002-2134-2013 | |
| cris.virtualsource.orcid | 56acab60-8e01-4c2d-a26e-35a806e6b999 | |
| cris.virtualsource.orcid | 21fff778-6a26-4132-abdc-3ceff710ddb2 | |
| cris.virtualsource.orcid | 28c67ff6-3e63-4ddb-a9a7-1b9b7e569a74 | |
| dc.contributor.institution | University of Zurich | |
| dc.date.accessioned | 2024-07-18T11:33:15Z | |
| dc.date.available | 2024-07-18T11:33:15Z | |
| dc.date.issued | 2024-07-09 | |
| dc.description.abstract | We present the LiRI Corpus Platform (LCP), a software system and infrastructure for querying a vast array of corpora of different kinds. It heavily relies on the PostgreSQL relational database management system, employing state-of-the-art data representation and indexing techniques, which lead to significant performance gains when querying, even for structurally complex queries involving nested logical operations and quantifiers. In this work, we describe the requirements that led to the development of this novel system, discuss methods from corpus linguistics and beyond that we considered key for such a system, and provide details on a number of technological features that we take advantage of. Our platform also comes with its own query language tailored both to the requirements in terms of information need and our philosophy of how to define corpora in an abstract way. | |
| dc.identifier.doi | 10.3384/ecp210010 | |
| dc.identifier.isbn | 978-91-8075-740-9 | |
| dc.identifier.issn | 1650-3740 | |
| dc.identifier.uri | https://www.zora.uzh.ch/handle/20.500.14742/220423 | |
| dc.language.iso | eng | |
| dc.subject.ddc | 410 Linguistics | |
| dc.subject.ddc | 000 Computer science, knowledge & systems | |
| dc.title | The LiRI Corpus Platform | |
| dc.type | conference_item | |
| dcterms.accessRights | info:eu-repo/semantics/openAccess | |
| dcterms.bibliographicCitation.journaltitle | Linköping Electronic Conference Proceedings | |
| dcterms.bibliographicCitation.originalpublishername | Linköping University Electronic Press | |
| dcterms.bibliographicCitation.pageend | 75 | |
| dcterms.bibliographicCitation.pagestart | 62 | |
| dspace.entity.type | Publication | en |
| oairecerif.event.country | Belgium | |
| oairecerif.event.endDate | 2023-10-18 | |
| oairecerif.event.place | Leuven | |
| oairecerif.event.startDate | 2023-10-16 | |
| uzh.contributor.author | Graën, Johannes | |
| uzh.contributor.author | Schaber, Jonathan | |
| uzh.contributor.author | McDonald, Daniel | |
| uzh.contributor.author | Mustač, Igor | |
| uzh.contributor.author | Rajović, Nikolina | |
| uzh.contributor.author | Schneider, Gerold | |
| uzh.contributor.author | Vuković, Teodora | |
| uzh.contributor.author | Zehr, Jeremy | |
| uzh.contributor.author | Bubenhofer, Noah | |
| uzh.contributor.correspondence | Yes | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.contributor.correspondence | No | |
| uzh.document.availability | published_version | |
| uzh.eprint.datestamp | 2024-07-18 11:33:15 | |
| uzh.eprint.lastmod | 2025-03-26 13:16:43 | |
| uzh.eprint.statusChange | 2024-07-18 11:33:15 | |
| uzh.event.presentationType | paper | |
| uzh.event.title | CLARIN Annual Conference 2023 | |
| uzh.event.type | conference | |
| uzh.harvester.eth | Yes | |
| uzh.harvester.nb | No | |
| uzh.identifier.doi | 10.5167/uzh-261076 | |
| uzh.jdb.eprintsId | 36465 | |
| uzh.oastatus.unpaywall | closed | |
| uzh.oastatus.zora | Gold | |
| uzh.publication.citation | Graën, Johannes; Schaber, Jonathan; McDonald, Daniel; Mustač, Igor; Rajović, Nikolina; Schneider, Gerold; Vuković, Teodora; Zehr, Jeremy; Bubenhofer, Noah (2024). The LiRI Corpus Platform. In: CLARIN Annual Conference 2023, Leuven, Belgium, 16 October 2023 - 18 October 2023. Linköping University Electronic Press, 62-75. | |
| uzh.publication.freeAccessAt | doi | |
| uzh.publication.originalwork | original | |
| uzh.publication.publishedStatus | final | |
| uzh.publication.seriesTitle | Linköping Electronic Conference Proceedings | |
| uzh.relatedUrl.url | https://www.zora.uzh.ch/id/eprint/257131/ | |
| uzh.workflow.doaj | uzh.workflow.doaj.false | |
| uzh.workflow.eprintid | 261076 | |
| uzh.workflow.fulltextStatus | public | |
| uzh.workflow.revisions | 28 | |
| uzh.workflow.rightsCheck | keininfo | |
| uzh.workflow.source | Crossref:10.3384/ecp210010 | |
| uzh.workflow.status | archive | |
| Files | ||
| Publication available in collections: |