Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Overlap interval partition join

Dignös, Anton; Böhlen, Michael Hanspeter; Gamper, Johann (2014). Overlap interval partition join. In: ACM SIGMOD 2014 international conference on Management of Data, Snowbird, Utah, USA, 22 July 2014 - 27 July 2014. ACM, 1459-1470.

Abstract

Each tuple in a valid-time relation includes an interval attribute T that represents the tuple's valid time. The overlap join between two valid-time relations determines all pairs of tuples with overlapping intervals. Although overlap joins are common, existing partitioning and indexing schemes are inefficient if the data includes long-lived tuples or if intervals intersect partition boundaries. We propose Overlap Interval Partitioning (OIP), a new partitioning approach for data with an interval. OIP divides the time range of a relation into k base granules and defines overlapping partitions for sequences of contiguous granules. OIP is the first partitioning method for interval data that gives a constant clustering guarantee: the difference in duration between the interval of a tuple and the interval of its partition is independent of the duration of the tuple's interval. We offer a detailed analysis of the average false hit ratio and the average number of partition accesses for queries with overlap predicates, and we prove that the average false hit ratio is independent of the number of short- and long-lived tuples. To compute the overlap join, we propose the Overlap Interval Partition Join (OIPJoin), which uses OIP to partition the input relations on-the-fly. Only the tuples from overlapping partitions have to be joined to compute the result. We analytically derive the optimal number of granules, k, for partitioning the two input relations, from the size of the data, the cost of CPU operations, and the cost of main memory or disk IOs. Our experiments confirm the analytical results and show that the OIPJoin outperforms state-of-the-art techniques for the overlap join.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Scopus Subject Areas:Physical Sciences > Software
Physical Sciences > Information Systems
Scope:Discipline-based scholarship (basic research)
Language:English
Event End Date:27 July 2014
Deposited On:08 Jul 2014 10:30
Last Modified:03 Dec 2024 19:58
Publisher:ACM
Series Name:SIGMOD '14
ISBN:978-1-4503-2376-5
OA Status:Closed
Publisher DOI:https://doi.org/10.1145/2588555.2612175
Related URLs:http://www.sigmod2014.org/ (Organisation)
Other Identification Number:merlin-id:9628

Metadata Export

Statistics

Citations

Dimensions.ai Metrics
32 citations in Web of Science®
46 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

2 downloads since deposited on 08 Jul 2014
0 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications