Header

UZH-Logo

Maintenance Infos

Semantic word cloud representations: hardness and approximation algorithms - Zurich Open Repository and Archive


Barth, Lukas; Fabrikant, Sara I; Kobourov, Stephen; Lubiw, Anna; Nöllenburg, Martin; Okamoto, Yoshio; Pupyrev, Sergey; Squarcella, Claudio; Ueckerdt, Torsten; Wolff, Alexander (2013). Semantic word cloud representations: hardness and approximation algorithms. arXiv 1311.4778, University of Zurich.

Abstract

We study a geometric representation problem, where we are given a set R of axis-aligned rectangles with fixed dimensions and a graph with vertex set R. The task is to place the rectangles without overlap such that two rectangles touch if and only if the graph contains an edge between them. We call this problem CONTACT REPRESENTATION OF WORD NETWORKS (CROWN). It formalizes the geometric problem behind drawing word clouds in which semantically related words are close to each other. Here, we represent words by rectangles and semantic relationships by edges.
We show that CROWN is strongly NP-hard even restricted trees and weakly NP-hard if restricted stars. We consider the optimization problem MAX-CROWN where each adjacency induces a certain profit and the task is to maximize the sum of the profits. For this problem, we present constant-factor approximations for several graph classes, namely stars, trees, planar graphs, and graphs of bounded degree. Finally, we evaluate the algorithms experimentally and show that our best method improves upon the best existing heuristic by 45%.

Abstract

We study a geometric representation problem, where we are given a set R of axis-aligned rectangles with fixed dimensions and a graph with vertex set R. The task is to place the rectangles without overlap such that two rectangles touch if and only if the graph contains an edge between them. We call this problem CONTACT REPRESENTATION OF WORD NETWORKS (CROWN). It formalizes the geometric problem behind drawing word clouds in which semantically related words are close to each other. Here, we represent words by rectangles and semantic relationships by edges.
We show that CROWN is strongly NP-hard even restricted trees and weakly NP-hard if restricted stars. We consider the optimization problem MAX-CROWN where each adjacency induces a certain profit and the task is to maximize the sum of the profits. For this problem, we present constant-factor approximations for several graph classes, namely stars, trees, planar graphs, and graphs of bounded degree. Finally, we evaluate the algorithms experimentally and show that our best method improves upon the best existing heuristic by 45%.

Downloads

230 downloads since deposited on 23 Jan 2014
37 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Working Paper
Communities & Collections:07 Faculty of Science > Institute of Geography
Dewey Decimal Classification:910 Geography & travel
Language:English
Date:November 2013
Deposited On:23 Jan 2014 10:35
Last Modified:14 Aug 2017 19:56
Series Name:arXiv
Number of Pages:14
Official URL:http://arxiv.org/abs/1311.4778
Related URLs:http://arxiv.org (Publisher)

Download

Preview Icon on Download
Preview
Filetype: PDF
Size: 1MB

TrendTerms

TrendTerms displays relevant terms of the abstract of this publication and related documents on a map. The terms and their relations were extracted from ZORA using word statistics. Their timelines are taken from ZORA as well. The bubble size of a term is proportional to the number of documents where the term occurs. Red, orange, yellow and green colors are used for terms that occur in the current document; red indicates high interlinkedness of a term with other terms, orange, yellow and green decreasing interlinkedness. Blue is used for terms that have a relation with the terms in this document, but occur in other documents.
You can navigate and zoom the map. Mouse-hovering a term displays its timeline, clicking it yields the associated documents.

Author Collaborations