Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Interactive visual labelling versus active learning: an experimental comparison

Chegini, Mohammad; Bernard, Jürgen; Cui, Jian; Chegini, Fatemeh; Sourin, Alexei; Andrews, Keith; Schreck, Tobias (2020). Interactive visual labelling versus active learning: an experimental comparison. Frontiers of Informaion Technology & Electronic Engineering, 21(4):524-535.

Abstract

Methods from supervised machine learning allow the classification of new data automatically and are tremendously helpful for data analysis. The quality of supervised maching learning depends not only on the type of algorithm used, but also on the quality of the labelled dataset used to train the classifier. Labelling instances in a training dataset is often done manually relying on selections and annotations by expert analysts, and is often a tedious and time-consuming process. Active learning algorithms can automatically determine a subset of data instances for which labels would provide useful input to the learning process. Interactive visual labelling techniques are a promising alternative, providing effective visual overviews from which an analyst can simultaneously explore data records and select items to a label. By putting the analyst in the loop, higher accuracy can be achieved in the resulting classifier. While initial results of interactive visual labelling techniques are promising in the sense that user labelling can improve supervised learning, many aspects of these techniques are still largely unexplored. This paper presents a study conducted using the mVis tool to compare three interactive visualisations, similarity map, scatterplot matrix (SPLOM), and parallel coordinates, with each other and with active learning for the purpose of labelling a multivariate dataset. The results show that all three interactive visual labelling techniques surpass active learning algorithms in terms of classifier accuracy, and that users subjectively prefer the similarity map over SPLOM and parallel coordinates for labelling. Users also employ different labelling strategies depending on the visualisation used.

Additional indexing

Item Type:Journal Article, not_refereed, original work
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Scopus Subject Areas:Physical Sciences > Signal Processing
Physical Sciences > Hardware and Architecture
Physical Sciences > Computer Networks and Communications
Physical Sciences > Electrical and Electronic Engineering
Uncontrolled Keywords:Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Visual Analytics, Interactive Visual Data Analysis
Scope:Discipline-based scholarship (basic research)
Language:English
Date:1 April 2020
Deposited On:01 Feb 2024 10:52
Last Modified:29 Dec 2024 04:33
Publisher:Zheijiang University Press
ISSN:2095-9184
OA Status:Green
Publisher DOI:https://doi.org/10.1631/fitee.1900549
Other Identification Number:merlin-id:24330
Download PDF  'Interactive visual labelling versus active learning: an experimental comparison'.
Preview
  • Content: Accepted Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Dimensions.ai Metrics

Altmetrics

Downloads

11 downloads since deposited on 01 Feb 2024
11 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications