Header

UZH-Logo

Maintenance Infos

Solving structured segmentation of aerial images as puzzles


Marcos, Diego; Volpi, Michele; Tuia, Devis (2016). Solving structured segmentation of aerial images as puzzles. In: IEEE International Geoscience & Remote Sensing Symposium IGARSS, Beijing (China), 10 July 2016 - 15 July 2016. Institute of Electrical and Electronics Engineers, 3346-3349.

Abstract

Traditional approaches to structured semantic segmentation employ appearance-based classifiers to provide a class likelihood at each spatial location and then post-process it with Markov Random Fields (MRF) to enforce label smoothness and structure in the output space. The spatial support for such techniques is usually a patch of pixels, which makes the prediction over-smoothed because the borders of objects are not explicitly taken into account. This is further exacerbated by MRF post-processing employing the standard Potts model, which tends to further over-smooth predictions at boundaries. In this paper, we propose a different but related approach: we optimize an energy function finding the optimal combination of small ground truth (GT) tiles from training data over predictions at test time, effectively solving a puzzle. We optimize over a first configuration given by a Convolutional Neural Network (CNN) output.

Abstract

Traditional approaches to structured semantic segmentation employ appearance-based classifiers to provide a class likelihood at each spatial location and then post-process it with Markov Random Fields (MRF) to enforce label smoothness and structure in the output space. The spatial support for such techniques is usually a patch of pixels, which makes the prediction over-smoothed because the borders of objects are not explicitly taken into account. This is further exacerbated by MRF post-processing employing the standard Potts model, which tends to further over-smooth predictions at boundaries. In this paper, we propose a different but related approach: we optimize an energy function finding the optimal combination of small ground truth (GT) tiles from training data over predictions at test time, effectively solving a puzzle. We optimize over a first configuration given by a Convolutional Neural Network (CNN) output.

Statistics

Citations

Altmetrics

Downloads

1 download since deposited on 08 Feb 2017
0 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:07 Faculty of Science > Institute of Geography
Dewey Decimal Classification:910 Geography & travel
Scopus Subject Areas:Physical Sciences > Computer Science Applications
Physical Sciences > General Earth and Planetary Sciences
Language:English
Event End Date:15 July 2016
Deposited On:08 Feb 2017 15:52
Last Modified:26 Jan 2022 12:08
Publisher:Institute of Electrical and Electronics Engineers
Series Name:IEEE International Geoscience and Remote Sensing Symposium Proceedings
ISSN:2153-6996
ISBN:978-1-5090-3332-4
Additional Information:Proceedings
OA Status:Closed
Publisher DOI:https://doi.org/10.1109/IGARSS.2016.7729865