Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Dense semantic labeling of subdecimeter resolution images with convolutional neural networks

Volpi, Michele; Tuia, Devis (2017). Dense semantic labeling of subdecimeter resolution images with convolutional neural networks. IEEE Transactions on Geoscience and Remote Sensing, 55(2):881-893.

Abstract

Semantic labeling (or pixel-level land-cover classification) in ultrahigh-resolution imagery (<10 cm) requires statistical models able to learn high-level concepts from spatial data, with large appearance variations. Convolutional neural networks (CNNs) achieve this goal by learning discriminatively a hierarchy of representations of increasing abstraction. In this paper, we present a CNN-based system relying on a downsample-thenupsample architecture. Specifically, it first learns a rough spatial map of high-level representations by means of convolutions and then learns to upsample them back to the original resolution by deconvolutions. By doing so, the CNN learns to densely label every pixel at the original resolution of the image. This results in many advantages, including: 1) the state-of-the-art numerical accuracy; 2) the improved geometric accuracy of predictions; and 3) high efficiency at inference time. We test the proposed system on the Vaihingen and Potsdam sub decimeter resolution data sets, involving the semantic labeling of aerial images of 9- and 5-cm resolution, respectively. These data sets are composed by many large and fully annotated tiles, allowing an unbiased evaluation of models making use of spatial information. We do so by comparing two standard CNN architectures with the proposed one: standard patch classification, prediction of local label patches by employing only convolutions, and full patch labeling by employing deconvolutions. All the systems compare favorably or outperform a state-of-the-art baseline relying on superpixels and powerful appearance descriptors. The proposed full patch labeling CNN outperforms these models by a large margin, also showing a very appealing inference time.

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:07 Faculty of Science > Institute of Geography
Dewey Decimal Classification:910 Geography & travel
Scopus Subject Areas:Physical Sciences > Electrical and Electronic Engineering
Physical Sciences > General Earth and Planetary Sciences
Language:English
Date:2017
Deposited On:01 Dec 2016 15:35
Last Modified:15 Jan 2025 02:43
Publisher:Institute of Electrical and Electronics Engineers
ISSN:0196-2892
OA Status:Closed
Publisher DOI:https://doi.org/10.1109/tgrs.2016.2616585
Project Information:
  • Funder: SNSF
  • Grant ID: PP00P2_150593
  • Project Title: Multimodal machine learning for remote sensing information fusion

Metadata Export

Statistics

Citations

Dimensions.ai Metrics
371 citations in Web of Science®
443 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

2 downloads since deposited on 01 Dec 2016
0 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications