Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

CLIP and the City: Addressing the Artificial Encoding of Cities in Multimodal Foundation Deep Learning Models

Negueruela del Castillo, Dario; Neri, Iacopo (2023). CLIP and the City: Addressing the Artificial Encoding of Cities in Multimodal Foundation Deep Learning Models. In: On Architecture Challenges in Design, Belgrade, Serbia, 7 December 2023 - 8 December 2023. STRAND – Sustainable Urban Society Association, 100-109.

Abstract

In this project, we propose and explore a computational pipeline to examine urban cultural landscapes through the lens of artificial intelligence, and for questioning modes of embedding culture in machine learning models. By employing machine learning models that extract features and textual properties from images, we aim to uncover the connections between a city's history, architecture, and urban development. The city of Rome serves as a significant case study for this research.
To achieve this objective, we feed 360° panoramic images into large vision-language models (e.g. OpenCLIP), to question how mainstream culture is expressed in these models. In this machine-triggered urban experiment, we investigate overlaps between history and machinic interpretation and whether relevant temporal correlations can be captured through generic street images only. Finally, by spatially analysing the captured data, we identify clusters and discontinuities in the urban layout aiming at visually depicting the interplay of forces behind its development.
As in a forensic exercise, the paper seeks to uncover the complex social and historical dynamics of urban environments, exploiting only contemporary images of their settings and a generic embedding of culture. It explores potential cultural biases embedded in machine learning models by comparing Rome - culturally relevant for the western world - with other cities around the world; leveraging innovative computational pipelines and globally covering datasets to provide a novel research line for urban studies.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Digital Visual Studies
Dewey Decimal Classification:900 History
410 Linguistics
400 Language
Language:English
Event End Date:8 December 2023
Deposited On:18 Dec 2023 12:15
Last Modified:18 Dec 2023 12:15
Publisher:STRAND – Sustainable Urban Society Association
Additional Information:Serbian Academy of Sciences and Arts, Gallery of Science and Technology and Rectorate of the University of Belgrade, Serbia
OA Status:Green
Official URL:https://www.strand.rs/2023-proceedings/
Download PDF  'CLIP and the City: Addressing the Artificial Encoding of Cities in Multimodal Foundation Deep Learning Models'.
Preview
  • Content: Published Version
  • Language: English

Metadata Export

Statistics

Downloads

80 downloads since deposited on 18 Dec 2023
79 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications