Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Interactive extraction of diverse vocal units from a planar embedding without the need for prior sound segmentation

Lorenz, Corinna; Hao, Xinyu; Tomka, Tomas; Rüttimann, Linus; Hahnloser, Richard H R (2023). Interactive extraction of diverse vocal units from a planar embedding without the need for prior sound segmentation. Frontiers in Bioinformatics, 2:966066.

Abstract

Annotating and proofreading data sets of complex natural behaviors such as vocalizations are tedious tasks because instances of a given behavior need to be correctly segmented from background noise and must be classified with minimal false positive error rate. Low-dimensional embeddings have proven very useful for this task because they can provide a visual overview of a data set in which distinct behaviors appear in different clusters. However, low-dimensional embeddings introduce errors because they fail to preserve distances; and embeddings represent only objects of fixed dimensionality, which conflicts with vocalizations that have variable dimensions stemming from their variable durations. To mitigate these issues, we introduce a semi-supervised, analytical method for simultaneous segmentation and clustering of vocalizations. We define a given vocalization type by specifying pairs of high-density regions in the embedding plane of sound spectrograms, one region associated with vocalization onsets and the other with offsets. We demonstrate our two-neighborhood (2N) extraction method on the task of clustering adult zebra finch vocalizations embedded with UMAP. We show that 2N extraction allows the identification of short and long vocal renditions from continuous data streams without initially committing to a particular segmentation of the data. Also, 2N extraction achieves much lower false positive error rate than comparable approaches based on a single defining region. Along with our method, we present a graphical user interface (GUI) for visualizing and annotating data.

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:07 Faculty of Science > Institute of Neuroinformatics
Dewey Decimal Classification:570 Life sciences; biology
Scopus Subject Areas:Life Sciences > Biochemistry
Life Sciences > Biotechnology
Physical Sciences > Computational Mathematics
Physical Sciences > Statistics and Probability
Life Sciences > Structural Biology
Uncontrolled Keywords:General Medicine
Language:English
Date:13 January 2023
Deposited On:30 Jan 2024 15:29
Last Modified:28 Apr 2025 01:39
Publisher:Frontiers Research Foundation
ISSN:2673-7647
OA Status:Gold
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.3389/fbinf.2022.966066
PubMed ID:36710910
Download PDF  'Interactive extraction of diverse vocal units from a planar embedding without the need for prior sound segmentation'.
Preview
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Altmetrics

Downloads

3 downloads since deposited on 30 Jan 2024
2 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications