Publication:

On feature representations for marmoset vocal communication analysis

Date

Date

Date
2025
Journal Article
Published version
cris.lastimport.scopus2025-07-05T03:45:35Z
cris.lastimport.wos2025-07-05T01:35:22Z
cris.virtual.orcidhttps://orcid.org/0000-0003-4604-5952
cris.virtualsource.orcidb90b6bd3-f18f-4208-87c3-885f2e257da5
dc.contributor.institutionUniversity of Zurich
dc.date.accessioned2025-07-04T09:09:23Z
dc.date.available2025-07-04T09:09:23Z
dc.date.issued2025-05-04
dc.description.abstract

The acoustic analysis of marmoset (Callithrix jacchus) vocalisations is often used to understand the evolutionary origins of human language. Currently, the analysis is largely carried out in a manual or semi-manual manner. Thus, there is a need to develop automatic call analysis methods. In that direction, research has been limited to the development of analysis methods with small amounts of data or for specific scenarios. Furthermore, there is lack of prior knowledge about what type of information is relevant for different call analysis tasks. To address these issues, as a first step, this paper explores different feature representation methods, namely, HCTSA-based hand-crafted features Catch22, pre-trained self supervised learning (SSL) based features extracted from neural networks trained on human speech and end-to-end acoustic modelling for call-type classification, caller identification and caller sex identification. Through an investigation on three different marmoset call datasets, we demonstrate that SSL-based feature representations and end-to-end acoustic modelling tend to lead to better systems than Catch22 features for call-type and caller classification. Furthermore, we also highlight the impact of signal bandwidth on the obtained task performances.

dc.identifier.doi10.1080/09524622.2025.2487688
dc.identifier.issn0952-4622
dc.identifier.scopus2-s2.0-105002727845
dc.identifier.urihttps://www.zora.uzh.ch/handle/20.500.14742/231525
dc.identifier.wos001466591100001
dc.language.isoeng
dc.subject.ddc490 Other languages
dc.subject.ddc300 Social sciences, sociology & anthropology
dc.subject.ddc890 Other literatures
dc.subject.ddc410 Linguistics
dc.title

On feature representations for marmoset vocal communication analysis

dc.typearticle
dcterms.accessRightsinfo:eu-repo/semantics/closedAccess
dcterms.bibliographicCitation.journaltitleBioacoustics
dcterms.bibliographicCitation.number3
dcterms.bibliographicCitation.originalpublishernameTaylor & Francis
dcterms.bibliographicCitation.pageend369
dcterms.bibliographicCitation.pagestart355
dcterms.bibliographicCitation.volume34
dspace.entity.typePublicationen
uzh.contributor.affiliationInstitut Dalle Molle D'intelligence Artificielle Perceptive, Swiss Federal Institute of Technology EPFL, Lausanne
uzh.contributor.affiliationDeutsches Primatenzentrum
uzh.contributor.affiliationUniversity of Zurich
uzh.contributor.affiliationUniversity of Zurich
uzh.contributor.affiliationInstitut Dalle Molle D'intelligence Artificielle Perceptive
uzh.contributor.authorSarkar, Eklavya
uzh.contributor.authorWierucka, Kaja
uzh.contributor.authorBosshard, Alexandra B
uzh.contributor.authorBurkart, Judith
uzh.contributor.authorMagimai Doss, Mathew
uzh.contributor.correspondenceYes
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.document.availabilityno_document
uzh.eprint.datestamp2025-07-04 09:09:23
uzh.eprint.lastmod2025-07-05 20:00:33
uzh.eprint.statusChange2025-07-04 09:09:23
uzh.harvester.ethNo
uzh.harvester.nbNo
uzh.jdb.eprintsId38220
uzh.oastatus.unpaywallgreen
uzh.oastatus.zoraClosed
uzh.publication.citationSarkar, Eklavya; Wierucka, Kaja; Bosshard, Alexandra B; Burkart, Judith; Magimai Doss, Mathew (2025). On feature representations for marmoset vocal communication analysis. Bioacoustics, 34(3):355-369.
uzh.publication.freeAccessAtdoi
uzh.publication.originalworkoriginal
uzh.publication.publishedStatusfinal
uzh.scopus.impact0
uzh.scopus.subjectsEcology, Evolution, Behavior and Systematics
uzh.scopus.subjectsEcology
uzh.workflow.doajuzh.workflow.doaj.false
uzh.workflow.eprintid278656
uzh.workflow.fulltextStatusnone
uzh.workflow.revisions37
uzh.workflow.rightsCheckkeininfo
uzh.workflow.sourceCrossref:10.1080/09524622.2025.2487688
uzh.workflow.statusarchive
uzh.wos.impact0
Publication available in collections: