Header

UZH-Logo

Maintenance Infos

A model of attention-driven scene analysis


Slaney, M; Agus, T; Liu, S C; Kaya, M; Elhilali, M (2012). A model of attention-driven scene analysis. In: 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing , Tokyo, Japan, 25 March 2012 - 30 March 2012, 145-148.

Abstract

Parsing complex acoustic scenes involves an intricate interplay between bottom-up, stimulus-driven salient elements in the scene with top-down, goal-directed, mechanisms that shift our attention to particular parts of the scene. Here, we present a framework for exploring the interaction between these two processes in a simulated cocktail party setting. The model shows improved digit recognition in a multi-talker environment with a goal of tracking the source uttering the highest value. This work highlights the relevance of both data-driven and goal-driven processes in tackling real multi-talker, multi-source sound analysis.

Abstract

Parsing complex acoustic scenes involves an intricate interplay between bottom-up, stimulus-driven salient elements in the scene with top-down, goal-directed, mechanisms that shift our attention to particular parts of the scene. Here, we present a framework for exploring the interaction between these two processes in a simulated cocktail party setting. The model shows improved digit recognition in a multi-talker environment with a goal of tracking the source uttering the highest value. This work highlights the relevance of both data-driven and goal-driven processes in tackling real multi-talker, multi-source sound analysis.

Statistics

Citations

3 citations in Web of Science®
4 citations in Scopus®
Google Scholar™

Altmetrics

Additional indexing

Item Type:Conference or Workshop Item (Speech), refereed, original work
Communities & Collections:07 Faculty of Science > Institute of Neuroinformatics
Dewey Decimal Classification:570 Life sciences; biology
Language:English
Event End Date:30 March 2012
Deposited On:28 Feb 2013 09:37
Last Modified:14 Aug 2017 10:31
Publisher:Institute of Electrical and Electronics Engineers
Series Name:IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings
Number of Pages:4
ISSN:1520-6149
ISBN:978-1-4673-0044-5
Publisher DOI:https://doi.org/10.1109/ICASSP.2012.6287838

Download

Full text not available from this repository.
View at publisher

TrendTerms

TrendTerms displays relevant terms of the abstract of this publication and related documents on a map. The terms and their relations were extracted from ZORA using word statistics. Their timelines are taken from ZORA as well. The bubble size of a term is proportional to the number of documents where the term occurs. Red, orange, yellow and green colors are used for terms that occur in the current document; red indicates high interlinkedness of a term with other terms, orange, yellow and green decreasing interlinkedness. Blue is used for terms that have a relation with the terms in this document, but occur in other documents.
You can navigate and zoom the map. Mouse-hovering a term displays its timeline, clicking it yields the associated documents.

Author Collaborations