A model of attention-driven scene analysis


Slaney, M; Agus, T; Liu, S C; Kaya, M; Elhilali, M (2012). A model of attention-driven scene analysis. In: 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, Tokyo, Japan, 25 March 2012 - 30 March 2012, 145-148.

Abstract

Parsing complex acoustic scenes involves an intricate interplay between bottom-up, stimulus-driven salient elements in the scene and top-down, goal-directed mechanisms that shift our attention to particular parts of the scene. Here, we present a framework for exploring the interaction between these two processes in a simulated cocktail party setting. The model shows improved digit recognition in a multi-talker environment with a goal of tracking the source uttering the highest value. This work highlights the relevance of both data-driven and goal-driven processes in tackling real multi-talker, multi-source sound analysis.

Statistics

Citations

3 citations in Web of Science®
4 citations in Scopus®

Additional indexing

Item Type: Conference or Workshop Item (Speech), refereed, original work
Communities & Collections: 07 Faculty of Science > Institute of Neuroinformatics
Dewey Decimal Classification: 570 Life sciences; biology
Language: English
Event End Date: 30 March 2012
Deposited On: 28 Feb 2013 09:37
Last Modified: 07 Dec 2017 20:21
Publisher: Institute of Electrical and Electronics Engineers
Series Name: IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings
Number of Pages: 4
ISSN: 1520-6149
ISBN: 978-1-4673-0044-5
Publisher DOI: https://doi.org/10.1109/ICASSP.2012.6287838

Download

Full text not available from this repository.