Header

UZH-Logo

Maintenance Infos

Highly spectrally undersampled vowels can be classified by machines without supervision


Kathiresan, Thayabaran; Maurer, Dieter; Dellwo, Volker (2019). Highly spectrally undersampled vowels can be classified by machines without supervision. Journal of the Acoustical Society of America, 146(1):EL1-EL7.

Abstract

An unsupervised automatic clustering algorithm (k-means) classified 1282 Mel frequency cepstral coefficient (MFCC) representations of isolated steady-state vowel utterances from eight standard German vowel categories with fo between 196 and 698 Hz. Experiment I obtained the number of MFCCs (1–20) in connection with the spectral bandwidth (2–20 kHz) at which performance peaked (five MFCCs at 4 kHz). In experiment II, classification performance with different ranges of fo revealed that ranges with fo > 500 Hz reduced classification performance but it remained well above chance. This shows that isolated steady state vowels with strongly undersampled spectra contain sufficient acoustic information to be classified automatically.

Abstract

An unsupervised automatic clustering algorithm (k-means) classified 1282 Mel frequency cepstral coefficient (MFCC) representations of isolated steady-state vowel utterances from eight standard German vowel categories with fo between 196 and 698 Hz. Experiment I obtained the number of MFCCs (1–20) in connection with the spectral bandwidth (2–20 kHz) at which performance peaked (five MFCCs at 4 kHz). In experiment II, classification performance with different ranges of fo revealed that ranges with fo > 500 Hz reduced classification performance but it remained well above chance. This shows that isolated steady state vowels with strongly undersampled spectra contain sufficient acoustic information to be classified automatically.

Statistics

Citations

Altmetrics

Downloads

1 download since deposited on 03 Jul 2019
1 download since 12 months
Detailed statistics

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
06 Faculty of Arts > Center for Linguistics
08 Research Priority Programs > Digital Society Initiative
08 Research Priority Programs > Language and Space
Dewey Decimal Classification:400 Language
410 Linguistics
Uncontrolled Keywords:Acoustics and Ultrasonics, Arts and Humanities (miscellaneous)
Language:English
Date:1 July 2019
Deposited On:03 Jul 2019 07:19
Last Modified:29 Nov 2019 08:42
Publisher:Acoustical Society of America
ISSN:0001-4966
OA Status:Closed
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.1121/1.5111154
Official URL:https://asa.scitation.org/doi/10.1121/1.5111154

Download

Closed Access: Download allowed only for UZH members