Hostname: page-component-8448b6f56d-tj2md Total loading time: 0 Render date: 2024-04-19T06:43:48.984Z Has data issue: false hasContentIssue false

Off-line classification of Polish vowel spectra using artificial neural networks

Published online by Cambridge University Press:  07 June 2004

Wiktor Jassem
Affiliation:
Institute of Fundamental Technological Research, Poznań, wjassem@amu.edu.pl
Waldemar Grygiel
Affiliation:
Kulczyk Tradex, Poznań, grygielw@kulczyktradex.com.pl

Abstract

The mid-frequencies and bandwidths of formants 1–5 were measured at targets, at plus 0.01 s and at minus 0.01 s off the targets of vowels in a 100-word list read by five male and five female speakers, for a total of 3390 10-variable spectrum specifications. Each of the six Polish vowel phonemes was represented approximately the same number of times. The 3390* 10 original-data matrix was processed by probabilistic neural networks to produce a classification of the spectra with respect to (a) vowel phoneme, (b) identity of the speaker, and (c) speaker gender. For (a) and (b), networks with added input information from another independent variable were also used, as well as matrices of the numerical data appropriately normalized. Mean scores for classification with respect to phonemes in a multi-speaker design in the testing sets were around 95%, and mean speaker-dependent scores for the phonemes varied between 86% and 100%, with two speakers scoring 100% correct. The individual voices were identified between 95% and 96% of the time, and classifications of the spectra for speaker gender were practically 100% correct.

Type
Research Article
Copyright
© Journal of the International Phonetic Association 2004

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)