Optimization of Gabor Features for Text-Independent Speaker Identification

Autoren: V. Mildner, S. Goetze, K.-D. Kammeyer, A. Mertins

For text-independent speaker identification a prominent combination is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account the time difference of features of adjacent speech frames are appended to the initial features. In this paper we investigate the applicability of spectro-temporal features obtained from Gabor-Filters and present an algorithm for optimizing the possible parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.

Dokumenttyp: Konferenzbeitrag
Veröffentlichung: New Orleans, USA, 27. - 30. Mai 2007
Konferenz: IEEE International Symposium on Circuits and Systems (ISCAS 07)
Seiten: 3932-3935
Index: 296
ISCAS_2007_mildner.pdf174 KB
Zuletzt aktualisiert am 30.04.2008 von S. Goetze
AIT ieee GOC tzi ith Fachbereich 1
© Arbeitsbereich Nachrichtentechnik - Universität BremenImpressum / Kontakt