Regularized All-Pole Models for Speaker Verification Under Noisy Environments

Hanilci, Cemal; Kinnunen, Tomi; Ertaş, Figen; Saeidi, Rahim; Pohjalainen, Jouni; Alku, Paavo

doi:10.1109/lsp.2012.2184284

IEEE Signal Processing Letters, vol. 19, no. 3, pp. 163-166, 2012 (SCI-Expanded, Scopus)

Publication Type: Article / Full Article
Volume: 19 Issue: 3
Publication Date: 2012
DOI: 10.1109/lsp.2012.2184284
Journal Name: IEEE Signal Processing Letters
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Pages: pp. 163-166
Affiliated with Bursa Uludağ University: Yes

Abstract

Regularization of linear prediction based mel-frequency cepstral coefficient (MFCC) extraction in speaker verification is considered. Commonly, MFCCs are extracted from the discrete Fourier transform (DFT) spectrum of speech frames. In this paper, the DFT spectrum estimate is replaced with one obtained by the recently proposed regularized linear prediction (RLP) method. Regularization of the temporally weighted variants, weighted LP (WLP) and stabilized WLP (SWLP), which have earlier shown success in speech and speaker recognition, is also introduced. A novel type of double autocorrelation (DAC) lag windowing is also proposed to enhance robustness. Experiments on the NIST 2002 corpus indicate that the regularized all-pole methods (RLP, RWLP and RSWLP) yield large improvements in recognition accuracy under additive factory and babble noise conditions, in terms of both equal error rate (EER) and minimum detection cost function (MinDCF).
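To make the idea of substituting an all-pole spectrum for the DFT spectrum concrete, the following is a minimal sketch of a regularized linear-prediction spectrum estimate for a single speech frame. It is an illustration under stated assumptions, not the method of the paper: the penalty used here is a plain diagonal (ridge) term added to the autocorrelation normal equations, which is a simpler stand-in for the RLP penalty named in the abstract. The function names, the Hamming window, the model order of 20, and the value of `lam` are all hypothetical choices. In an MFCC front end, the resulting spectrum would take the place of the DFT power spectrum before the mel filterbank.

```python
import numpy as np


def autocorrelation(frame, order):
    """Autocorrelation lags r[0..order] of an already windowed frame."""
    n = len(frame)
    return np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])


def regularized_lp(frame, order=20, lam=1e-3):
    """Predictor coefficients from (R + lam * r[0] * I) a = r.

    ASSUMPTION: the diagonal (ridge) penalty is chosen for simplicity;
    it is not the penalty of the RLP method named in the abstract.
    """
    windowed = frame * np.hamming(len(frame))
    r = autocorrelation(windowed, order)
    # Toeplitz autocorrelation matrix built from lags 0..order-1.
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    R = r[idx]
    return np.linalg.solve(R + lam * r[0] * np.eye(order), r[1:])


def all_pole_spectrum(a, n_fft=512):
    """Power spectrum 1 / |A(e^{jw})|^2, with A(z) = 1 - sum_k a_k z^{-k}."""
    A = np.fft.rfft(np.concatenate(([1.0], -a)), n_fft)
    return 1.0 / np.abs(A) ** 2


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.standard_normal(240)  # stand-in for a 30 ms frame at 8 kHz
    coeffs = regularized_lp(frame, order=20, lam=1e-3)
    spectrum = all_pole_spectrum(coeffs, n_fft=512)
    print(spectrum.shape)
```

Increasing `lam` shrinks the predictor coefficients and flattens the spectral envelope, which is the general mechanism by which a penalty term can trade spectral detail for robustness to noise.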