A Comparison between Mel-Scale Cepstrum and Auditory Model Representation for Noisy Speech Recognition | Istituto di Scienze e Tecnologie della Cognizione

A joint synchrony/mean-rate auditory model, recently proposed by Seneff[6], is embedded into a classical DTW-based system for the recognition of Italian digits. Its performances are evalu¬ated in both clean and noisy speech and compared with those of a system based on the Mel¬cepstrum representation. Experimental results show that the Mel representation outperforms the auditory model. Problems encountered by the auditory model in noisy speech are outlined and suggestions for noise compensation techniques both inside and outside the model are given. Simple image processing techniques aiming to clean up the synchrony spectrogram in noisy speech are suggested and some promising preliminary results are presented.

Tipo Pubblicazione:

Contributo in atti di convegno

Author or Creator:

Cosi P.

Falavigna D.

Mian G.A.

Omologo M.

Publisher:

Elsevier, Amsterdam, NLD

Source:

Proceedings EUSIPCO-90, pp. 1199–1202, Barcellona, Spain, 18-21 September, 1990

Date:

1990

Resource Identifier:

http://www.cnr.it/prodotto/i/241887

urn:isbn:0444886362

Language:

Eng

ISTC Author:

Real name:

Piero Cosi