Timbre Classification by NN and Auditory Modeling | Istituto di Scienze e Tecnologie della Cognizione

Short time Fourier analysis in combination with filter-bank techniques or cep-strum analysis have been used for many years in order to reduce timbre repre¬sentation complexity. Recently, in speech analysis and recognition, the intro¬duction of auditory models (Cooke et al. 1993) which explicitly consider non¬linear phenomena occurring in the perception mechanism, has given promising results especially when speech is highly degraded by noise (Cosi 1993). On the other hand, Neural Networks (NN) have already proved their classification capability in various pattern recognition tasks. For these reasons, a timbre clas¬sification system, directly starting from sound signals, was conceived in which auditory modeling and neural network techniques were combined together in order to reduce timbre multidimensionality. In particular S. Seneff's auditory modeling (Seneff, 1988) was used in the analysis stage, while a bidimensional Kohonen Self Organizing Map (SOM) was used in the classification stage.

Publication type:

Contributo in atti di convegno

Author or Creator:

Cosi P.

De Poli G.

Lauzzana G.

Publisher:

Springer-Verlag, Berlin/Heidelberg, DEU

Source:

ICANN-94, International Conference on Artificial Neural Networks, pp. 933–936, Sorrento, Italy, 26-29 May, 1994

Date:

1994

Resource Identifier:

http://www.cnr.it/prodotto/i/241585

https://dx.doi.org/10.1007/978-1-4471-2097-1

info:doi:10.1007/978-1-4471-2097-1

urn:isbn:978-3-540-19887-1

Language:

Eng

ISTC Author:

Real name:

Piero Cosi