LABORATORIO DE COMUNICACIÓN ORAL
ROBERT WAYNE NEWCOMB
P. Gómez, V. Rodellar, A. Alvarez, N. Mayo, F. Rubio, V. Nieto and M. M. Pérez
- "Visual Representation of the Speech Trace using Neural Networks"
- International Conference on Circuits and Systems, ISCAS'96:
- Atlanta, USA, 12-15 May. 1996, pp. III-586-589.
- Through the present paper, a methodology to create Visual Representations of Speech for
Speech Perception Enhancement Applications, is presented, based on the use of Time-Delay Neural
Networks. The advantages of using Neural Networks for such purposes, come from a lower computational
cost, and from an easier DSP or VLSI implementation. On the other hand, the main inconvenient
found in using this technique, is the need for training to each specific speaker. This requirement
may be relaxed if proper normalization methods are used. The specific mathematical and computational
issues introduced for such treatment are given, and a specific case for Computed-Aided language
Learning oriented to the Phonetic Specificities of English for Spanish Speakers is also presented
and discussed. This specific technique may also be used in statistically normalizing Speech Data
for Speech Recognition Systems.
Pulse aquí para bajarse el artículo
Formato PostScript (580 Kb.)
Comprimido en zip (178 Kb.)
Madrid a 9 de junio de 2004