Continued development of an IMELDA based voice recognition system for persons with severe disabilities

Auteurs-es

  • G.E. Birch Neil Squire Found., North Vancouver, BC, Canada
  • D.A. Zwierzynski
  • C. Lefebvre
  • D. Starks

Mots-clés :

handicapped aids, speech recognition equipment, speech recognition system, front-end component, back-end component, dynamic programming, mel-scale fast-fourier-transform based spectral filter-bank analysis, linear transformation, IMELDA, linear discriminant analysis

Résumé

The general design of the authors' speech recognition system is such that the acoustic features are first extracted from the speech signal in the front-end component. Subsequently, these features are passed on to the back-end component where they are compared with stored templates through the technique of dynamic programming. The front-end processing in their speech recognition system is a mel-scale fast-fourier-transform based spectral filter-bank analysis followed by a linear transformation. This linear transformation, which is called IMELDA, was developed and tested and is based on linear discriminant analysis. Comparisons with other systems indicate that it is the `state of the art' for front-end processing in a robust speech recognition system, outperforming other transforms and recognition systems, particularly in degraded speech

Fichiers supplémentaires

Publié-e

1992-09-01

Comment citer

1.
Birch G, Zwierzynski D, Lefebvre C, Starks D. Continued development of an IMELDA based voice recognition system for persons with severe disabilities. Canadian Acoustics [Internet]. 1 sept. 1992 [cité 19 févr. 2025];20(3):45-6. Disponible à: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/728

Numéro

Rubrique

Actes du congrès de la Semaine canadienne d'acoustique