Fuzzy string kernel representations in speech processing

Robert Kirchner

Fuzzy string kernel representations in speech processing

Auteurs-es

Robert Kirchner Linguistics Dept., University of Alberta, Edmonton, Alta. T6G2E1, Canada

Mots-clés :

Fuzzy sets, Learning systems, Neural networks, Pattern recognition, Vector quantization, Speech signal, Support vector machines

Résumé

Many widely used approaches to pattern recognition/machine learning, including neural nets, k-nearest-neighbours classifiers, and support vector machines, have hitherto made little headway in speech processing, largely due to their inability to represent and compute over the variable-length sequential data of speech signals. A new technique, the string (subsequence) kernel, first applied in bioinformatics [1] and text classification [2], and extended to speech recognition by Goddard et al. [3], maps a variable-length input signal to a fixed-length feature array, by taking the inner product of n-gram subsequences. Similarity of signals can then be evaluated, by any of the above approaches, in the kernel space. In this presentation, two variations on Goddard's approach are considered and evaluated: a string kernel using fuzzy rather than absolute k-means clustering; and a kernel in which the feature counts are preserved as waveforms rather than scalars, to address the reverse mapping problem.

Fichiers supplémentaires

PDF (English)

Publié-e

2003-09-01

Comment citer

Kirchner R. Fuzzy string kernel representations in speech processing. Canadian Acoustics [Internet]. 1 sept. 2003 [cité 13 mai 2026];31(3):38-9. Disponible à: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/1539

Télécharger la référence

Numéro

Vol. 31 No. 3 (2003)

Rubrique

Actes du congrès de la Semaine canadienne d'acoustique

Licence

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.

Fuzzy string kernel representations in speech processing

Auteurs-es

Mots-clés :

Résumé

Fichiers supplémentaires

Publié-e

Comment citer

Numéro

Rubrique

Licence

Langue

Abonnement

Faire une soumission

Renseignements