On The Robustness of Phonetic Information In Short-Time Speech Spectra

Meg Withgott; Marcia A. Bush

On The Robustness of Phonetic Information In Short-Time Speech Spectra

Auteurs-es

Meg Withgott
Marcia A. Bush

Résumé

Speech recognition techniques which take fixed-time slices as input to a matcher face the task of mapping from arbitrary pieces of the physical signal to abstract linguistic units. This paper examines the reliability with which individual vector-quantized LPC spectra can be mapped to various sets of acoustic-phonetic classes. The database for the experiments consisted of approximately 130,000 spectra from a pre-labeled corpus of 616 5-digit strings, and classification was performed on the basis of a maximum likehood decision rule. Classification accuracy, when the same database was used for training and testing, ranged from 94.0% for a simple voiced-voiceless distinction to 42 .7% for a set of 45 acoustic-phonetic classes used in earlier connected digit recognition experiments

Fichiers supplémentaires

pdf (English)

Publié-e

2022-12-03

Comment citer

Withgott M, Bush MA. On The Robustness of Phonetic Information In Short-Time Speech Spectra. Canadian Acoustics [Internet]. 3 déc. 2022 [cité 22 févr. 2025];14(3 bis):101-2. Disponible à: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/3540

Télécharger la référence

Numéro

Vol. 14 No. 3 bis (1986): Symposium sur la reconnaissance de la parole

Rubrique

Actes du congrès de la Semaine canadienne d'acoustique

Licence

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.

On The Robustness of Phonetic Information In Short-Time Speech Spectra

Auteurs-es

Résumé

Fichiers supplémentaires

Publié-e

Comment citer

Numéro

Rubrique

Licence

Articles les plus lus du,de la,des même-s auteur-e-s

Langue

Abonnement

Faire une soumission

Renseignements