Representation of The First Formant In Speech Recognition And In Models of The Auditory Periphery

Dennis H. Klatt

Representation of The First Formant In Speech Recognition And In Models of The Auditory Periphery

Authors

Dennis H. Klatt

Abstract

The frequency and amplitude of' the first formant are not easy to measure as fundamental frequency (f0) varies in speech. Perceptual data indicate that the auditory system is not bothered by changes to f0, but processing strategies used in speech recognition, such as linear prediction, filterbank analysis, and the synchrony spectrum are seriously perturbed as f0 varies. The irrelevant variation makes it difficult/unreliable to perform phonetic comparisons between similar vowels based on simple ideas of pattern similarity. Of the possible solutions to this problem considered here, the one of greatest practical attraction is to implement a synchrony spectrum representation of vowel-like speech sounds, and a "learned pattern equivalence" approach to vowel phonetic-quality equivalence across different fundamental frequencies.

Additional Files

Published

1986-07-21

How to Cite

Klatt DH. Representation of The First Formant In Speech Recognition And In Models of The Auditory Periphery. Canadian Acoustics [Internet]. 1986 Jul. 21 [cited 2024 Nov. 21];14(3 bis):5-7. Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/3495

Download Citation

Issue

Vol. 14 No. 3 bis (1986): Montreal Symposium On Speech Recognition

Section

Proceedings of the Acoustics Week in Canada

License

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.

Representation of The First Formant In Speech Recognition And In Models of The Auditory Periphery

Authors

Abstract

Additional Files

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Language

Subscription

Make a Submission

Information