Characterizing Formants Through Straight-Line Approximations Without Explicit Formant Tracking

Auteurs-es

  • S Seneff

Résumé

A new method for representing the formants of sonorant speech sounds is described. The method collapses the two-stage process of (1) formant tracking and (2) abstraction of rates and directions of formant movements into a one-step process of directly assigning straight-line segments to the resonance contours in the frequency-time space. The method resembles techniques used in vision research [1], and is also motivated by observations of specialized frequency-modulation detectors in the central auditory system [4]. The computational procedures are straightforward, leading to a description of the formant information for a given vowel by a list of oriented straight-line segments. The line segments are not assigned to particular formants, such as F2 . Instead, the recognition process is hypothesis-driven. For each vowel or diphthong to be recognized, a short description or expected ranges of frequency and orientation in the time-frequency dimensions for the first two formants is given. Feasibility of the method is demonstrated by applying it to the specific task of recognizing the vowels and diphthongs of American English in restricted context, spoken by multiple speakers.

Fichiers supplémentaires

Publié-e

2022-12-03

Comment citer

1.
Seneff S. Characterizing Formants Through Straight-Line Approximations Without Explicit Formant Tracking. Canadian Acoustics [Internet]. 3 déc. 2022 [cité 7 août 2024];14(3 bis):75-6. Disponible à: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/3549

Numéro

Rubrique

Actes du congrès de la Semaine canadienne d'acoustique