A formant frequency estimator for noisy speech based on correlation and cepstrum
Keywords:Correlation methods, Curve fitting, Natural frequencies, Speech, Speech analysis, Transfer functions, Accurate, Adverse effects, AR systems, Cepstral domains, Cepstrum, Correlation domains, Formant estimations, Formant frequencies, Least squares optimization techniques, Natural speeches, Noisy conditions, Noisy environments, Noisy observations, Noisy speeches, Observation noises
AbstractA formant estimation scheme combining the advantageous features of correlation and cepstral domains, which is capable of handling the adverse effect of observation noise was investigated. A residue-based least squares optimization technique based on a model-fitting approach was introduced in order to obtain formant frequencies from noisy observations. Simulations were carried out to estimate formant frequencies from synthetic and natural speech signals under noisy conditions. The human vocal-tract system was represented by a P-th order AR system with a transfer function. A formant frequency estimation scheme based on a new ramp cepstrum model was developed which is capable of efficiently handling the noisy environment. The once-repeated ACF was employed which can significantly reduce the effect of noise in the correlation domain. It was observed that the proposed method provides an accurate formant frequency estimate even at a low level of SNR.
How to Cite
Copyright on articles is held by the author(s). The corresponding author has the right to grant on behalf of all authors and does grant on behalf of all authors, a worldwide exclusive licence (or non-exclusive license for government employees) to the Publishers and its licensees in perpetuity, in all forms, formats and media (whether known now or created in the future)
i) to publish, reproduce, distribute, display and store the Contribution;
ii) to translate the Contribution into other languages, create adaptations, reprints, include within collections and create summaries, extracts and/or, abstracts of the Contribution;
iii) to exploit all subsidiary rights in the Contribution,
iv) to provide the inclusion of electronic links from the Contribution to third party material where-ever it may be located;
v) to licence any third party to do any or all of the above.