Experiments on The Use of Demisyllables For Automatic Speech Recognition
Abstract
The paper describes methods for an explicit segmentation of the speech signal into demisyllable segments by evaluating the output of a loudness model. Syllable nuclei are indicated by maxima of a smoothed loudness function. Consonant clusters and vowels are introduced as decision units in order to reduce the inventory of classes. Two methods for classification of consonant clusters are compared: template matching and a feature extraction approach based on acoustic cues. Sentence recognition operates on phonetic word models adapted to the demisyllable structure.
Downloads
Published
How to Cite
Issue
Section
License
Copyright on articles is held by the author(s). The corresponding author has the right to grant on behalf of all authors and does grant on behalf of all authors, a worldwide exclusive licence (or non-exclusive license for government employees) to the Publishers and its licensees in perpetuity, in all forms, formats and media (whether known now or created in the future)
i) to publish, reproduce, distribute, display and store the Contribution;
ii) to translate the Contribution into other languages, create adaptations, reprints, include within collections and create summaries, extracts and/or, abstracts of the Contribution;
iii) to exploit all subsidiary rights in the Contribution,
iv) to provide the inclusion of electronic links from the Contribution to third party material where-ever it may be located;
v) to licence any third party to do any or all of the above.