The RACAD speech corpus of New Brunswick Acadian French: Design and applications
Keywords:
Applications, Audio systems, Linguistics, Automatic speech recognition, Geographical locations, High-quality audio, Linguistic analysis, Recognition models, Recognition systems, Speech corpora, Word recognitionAbstract
The RACAD (Reconnaissance automatique de l'acadien) speech corpus contains high quality audio recordings that can be used to develop recognition systems for the regional varieties of French spoken in the province of New Brunswick, Canada. Its design is informed by linguistic analyses of Acadian French. The corpus contains sentences read by 140 speakers who were selected according to age, gender and geographical region. This paper presents a preliminary application of the corpus in automatic speech recognition research; it outlines an original global monophone recognition model that is designed to handle linguistic variability. Global phone and word recognition rates for this model are satisfactory (about 90%), but they vary considerably across geographical locations. Possible applications of the RACAD corpus in acoustic phonetic and socio-phonetic studies of dialect variation are also described in this paper.Downloads
Published
How to Cite
Issue
Section
License
Copyright on articles is held by the author(s). The corresponding author has the right to grant on behalf of all authors and does grant on behalf of all authors, a worldwide exclusive licence (or non-exclusive license for government employees) to the Publishers and its licensees in perpetuity, in all forms, formats and media (whether known now or created in the future)
i) to publish, reproduce, distribute, display and store the Contribution;
ii) to translate the Contribution into other languages, create adaptations, reprints, include within collections and create summaries, extracts and/or, abstracts of the Contribution;
iii) to exploit all subsidiary rights in the Contribution,
iv) to provide the inclusion of electronic links from the Contribution to third party material where-ever it may be located;
v) to licence any third party to do any or all of the above.