Computer Assisted Segmentation of Tongue Ultrasound and Lip Videos

Pertti Palo

Authors

Pertti Palo, Indiana University, Department of Speech, Language and Hearing Sciences

Abstract

Compared to segmenting acoustic speech data, time domain analysis of tongue ultrasound videos and lip videos is challenging and lacks widely accepted tools. The most widely used method is to select time points for articulatory analysis on the basis of acoustic segmentation. In acoustic analysis, by contrast, the spectrogram provides an easy way of analysing the time and frequency domain characteristics of the speech signal at a glance.

In an effort to address this discrepancy, Author (2019, 2020) have provided an analysis tool that can be used for direct phonetic analysis of tongue ultrasound data. The tool applies the Euclidean distance metric to the whole ultrasound image. It makes general change in the data easy to visualise (see Figure) and provides a good basis for segmentation.
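As a rough illustration of how a whole-image Euclidean distance can expose change over time, the following Python sketch computes the distance between each pair of consecutive frames. The abstract states only that the metric is applied to the whole ultrasound image; comparing consecutive frames, the function name `frame_change_curve`, and the toy data are assumptions made for this example, not a description of the published tool.

```python
import numpy as np


def frame_change_curve(frames: np.ndarray) -> np.ndarray:
    """Euclidean distance between each pair of consecutive frames.

    frames: array of shape (n_frames, height, width), one greyscale
    image per frame. Returns an array of length n_frames - 1.
    (Hypothetical helper; comparing consecutive frames is an assumption.)
    """
    # Cast to float so subtracting unsigned 8-bit pixels cannot wrap around.
    data = frames.astype(np.float64)
    # Pixel-wise difference between frame i+1 and frame i.
    diffs = np.diff(data, axis=0)
    # Flatten each difference image and take its L2 (Euclidean) norm.
    return np.linalg.norm(diffs.reshape(diffs.shape[0], -1), axis=1)


# Toy data, not real ultrasound: five 4x4 frames in which nothing
# changes except for a single jump between frames 2 and 3.
frames = np.zeros((5, 4, 4), dtype=np.uint8)
frames[3:] = 3
curve = frame_change_curve(frames)
print(curve)
```

In this toy case the curve is zero everywhere except at the one transition where the image changes, which is the kind of peak a segmenter could look for when placing a boundary.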

This study extends the tool to the simultaneous analysis of synchronised tongue ultrasound and lip videos. A data set from a single speaker is analysed with the new method as a proof of concept.

Author biography

Pertti Palo, Indiana University, Department of Speech, Language and Hearing Sciences

Post-doc at Indiana University, Department of Speech, Language and Hearing Sciences. PhD in phonetics from Queen Margaret University, Edinburgh, Scotland.

Additional files

PDF (English)

Published

2021-08-30

How to cite

Palo P. Computer Assisted Segmentation of Tongue Ultrasound and Lip Videos. Canadian Acoustics [Internet]. 2021 Aug 30 [cited 2024 Aug 24];49(3):44-5. Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/3912

Issue

Vol. 49 No. 3 (2021)

Section

Proceedings of the Acoustics Week in Canada

License

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.
