Computer Assisted Segmentation of Tongue Ultrasound and Lip Videos

Pertti Palo

Authors

Pertti Palo, Indiana University, Department of Speech, Language and Hearing Sciences

Abstract

Compared to segmenting acoustic speech data, time domain analysis of tongue ultrasound videos and lip videos is challenging and lacks widely accepted tools. The most widely used method is to select time points for articulatory analysis on the basis of acoustic segmentation. In acoustic analysis, by contrast, the spectrogram provides an easy way of analysing time and frequency domain characteristics of the speech signal at a glance.

In an effort to address this discrepancy, Author (2019, 2020) provided an analysis tool that can be used for direct phonetic analysis of tongue ultrasound data. The tool is an application of the Euclidean distance metric to the whole ultrasound image. It can be used to easily visualise general change in the data (see Figure) and provides a good basis for segmentation.
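As a rough illustration of applying a Euclidean distance metric to whole images, the sketch below computes the distance between consecutive frames of a video and collects the results into a change curve. It is a minimal sketch under stated assumptions, not the tool described in the abstract: comparing consecutive frames, representing each frame as rows of greyscale pixel values, and the names frame_distance and change_curve are all hypothetical choices made for this example.

```python
import math


def frame_distance(frame_a, frame_b):
    """Euclidean distance between two equally sized greyscale frames.

    Each frame is a list of rows, and each row is a list of pixel values.
    This representation is an assumption made for the example.
    """
    if len(frame_a) != len(frame_b):
        raise ValueError("frames must have the same number of rows")
    total = 0.0
    for row_a, row_b in zip(frame_a, frame_b):
        if len(row_a) != len(row_b):
            raise ValueError("rows must have the same number of pixels")
        for a, b in zip(row_a, row_b):
            total += (a - b) ** 2
    return math.sqrt(total)


def change_curve(frames):
    """Distance between each pair of consecutive frames (assumed pairing)."""
    return [frame_distance(prev, nxt) for prev, nxt in zip(frames, frames[1:])]


# Toy data: four 2x2 frames with one change between the second and third.
frames = [
    [[0, 0], [0, 0]],
    [[0, 0], [0, 0]],
    [[3, 4], [0, 0]],
    [[3, 4], [0, 0]],
]

curve = change_curve(frames)
print(curve)  # [0.0, 5.0, 0.0]
```

In this toy example the curve is flat while the image is static and peaks where the image changes, which is the kind of signal that could serve as a basis for segmentation. Real ultrasound data would need array-based processing for speed.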

This study extends the tool for simultaneous analysis of synchronised tongue ultrasound and lip videos. A data set from a single speaker is analysed with the new method to provide a proof-of-concept.

Author Biography

Pertti Palo, Indiana University, Department of Speech, Language and Hearing Sciences

Post-doc at Indiana University, Department of Speech, Language and Hearing Sciences. PhD in phonetics from Queen Margaret University, Edinburgh, Scotland.

Published

2021-08-30

How to Cite

Palo P. Computer Assisted Segmentation of Tongue Ultrasound and Lip Videos. Canadian Acoustics [Internet]. 2021 Aug. 30 [cited 2026 Jul. 16];49(3):44-5. Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/3912

Issue

Vol. 49 No. 3 (2021)

Section

Proceedings of the Acoustics Week in Canada

License

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.
