Prosodylab-aligner: A tool for forced alignment of laboratory speech

Kyle Gorman; Jonathan Howell; Michael Wagner

Prosodylab-aligner: A tool for forced alignment of laboratory speech

Authors

Kyle Gorman Department of Linguistics, University of Pennsylvania, 619 Williams Hall, 255 S. 36 th St., Philadelphia, PA 19104-6305, United States
Jonathan Howell Department of Linguistics, McGill University, 1085 Dr. Penfield, Montreal, QC H3A1A7, Canada
Michael Wagner Department of Linguistics, McGill University, 1085 Dr. Penfield, Montreal, QC H3A1A7, Canada

Keywords:

Computer operating systems, Acoustic model, Hidden Markov model toolkits, Mac OS X, Model estimation, Monophones, Open-source, Resampling, Television programming

Abstract

The Penn Forced Aligner automates the alignment process using the Hidden Markov Model Toolkit (HTK). The core of Prosodylab-Aligner is align. py, a script which performs acoustic model training and alignment. This script automates calls to HTK and SoX, an open-source command-line tool which is capable of resampling audio. The included README file provides instructions for installing HTK and SoX on Linux and Mac OS X, and can also be run on Windows. During training, the model is initialized with flat-start monophones, which are then submitted to a single round of model estimation. Then, a tied-state 'small pause' model is inserted and used in a second round of estimation. The data is then aligned once to choose the most likely pronunciation of all homonyms. Web audio is downloaded from Ramp, a company which indexes radio and television programming, including NBC, PBS, Fox and CBS Radio, and processed using standard UNIX tools.

Additional Files

Published

2011-09-01

How to Cite

Gorman K, Howell J, Wagner M. Prosodylab-aligner: A tool for forced alignment of laboratory speech. Canadian Acoustics [Internet]. 2011 Sep. 1 [cited 2026 Jul. 12];39(3):192-3. Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/2476

Download Citation

Issue

Vol. 39 No. 3 (2011)

Section

Proceedings of the Acoustics Week in Canada

License

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.

Prosodylab-aligner: A tool for forced alignment of laboratory speech

Authors

Keywords:

Abstract

Additional Files

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Language

Subscription

Make a Submission

Information