A new relativistic vision in speaker discrimination

S. Ouamour; M. Guerti; H. Sayoud

Authors

S. Ouamour, USTHB University, USTHB, Institut d'Electronique, BP 32, Bab-Ezzouar, Alger, Algeria
M. Guerti, Ecole Nationale Polytechnique, USTHB, Institut d'Electronique, BP 32, Bab-Ezzouar, Alger, Algeria
H. Sayoud, USTHB University, USTHB, Institut d'Electronique, BP 32, Bab-Ezzouar, Alger, Algeria

Keywords:

Classifiers, Learning systems, Neural networks, Speech recognition, Discrimination accuracies, Document indexing, Learning time, Multi-Layer Perceptron, Neural network classifiers, New models, Speaker models, Speaker verifications, Speech database, Speech signals

Abstract

This paper addresses the task of speaker discrimination using a new relativistic approach. Speaker discrimination has two practical applications: speaker verification and audio document indexing. In such applications, the speaker model is extracted directly from the speaker's own speech signal, using the speaker's own features. However, such a model can be rigid, inaccurate, and ill-suited to fluctuating environments where the recording conditions may change. For instance, during telephone conversations, the vocal features of the same speaker may change considerably. A new relative speaker model is therefore introduced. The new model is based on a relative characterization of the speaker, called the Relative Speaker Characteristic (RSC). The RSC models one speaker relative to another, meaning that each speaker model requires both the speaker's own speech signal and the competing speech (the speech of the speaker being compared against). This investigation shows that the relative model, used as input to a neural network classifier, improves the training of the classifier, shortens its learning time, and enhances the discrimination accuracy. The speaker discrimination experiments are conducted on two databases, the Hub4 Broadcast-News database and a telephone speech database, using a Multi-Layer Perceptron (MLP) with several input characteristics. The results indicate that the RSC is the best characteristic when compared with other reduced features evaluated in the same manner.
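
The idea of describing one speaker relative to another can be illustrated with a small sketch. The abstract does not give the RSC formula, so the code below is not the authors' definition: the function name `relative_feature`, the normalized mean difference it computes, and the frame layout (a list of per-frame feature vectors) are all assumptions made for illustration. It only shows why a descriptor built from two segments together might be less sensitive to a shared change in recording conditions than a descriptor built from one segment alone.

```python
from statistics import fmean, pstdev


def relative_feature(seg_a, seg_b, eps=1e-9):
    """Hypothetical relative descriptor of segment A with respect to segment B.

    seg_a, seg_b: lists of frames, each frame a list of floats of equal length.
    Returns one value per feature dimension: the difference of the two
    per-dimension means, divided by the spread of both segments pooled.
    This is an illustrative stand-in, NOT the RSC defined in the paper.
    """
    dims = len(seg_a[0])
    rel = []
    for d in range(dims):
        col_a = [frame[d] for frame in seg_a]
        col_b = [frame[d] for frame in seg_b]
        spread = pstdev(col_a + col_b)  # spread over the frames of both segments
        rel.append((fmean(col_a) - fmean(col_b)) / (spread + eps))
    return rel


# Toy frames with two feature dimensions (made-up numbers, not real speech data).
speaker_a = [[1.0, 2.0], [3.0, 6.0]]
speaker_b = [[0.0, 2.0], [2.0, 2.0]]

# The same frames with a constant offset added to every value, standing in
# for a change in recording conditions that affects both segments alike.
offset = 5.0
shifted_a = [[x + offset for x in frame] for frame in speaker_a]
shifted_b = [[x + offset for x in frame] for frame in speaker_b]

rsc_like = relative_feature(speaker_a, speaker_b)
rsc_like_shifted = relative_feature(shifted_a, shifted_b)
```

In this toy setting the offset cancels, so `rsc_like` and `rsc_like_shifted` agree, whereas the per-segment means alone would differ by the offset. A vector of this kind would then be one candidate input to a classifier such as an MLP.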

Published

2008-12-01

How to Cite

Ouamour S, Guerti M, Sayoud H. A new relativistic vision in speaker discrimination. Canadian Acoustics [Internet]. 2008 Dec. 1 [cited 2026 May 11];36(4):24-35. Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/2101

Issue

Vol. 36 No. 4 (2008)

Section

Technical Articles

License

Author Licensing Addendum

This Licensing Addendum ("Addendum") is entered into between the undersigned Author(s) and Canadian Acoustics journal published by the Canadian Acoustical Association (hereinafter referred to as the "Publisher"). The Author(s) and the Publisher agree as follows:

Retained Rights: The Author(s) retain(s) the following rights:
- The right to reproduce, distribute, and publicly display the Work on the Author's personal website or the website of the Author's institution.
- The right to use the Work in the Author's teaching activities and presentations.
- The right to include the Work in a compilation for the Author's personal use, not for sale.
Grant of License: The Author(s) grant(s) to the Publisher a worldwide exclusive license to publish, reproduce, distribute, and display the Work in Canadian Acoustics and any other formats and media deemed appropriate by the Publisher.
Attribution: The Publisher agrees to include proper attribution to the Author(s) in all publications and reproductions of the Work.
No Conflict: This Addendum is intended to be in harmony with, and not in conflict with, the terms and conditions of the original agreement entered into between the Author(s) and the Publisher.
Copyright Clause: Copyright on articles is held by the Author(s). The corresponding Author has the right to grant on behalf of all Authors and does grant on behalf of all Authors, a worldwide exclusive license to the Publisher and its licensees in perpetuity, in all forms, formats, and media (whether known now or created in the future), including but not limited to the rights to publish, reproduce, distribute, display, store, translate, create adaptations, reprints, include within collections, and create summaries, extracts, and/or abstracts of the Contribution.
