A new relativistic vision in speaker discrimination

Authors

  • S. Ouamour USTHB University, USTHB, Institut d'Electronique, BP 32, Bab-Ezzouar, Alger, Algeria
  • M. Guerti Ecole Nationale Polytechnique, USTHB, Institut d'Electronique, BP 32, Bab-Ezzouar, Alger, Algeria
  • H. Sayoud USTHB University, USTHB, Institut d'Electronique, BP 32, Bab-Ezzouar, Alger, Algeria

Keywords

Classifiers, Learning systems, Neural networks, Speech recognition, Discrimination accuracies, Document indexing, Learning time, Multi-Layer Perceptron, Neural network classifiers, New models, Speaker models, Speaker verifications, Speech database, Speech signals

Abstract

This paper addresses the task of speaker discrimination using a new relativistic approach. Speaker discrimination has two practical applications: speaker verification and audio document indexing. In such applications, the speaker model is extracted directly from the speaker's own speech signal, using the speaker's own features. However, such a model can be rigid, inaccurate, and ill-suited to fluctuating environments where the recording conditions may change. For instance, during telephone conversations, the vocal features of the same speaker may change considerably. Hence, a new relative speaker model is introduced. The new model is based on a relative characterization of the speaker, called the Relative Speaker Characteristic (RSC). The RSC models one speaker relative to another, meaning that each speaker model requires both the speaker's own speech signal and the competing speech (the speech of the speaker to be compared with). This investigation shows that the relative model, used as input to a neural network classifier, optimizes the training of the classifier, shortens its learning time, and enhances the discrimination accuracy. The speaker discrimination experiments are conducted on two different databases, the Hub4 Broadcast-News database and a telephone speech database, using a Multi-Layer Perceptron (MLP) with several input characteristics. Results indicate that the RSC is the best characteristic when compared with other reduced features evaluated in the same manner.

Published

2008-12-01

How to Cite

1. Ouamour S, Guerti M, Sayoud H. A new relativistic vision in speaker discrimination. Canadian Acoustics [Internet]. 2008 Dec 1 [cited 2021 Jun 20];36(4):24-35. Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/2101

Section

Technical Articles
