Speaker-Dependent Feature Generalizability for the Detection of Alcohol Intoxication

Authors

  • Arian Shamei UBC
  • Xinglei Liu
  • Rima Seiilova

Keywords:

Acoustic automated detection, acoustic analysis, Artificial intelligence, Acoustic speech measures

Abstract

Impairments to speech motor control from alcohol intoxication are variable across individuals, making speaker-dependent approaches ideal for speech-based intoxication detection. Here we evaluated whether individual acoustic features have high generalizability across speaker-dependent models. We selected 97 speakers (54 male, 43 female) from the Alcohol Language Corpus  who had sufficient sober and intoxicated (>0.8% blood-alcohol concentration) recordings for speaker-dependent modeling. We extracted 17 features from each audio file, then fitted these features to speaker-dependent random forest models and evaluated feature importance using mean decrease in Gini impurity (GI). Results show that across all speakers, consonant-based features tend to have stronger generalizability than vowel-based features, with spectral skewness and kurtosis being the most generalizable (GI: .11 and .09), and vowel duration and F2 being the least generalizable (GI: .04 and .03).

Additional Files

Published

2024-05-12

How to Cite

1.
Shamei A, Liu X, Seiilova R. Speaker-Dependent Feature Generalizability for the Detection of Alcohol Intoxication. Canadian Acoustics [Internet]. 2024 May 12 [cited 2024 Oct. 7];52(1). Available from: https://jcaa.caa-aca.ca/index.php/jcaa/article/view/4197

Most read articles by the same author(s)