Prediction and Enhancement of Speech Intelligibility in Challenging Acoustic Environments

Loading...
Thumbnail Image

Authors

Edraki, Amin

Date

2025-01-07

Type

thesis

Language

eng

Keyword

Speech Signal Processing , Speech Intelligibility , Speech Enhancement , Bone-Conducted Speech

Research Projects

Organizational Units

Journal Issue

Alternative Title

Abstract

Speech intelligibility refers to the portion of a spoken message correctly identified by a listener, making it a critical perceptual attribute in challenging acoustic environments, such as low signal-to-noise ratios. This work addresses two key issues in speech intelligibility: prediction and enhancement. In the Speech Intelligibility Prediction (SIP) segment, we introduce two state-of-the-art, reference-based SIP algorithms that utilize Spectro-Temporal Modulation (STM) analysis of input speech. We present a data-driven, interpretable STM weighting function that assigns varying importance to different STM frequencies, with findings contextualized through psychoacoustic modulation transfer functions. Our algorithms achieved state-of-the-art performance across multiple unseen test datasets and under diverse distortion and processing conditions. Additionally, we propose a probabilistic linguistic augmentation method to modify existing SIP algorithms, incorporating the linguistic predictability of sentences. Using next-word probabilities from a pre-trained language model, we estimate contextual predictability for SIP, yielding improved intelligibility across datasets with varying levels of linguistic predictability. For speech enhancement under challenging acoustic conditions, we investigate bone-conducted speech. Bone-conducted speech captures speech signals transmitted through bone vibration rather than through the air, offering a resilient alternative in noisy environments. This thesis explores personalized enhancement approaches using deep neural networks to improve the intelligibility and quality of bone-conducted speech, with a focus on speaker adaptation.

Description

Citation

Publisher

License

Queen's University's Thesis/Dissertation Non-Exclusive License for Deposit to QSpace and Library and Archives Canada
ProQuest PhD and Master's Theses International Dissemination Agreement
Intellectual Property Guidelines at Queen's University
Copying and Preserving Your Thesis
This publication is made available by the authority of the copyright owner solely for the purpose of private study and research and may not be copied or reproduced except as permitted by the copyright laws without written authority from the copyright owner.
Attribution 4.0 International

Journal

Volume

Issue

PubMed ID

External DOI

ISSN

EISSN