Paper Title

Glottal Closure and Opening Instant Detection from Speech Signals

Authors

Thomas Drugman, Thierry Dutoit

Abstract

This paper proposes a new procedure to detect Glottal Closure and Opening Instants (GCIs and GOIs) directly from speech waveforms. The procedure is divided into two successive steps. First, a mean-based signal is computed, and intervals where speech events are expected to occur are extracted from it. Second, within each interval, the precise position of the speech event is assigned by locating a discontinuity in the Linear Prediction residual. The proposed method is compared to the DYPSA algorithm on the CMU ARCTIC database. A significant improvement and better noise robustness are reported. In addition, the GOI identification accuracy results are promising for glottal source characterization.
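
The two-step idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation, and it covers GCIs only. Everything the abstract does not state is an assumption made for illustration: the Blackman window spanning about 1.75 mean pitch periods, the LP order of 12, a single LP analysis over the whole signal (rather than frame by frame), and the hypothetical rule that each search interval starts at a local minimum of the mean-based signal and lasts half a mean pitch period. The function names are hypothetical as well.

```python
import numpy as np


def mean_based_signal(x, fs, f0_mean, window_factor=1.75):
    """Step 1a: smooth x with a Blackman window (assumed length: 1.75 mean periods)."""
    x = np.asarray(x, dtype=float)
    half = int(round(window_factor * fs / f0_mean / 2))
    w = np.blackman(2 * half + 1)
    return np.convolve(x, w, mode="same") / len(w)


def lp_residual(x, order=12):
    """Inverse-filter x with LP coefficients from the autocorrelation method."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(x[: n - k], x[k:]) for k in range(order + 1)])
    # Toeplitz normal equations, with a tiny ridge for numerical stability.
    lags = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    R = r[lags] + 1e-9 * r[0] * np.eye(order)
    a = np.linalg.solve(R, r[1:])
    # e[n] = x[n] - sum_k a[k] * x[n - k]
    return np.convolve(x, np.concatenate(([1.0], -a)))[:n]


def detect_gci(x, fs, f0_mean, order=12):
    """Step 1b + step 2: pick the largest |LP residual| in each search interval."""
    y = mean_based_signal(x, fs, f0_mean)
    e = np.abs(lp_residual(x, order))
    # Local minima of the mean-based signal mark the start of each interval.
    minima = np.where((y[1:-1] < y[:-2]) & (y[1:-1] <= y[2:]))[0] + 1
    # Hypothetical interval length: half a mean pitch period.
    span = max(1, int(round(0.5 * fs / f0_mean)))
    gcis = []
    for m in minima:
        lo, hi = int(m), min(int(m) + span, len(e))
        gcis.append(lo + int(np.argmax(e[lo:hi])))
    return np.unique(np.asarray(gcis, dtype=int))
```

Calling `detect_gci(x, fs, f0_mean)` on a mono waveform `x` sampled at `fs` Hz, with a rough estimate `f0_mean` of the speaker's mean pitch, returns candidate GCI positions as sample indices. The appeal of the split is that the smooth mean-based signal narrows the search to one short interval per pitch period, so the noisier LP residual only has to discriminate within that interval.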
