Paper Title

Arabic Text-To-Speech (TTS) Data Preparation

Authors

Masri, Hala Al; Za'ter, Muhy Eddin

Abstract

People may be puzzled that voice-over recording datasets exist alongside advances in Text-to-Speech (TTS) synthesis systems, though the two are not at odds. The goal of this study is to explain the relevance of TTS as well as the data preparation procedures. TTS relies heavily on recorded data, which can have a substantial influence on the output of TTS modules. Furthermore, whether the domain is specialized or general, appropriate data should be developed to cover all expected language variants and domains. Different recording methodologies, taking into account quality and behavior, may also be advantageous in developing the module. In light of the lack of Arabic in present synthesis systems, numerous variables that affect the flow of recorded utterances are considered in order to manipulate an Arabic TTS module. In this study, two viewpoints are discussed: linguistics and the creation of high-quality recordings for TTS. The purpose of this work is to shed light on how ground-truth utterances may influence the evolution of speech systems in terms of naturalness, intelligibility, and understanding. We will provide voice actor specifications as well as data specifications that will assist voice actors and voice coaches in the studio, as well as the annotators who will evaluate the audio.
