论文标题
删除冗余的语义意识到文本传输的语音
Semantic-aware Speech to Text Transmission with Redundancy Removal
论文作者
论文摘要
近年来,已经探索了基于深度学习(DL)的语义交流方法,以有效地传输图像,文本和语音。与传统的无线通信方法相反,该方法着眼于抽象符号的传输,语义通信方法仅通过发送与源数据的语义相关信息来实现更好的传输效率。在本文中,我们考虑对文本传输的语义语音。我们提出了一种新型的基于DL的端到端收发器,其中包括一个基于注意力的软对准模块和一个冗余去除模块,以压缩传输数据。特别是,前者仅提取与文本相关的语义特征,后者进一步删除了语义上的冗余内容,与现有方法相比,语义冗余的量大大减少了。我们还提出了一个两阶段的训练计划,该方案加快了提议的DL模型的训练。模拟结果表明,我们所提出的方法在收到的文本和传输效率的准确性方面优于当前方法。此外,提出的方法还具有较小的型号尺寸,并且端到端运行时较短。
Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years. In contrast to traditional wireless communication methods that focus on the transmission of abstract symbols, semantic communication approaches attempt to achieve better transmission efficiency by only sending the semantic-related information of the source data. In this paper, we consider semantic-oriented speech to text transmission. We propose a novel end-to-end DL-based transceiver, which includes an attention-based soft alignment module and a redundancy removal module to compress the transmitted data. In particular, the former extracts only the text-related semantic features, and the latter further drops the semantically redundant content, greatly reducing the amount of semantic redundancy compared to existing methods. We also propose a two-stage training scheme, which speeds up the training of the proposed DL model. The simulation results indicate that our proposed method outperforms current methods in terms of the accuracy of the received text and transmission efficiency. Moreover, the proposed method also has a smaller model size and shorter end-to-end runtime.