论文标题

太慢了,无法有用?将人类纳入智能扬声器的循环中

Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers

论文作者

Huang, Shih-Hong, Huang, Chieh-Yang, Deng, Yuxin, Shen, Hua, Kuan, Szu-Chi, Huang, Ting-Hao 'Kenneth'

论文摘要

合唱/evorus,Vizwiz和幻影等实时人群动力的系统已经表明,将人类纳入自动解决方案不足的地方如何补充自动化系统。但是,将这种体系结构应用于更多场景的一种不言而喻的瓶颈是将人类纳入自动化系统的循环中的较长延迟。对于在周转时间具有严格约束的应用程序,人类操作的组件的延长延迟和较大的速度变化似乎是显而易见的交易破坏者。本文通过使用基于文本的后端来解释和量化这些限制,通过仅语音智能扬声器与用户进行对话。智能扬声器必须在几秒钟内响应用户的请求,因此幕后的工人只有几秒钟来撰写答案。我们用八对参与者测量了端到端系统延迟和对话质量,显示了此类系统的挑战和优越性。

Real-time crowd-powered systems, such as Chorus/Evorus, VizWiz, and Apparition, have shown how incorporating humans into automated systems could supplement where the automatic solutions fall short. However, one unspoken bottleneck of applying such architectures to more scenarios is the longer latency of including humans in the loop of automated systems. For the applications that have hard constraints in turnaround times, human-operated components' longer latency and large speed variation seem to be apparent deal breakers. This paper explicates and quantifies these limitations by using a human-powered text-based backend to hold conversations with users through a voice-only smart speaker. Smart speakers must respond to users' requests within seconds, so the workers behind the scenes only have a few seconds to compose answers. We measured the end-to-end system latency and the conversation quality with eight pairs of participants, showing the challenges and superiority of such systems.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源