论文标题

对抗和安全扩展的问题产生

Adversarial and Safely Scaled Question Generation

论文作者

Sankar, Sreehari, Dong, Zhihang

论文摘要

最近的问题产生引起了很多研究兴趣,尤其是随着大型语言模型的出现。本身,问题的产生可以被视为“ AI-HARD”,因为缺乏使问题“好”或“坏”的一致意识到。在本文中,我们可以并行解决两个基本问题:一方面,我们尝试解决缩放问题,在这些问题上,问题生成和回答应用必须应用于大量的文本,而无需地面真相标记。解决此问题的通常方法是下样本或总结。但是,这些方法存在错误信息的关键风险。另一方面,与错误信息问题有关,我们试图解决“安全”问题,因为许多公共机构都依赖于其提供的内容的更高准确性。我们介绍了一种对抗性方法,可以通过规模解决问题的生成安全问题。具体来说,我们设计了一个提问系统,该系统专门修剪了可能产生的无法回答的问题,并进一步提高了生成的答案的质量。我们建立了一条准备就绪的,易于插入的管道,该管道可用于任何给定的文本,可扩展且免于产生任何仇恨言论,亵渎或错误信息。根据结果​​,我们能够产生超过六倍以上的质量问题数量,而抽象方法产生的质量问题数量高出44%,对168名参与者进行了调查。

Question generation has recently gained a lot of research interest, especially with the advent of large language models. In and of itself, question generation can be considered 'AI-hard', as there is a lack of unanimously agreed sense of what makes a question 'good' or 'bad'. In this paper, we tackle two fundamental problems in parallel: on one hand, we try to solve the scaling problem, where question-generation and answering applications have to be applied to a massive amount of text without ground truth labeling. The usual approach to solve this problem is to either downsample or summarize. However, there are critical risks of misinformation with these approaches. On the other hand, and related to the misinformation problem, we try to solve the 'safety' problem, as many public institutions rely on a much higher level of accuracy for the content they provide. We introduce an adversarial approach to tackle the question generation safety problem with scale. Specifically, we designed a question-answering system that specifically prunes out unanswerable questions that may be generated, and further increases the quality of the answers that are generated. We build a production-ready, easily-plugged pipeline that can be used on any given body of text, that is scalable and immune from generating any hate speech, profanity, or misinformation. Based on the results, we are able to generate more than six times the number of quality questions generated by the abstractive approach, with a perceived quality being 44% higher, according to a survey of 168 participants.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源