Journal of Information Security Reserach ›› 2024, Vol. 10 ›› Issue (2): 122-.

Previous Articles     Next Articles

Generative Fake Speech Security Issue and Solution#br#
#br#

Feng Chang1,2, Wu Xiaolong2,3, Zhao Yiyang1,2, Xu Mingxing1,2, and Zheng Fang1,2#br# #br#   

  1. 1(Department of Computer Science and Technology, Tsinghua University, Beijing 100084)
    2(Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084)
    3(School of Computer Science and Technology, Xinjiang University, Urumqi 830046)

  • Online:2024-02-21 Published:2024-02-26

生成式伪造语音安全问题与解决方案

冯畅1,2吴晓龙2,3赵熠扬1,2徐明星1,2郑方1,2   

  1. 1(清华大学计算机科学与技术系北京100084)
    2(清华大学北京信息科学与技术国家研究中心北京100084)
    3(新疆大学计算机科学与技术学院乌鲁木齐830046)

  • 通讯作者: 郑方 博士,教授.主要研究方向为说话人识别、语音识别、自然语言处理. fzheng@tsinghua.edu.cn
  • 作者简介:冯畅 博士研究生.主要研究方向为伪造语音检测. fc19@mails.tsinghua.edu.cn 吴晓龙 博士研究生.主要研究方向为语音情感识别. wuxl@stu.xju.edu.cn 赵熠扬 硕士研究生.主要研究方向为说话人识别. zhaoyy22@mails.tsinghua.edu.cn 徐明星 博士,副研究员.主要研究方向为语音情感识别、声纹识别. xumx@tsinghua.edu.cn 郑方 博士,教授.主要研究方向为说话人识别、语音识别、自然语言处理. fzheng@tsinghua.edu.cn

Abstract: The development of generative artificial intelligence algorithms has made the generation of fake speech increasingly natural and fluid, making it challening for human listeners  to distinguish the genuine and fake speech. This paper firstly analyzes a series of threats to society posed by the improper abuse of generative fake speech, including an increase in telecommunication fraud, a decline in the security of voiceoperated applications, judicial fairness of forensic identification, and deception to the public through the combination of falsified information across various domains. Subsequently, the paper summarizes and classifies the algorithms of fake speech generation and fake speech detection technology from the perspective of technology development. We explains the procedural aspects of the technologies and their key points, along with an analysis of the challenges encountered in the process of application. Finally, this paper outlines strategies to prevent and address these security issues from four aspects: technical application, institutional regulation, public education and international cooperation.

Key words: generative artificial intelligence, fake speech, security issue of fake speech, fake speech detection, solution to fake speech threat

摘要: 生成式人工智能算法的发展使得生成式伪造语音更加自然流畅,人类听力难以分辨真伪.首先分析了生成式伪造语音不当滥用对社会造成的一系列威胁,如电信诈骗更加泛滥、语音应用程序安全性下降、司法鉴定公正性受到影响、综合多领域的伪造信息欺骗社会大众等.然后从技术发展角度,对生成式伪造语音的生成算法和检测算法分别进行总结与分类,阐述算法流程步骤及其中的关键点,并分析了技术应用的挑战点.最后从技术应用、制度规范、公众教育、国际合作4方面阐述了如何预防以及解决生成式伪造语音带来的安全问题.

关键词: 生成式人工智能, 伪造语音, 伪造语音安全问题, 伪造语音检测, 伪造语音威胁解决

CLC Number: