Journal of Information Security Reserach ›› 2025, Vol. 11 ›› Issue (8): 682-.

    Next Articles

Design of a Large Model Data Supervision System Based on Blockchain

Li Shouwei1,3,4, Zhang Jiazheng2, He Haibo2, and Chen Minghui2   

  1. 1(School of Economics and Management, Southeast University, Nanjing 211189)
    2(School of Cyber Science and Engineering, Southeast University, Nanjing 211189)
    3(Research and Development Center for System and Information Engineering, Southeast University, Nanjing 211189)
    4(Engineering Research Center of Blockchain Application, Supervision and Management (Southeast University), Ministry of Education, Nanjing 211189)
  • Online:2025-08-28 Published:2025-08-28

基于区块链的大模型数据监管体系设计

李守伟1,3,4张嘉政2何海波2陈明辉2   

  1. 1(东南大学经济管理学院南京211189)
    2(东南大学网络空间安全学院南京211189)
    3(东南大学系统与信息工程研究发展中心南京211189)
    4(教育部区块链应用与监管工程研究中心(东南大学)南京211189)
  • 通讯作者: 张嘉政 博士研究生.主要研究方向为区块链技术、隐私保护、大模型. zjz@seu.edu.cn
  • 作者简介:李守伟 博士,教授.主要研究方向为人工智能、大数据与智能决策、区块链技术与应用. lishouwei@seu.edu.cn 张嘉政 博士研究生.主要研究方向为区块链技术、隐私保护、大模型. zjz@seu.edu.cn 何海波 博士研究生.主要研究方向为区块链技术、智能合约、大数据. 23821150@qq.com 陈明辉 博士研究生.主要研究方向为区块链技术、共识算法、密码学. chenminghuiemail@163.com

Abstract: Large model (LM) has shown great potential in the fields of natural language processing, image and speech recognition, and has become a key force driving the technological revolution and social progress. However, the wide application of LM technology brings challenges such as data privacy risks, data compliance regulation, and data regulatory activation and intelligence.  This paper aims to explore how to utilize blockchain to design and construct an effective data regulatory system to promote its healthy development, in order to meet the challenges brought by the application of massive data to LM. This paper analyzes the trends and current status of the development of LM at home and abroad, and points out the main challenges to LM data regulation, including data privacy risks, data compliance, and the difficulty of effective supervision by regulators . A blockchainbased data regulation system design scheme is proposed to address these challenges, which realizes the fullcycle data regulation of LM data from the native metadata to the input of training until the posttraining feedback through four interconnected modules, namely, privacy protection, consensus algorithm, incentive mechanism, and smart contract. Finally, the application prospect of blockchain in LM data supervision is summarized, and the future trend of data supervision is outlooked.

Key words: large model, blockchain, large model data regulation, big data, privacy protection, data security

摘要: 大模型(large model, LM)在自然语言处理、图像、语音识别等领域展现出巨大潜力,成为推动科技革命与社会进步的关键力量.但大模型技术的广泛应用带来了数据隐私风险、数据合规性监管、数据监管活跃性与智能化等挑战.旨在探讨如何利用区块链技术设计和构建一个有效的大模型数据监管体系促进其健康发展,以应对海量数据应用于大模型所带来的挑战.分析了国内外大模型发展的趋势和现状,指出了大模型数据监管面临的主要挑战,包括数据隐私问题、数据合规性、监管机构难以有效监督等.针对这些挑战提出一种基于区块链技术的数据监管体系设计方案,通过隐私保护、共识算法、激励机制和智能合约4个互相联动的模块实现对大模型数据从原生元数据到输入大模型训练,直至训练后反馈的全周期数据监管.最后总结了区块链技术在大模型数据监管中的应用前景,并对未来大模型数据监管的发展趋势进行了展望.

关键词: 大模型, 区块链, 大模型数据监管, 大数据, 隐私保护, 数据安全

CLC Number: