Journal of Information Security Reserach ›› 2026, Vol. 12 ›› Issue (5): 452-.

Previous Articles     Next Articles

Highperformance Hardware Architecture Design and Implementation of the MLKEM Algorithm#br#

Li Jiacheng, Xu Songyan, and Cao Lei   

  1. (Information Security and Cryptographic Engineering Research Center, Beijing Research Institute of Telemetry, Beijing 100076)
  • Online:2026-05-23 Published:2026-05-23

MLKEM算法的高性能硬件架构设计与实现

李嘉澄徐松艳曹雷   

  1. (北京遥测技术研究所信息安全与密码工程技术研究中心北京100076)
  • 通讯作者: 徐松艳 硕士,研究员.主要研究方向为认证与密钥协商协议. 15293778508@163.com
  • 作者简介:李嘉澄 硕士研究生.主要研究方向为传统公钥与后量子密码算法的FPGA硬件实现与优化. 2019302180037@whu.edu.cn 徐松艳 硕士,研究员.主要研究方向为认证与密钥协商协议. 15293778508@163.com 曹雷 硕士,研究员.主要研究方向为基于FPGA的密码算法实现与加速. 17786526757@163.com

Abstract: The postquantum cryptographic algorithm MLKEM has emerged as a key standard for resisting quantum computing attacks, and its highspeed hardware implementation is crucial for future network security. This paper proposes a hardware architecture for MLKEM and optimizes several specific modules within the design. For the encoding and decoding operations, FIFObased processing is employed to reduce resource consumption. In addition, the compression operation is analyzed to eliminate division operations during the compression process, and division and rounding are combined through specific parameter selection. Furthermore, a compact operation schedule is adopted to increase parallelism while reducing the overall computation latency. The proposed design is implemented and synthesized on a Xilinx Artix7 platform. Experimental results show that the maximum operating frequency reaches 125MHz, with resource utilization of 19791 LUTs, 8364 FFs, 22 BRAMs, and 9 DSPs. Under three security levels, the latencies of key generation, encapsulation, and decapsulation are 354149576593 cycles, 404255047183 cycles, and 6822907211274 cycles, respectively. The proposed architecture achieves a favorable balance between resource overhead and computational performance.

Key words: postquantum cryptography, MLKEM, Kyber, hardware acceleration, polynomial arithmetic

摘要: 格基密钥封装机制(module latticebased key encapsulation mechanismm, MLKEM)已成为抵御量子计算攻击的关键标准,其高速硬件实现对于未来网络安全至关重要.提出了一种MLKEM硬件架构,并对其中的特定模块进行了优化.使用FIFO进行编解码操作,减少了资源消耗.同时,还对压缩运算进行了分析,避免了压缩过程中的除法运算,并通过特定的参数选择将除法与舍入操作合并进行.最后,对操作时序进行了紧凑安排,在提高并行度的同时缩短了整体运算时间.在Xilinx Artix7平台上进行了实验.结果表明,最高运行频率为125MHz,资源消耗为19791个LUT、8364个FF、22个BRAM和9个DSP,3种安全强度下密钥生成、封装与解封装操作的延迟分别为354149576593时钟周期、404255047183时钟周期和6822907211274时钟周期,在资源开销与运算速度之间取得了较好平衡.

关键词: 后量子密码, MLKEM, Kyber, 硬件加速, 多项式运算

CLC Number: