[CL]《Recurrent Drafter for Fast Speculative Decoding in Large Language Models》A Zhang, C Wang, Y Wang, X Zhang, Y Cheng [Apple] (2024) http://t.cn/A6T7SS8w #机器学习##人工智能##论文#
发布于 北京
