[CL] "Encode Once and Decode in Parallel: Efficient Transformer Decoding" B Lu, N Haduong, C Lin, H Cheng, N A. Smith, M Ostendorf [University of Washington & Microsoft Research] (2024) http://t.cn/A6TApXKC #MachineLearning# #ArtificialIntelligence# #Paper#
Posted from Beijing
