[LG]《Self-Play Preference Optimization for Language Model Alignment》Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu [University of California, Los Angeles & CMU] (2024) http://t.cn/A6Hv3EgJ #机器学习##人工智能##论文#
发布于 北京
