[LG]《RLVF: Learning from Verbal Feedback without Overgeneralization》M Stephan, A Khazatsky, E Mitchell, A S Chen, S Hsu, A Sharma, C Finn [Stanford University] (2024) http://t.cn/A6YoCU6N #机器学习##人工智能##论文#
发布于 北京
