[BUG]kl_ctl大于0时(使用reference model),训练报错
March 27, 2026 · #1099
Python
Difficulty: Medium
Labels
bug npu
Parent Repository
inclusionAI/AReaL
Python repository
5,011 456
Labels
inclusionAI/AReaL
Python repository
Sign in required
Authenticate to use favourites & bookmarks