[BUG] Training fails with an error when kl_ctl is greater than 0 (i.e., when a reference model is used)

March 27, 2026 · #1099
Python · Difficulty: Medium

Labels

bug npu
