v0.50.1: Live HF Trainer callback for the 7 GRPO stability/efficiency knobs
May 11, 2026 ยท #127
Python
Difficulty: Medium
Labels
enhancement help wanted
Parent Repository
MakazhanAlpamys/Soup
Python repository
57 9
Labels
MakazhanAlpamys/Soup
Python repository
Sign in required
Authenticate to use favourites & bookmarks