miles-rl-training
miles-rl-training is a production-ready RL framework optimized for large-scale model post-training, supporting features like FP8 quantization-aware training and train-inference alignment.
★ 0
⑂ 0
Browse and install thousands of AI Agent skills in the Killer-Skills directory. Supports Claude Code, Windsurf, Cursor, and more.
miles-rl-training is a production-ready RL framework optimized for large-scale model post-training, supporting features like FP8 quantization-aware training and train-inference alignment.