Jinn's Hub
about / blog / projects / ZH /
All tags

Posts tagged with "RL"

    NeMo-RL vs slime: RL Training Framework Comparison
    Deep comparison of two RL training frameworks: algorithms, engineering quality, MoE support, and ROCm compatibility
    TritonForge: Server-based Multi-turn RL for Triton Kernel Generation
    End-to-end server-based RL training and evaluation system for Triton kernel generation across NVIDIA and AMD, built on slime + Megatron
    SFT & RL Training Guide
    Complete guide from SFT vs RL fundamentals, loss computation, dataset construction to RLHF in practice
© 2026 • Jinn's Hub 🔬
Press Esc or click anywhere to close