Running 193 The ultimate guide to RL environments: building and scaling them in the LLM era π 193 Building and scaling RL environments for LLM training
sorry-bench/ft-mistral-7b-instruct-v0.2-sorry-bench-202406 Text Generation β’ 7B β’ Updated Jul 2, 2024 β’ 3.87k β’ 9
Running Featured 49 Porting nanochat to Transformers: an AI modeling history lesson π 49 Learn about ML and Transformers through nanochat
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook π 3.22k The secrets to building world-class LLMs
Build error Agents 1 Hugging Research π 1 CodeAgent-based research assistant for the Hugging Face Hub
Runtime error 16 FastAPI + React Template π» 16 Template to vibe code a demo running on FastAPI + React