GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms Paper • 2511.17592 • Published Nov 17, 2025 • 118
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5, 2025 • 133
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning Paper • 2505.22914 • Published May 28, 2025 • 36
Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models Paper • 2505.16134 • Published May 22, 2025 • 18
bartowski/Vikhr-Nemo-12B-Instruct-R-21-09-24-GGUF Text Generation • 12B • Updated Sep 23, 2024 • 826 • 14
Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24 Text Generation • 12B • Updated Oct 25, 2024 • 12.3k • 137
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters