Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 504
Running Featured 567 Image Arena Leaderboard 📊 567 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 240 MMLU-Pro Leaderboard 🥇 240 More advanced and challenging multi-task evaluation
indic-evals Collection Translated versions of popular LLM benchmarks. • 9 items • Updated May 23, 2025 • 10
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 228