swiss-ai/Apertus-8B-Instruct-2509 Text Generation • 8B • Updated Nov 14, 2025 • 437k • • 417
FreedomIntelligence/medical-o1-verifiable-problem Viewer • Updated Dec 30, 2024 • 40.6k • 468 • 119
FreedomIntelligence/medical-o1-reasoning-SFT Viewer • Updated Apr 22, 2025 • 90.1k • 5.05k • 1.03k
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 178
nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated Nov 21, 2025 • 1.95k • 234
Discovering Preference Optimization Algorithms with and for Large Language Models Paper • 2406.08414 • Published Jun 12, 2024 • 16
Discovering Preference Optimization Algorithms with and for Large Language Models Paper • 2406.08414 • Published Jun 12, 2024 • 16