MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28, 2025 • 180
tiny-aya-safety/sorry-bench-202503-cohere-translation-tier1 Viewer • Updated 20 days ago • 59.4k • 27
tiny-aya-safety/sorry-bench-202503-cohere-translation-tier1 Viewer • Updated 20 days ago • 59.4k • 27
sorry-bench/ft-mistral-7b-instruct-v0.2-sorry-bench-202406 Text Generation • 7B • Updated Jul 2, 2024 • 1.32k • 8
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 505
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 227