Towards Resiliency in Large Language Model Serving with KevlarFlow Paper • 2601.22438 • Published 27 days ago • 3
CUA-Skill: Develop Skills for Computer Using Agent Paper • 2601.21123 • Published 28 days ago • 13
TENET: Leveraging Tests Beyond Validation for Code Generation Paper • 2509.24148 • Published Sep 29, 2025 • 4
Unified Software Engineering agent as AI Software Engineer Paper • 2506.14683 • Published Jun 17, 2025 • 1
CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks Paper • 2507.05269 • Published Jul 3, 2025 • 1