SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 25 days ago • 121
ContextBench: A Benchmark for Context Retrieval in Coding Agents Paper • 2602.05892 • Published Feb 5 • 5