arxiv:2407.03651
Amanda Dsouza
andsouzasnorkelai
AI & ML interests
None yet
Recent Activity
upvoted a paper 15 days ago
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks upvoted a paper about 1 month ago
SkillOrchestra: Learning to Route Agents via Skill Transfer liked a dataset 5 months ago
snorkelai/Tau2-Bench-Airline-With-Code-Agents