Darshan Deshpande
DarshanDeshpande
AI & ML interests
Explainability, Robustness, Evaluations
Recent Activity
liked
a dataset
4 days ago
PatronusAI/trace-dataset
upvoted
a
paper
4 days ago
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
submitted
a paper
4 days ago
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis