hallucinations-leaderboard

community

https://www.neuralnoise.com

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

rohitsaxena authored a paper 2 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

rohitsaxena authored a paper 2 days ago

Do Composed Image Retrieval Benchmarks Require Multimodal Composition?

pminervini authored a paper 17 days ago

VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models

View all activity

spaces 1

Hallucinations Leaderboard

View and submit LLM evaluations

models 0

None public yet

datasets 2

hallucinations-leaderboard/requests

Preview • Updated Oct 31, 2024 • 1.76k

hallucinations-leaderboard/results

Updated Oct 31, 2024 • 24.2k • 2