Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

hallucinations-leaderboard

community
https://www.neuralnoise.com
pminervini
pminervini
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

rohitsaxena  authored a paper 2 days ago
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
rohitsaxena  authored a paper 2 days ago
Do Composed Image Retrieval Benchmarks Require Multimodal Composition?
pminervini  authored a paper 17 days ago
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
View all activity

Pasquale Minervini's profile pictureClémentine Fourrier's profile picturePing Nie's profile pictureRohit Saxena's profile pictureAryo Pradipta Gema's profile pictureYu Zhao's profile pictureXuanli He's profile pictureXiaotang Du's profile pictureGiwon Hong's profile pictureAntonio Valerio Miceli Barone's profile picture

spaces 1

pinned
Runtime error
Agents
145

Hallucinations Leaderboard

🔥

View and submit LLM evaluations

Jun 12, 2024

models 0

None public yet

datasets 2

hallucinations-leaderboard/requests

Preview • Updated Oct 31, 2024 • 1.76k

hallucinations-leaderboard/results

Updated Oct 31, 2024 • 24.2k • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs