Track, rank and evaluate open LLMs and chatbots
Explore how model compression scales and compare model performance