-
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Paper • 2604.09497 • Published • 21 -
artefactory/BERTJudge
0.2B • Updated • 18 • 1 -
artefactory/BERTJudge-Formatted-QCR
0.2B • Updated -
artefactory/BERTJudge-Formatted-CR
0.2B • Updated • 6
AI & ML interests
NLP, Information Retrieval, Computer Vision, Uncertainty Estimation, Trustworthy AI, Bias Estimation, Unbalanced ML, Choice Modeling, Time Series
Recent Activity
Papers
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate
Artefact is a data-driven company specializing in Artificial Intelligence and Machine Learning solutions.
Our mission:
👉 We help organizations unlock the full potential of their data, empowering them to make smarter decisions and drive digital transformation.
👉 We place a strong emphasis on research-oriented innovation, actively contributing to the AI and data science community.
Visit our website | Follow us on LinkedIn
-
BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation
Paper • 2604.09497 • Published • 21 -
artefactory/BERTJudge
0.2B • Updated • 18 • 1 -
artefactory/BERTJudge-Formatted-QCR
0.2B • Updated -
artefactory/BERTJudge-Formatted-CR
0.2B • Updated • 6