BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation Paper • 2604.09497 • Published 8 days ago • 28
pszemraj/franken-gemma-4-dense-1b-finevisi-1.5K Image-Text-to-Text • 1.0B • Updated about 4 hours ago
pszemraj/franken-gemma-4-dense-1b-finevisi-1.5K Image-Text-to-Text • 1.0B • Updated about 4 hours ago
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 4 days ago • 6
Cross-Tokenizer LLM Distillation through a Byte-Level Interface Paper • 2604.07466 • Published 5 days ago • 4
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data Paper • 2604.14164 • Published 26 days ago • 18
Running on Zero Agents 6 Gemma 4 26B-A4B It 🚀 6 Chat with a multimodal AI using text, images, or video
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper • 2603.23516 • Published Mar 6 • 48
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks Paper • 2603.24755 • Published 23 days ago • 30