Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 411
Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 10 days ago • 62
KITAB-Bench Collection A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding • 24 items • Updated Feb 24, 2025 • 19
Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family lightonai • Jan 19 • 96
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision Paper • 2601.03193 • Published Jan 6 • 51
Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 630
Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks +2 bezzam, Steveeeeeeen, eustlb, reach-vb • Nov 21, 2025 • 27
AraLingBench: A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper • 2511.14295 • Published Nov 18, 2025 • 74
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17, 2025 • 134
Qari-OCR: A High-Accuracy Model for Arabic Optical Character Collection Built on the powerful Qwen2 VL 2B and fine-tuned on an Arabic OCR dataset, Qari v0.1 de • 8 items • Updated Mar 2 • 18
Pearl Collection PEARL: A Multimodal Culturally-Aware Arabic Instruction Dataset • 4 items • Updated Oct 27, 2025 • 6
QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation Paper • 2506.02295 • Published Jun 2, 2025 • 14
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment Paper • 2507.20984 • Published Jul 28, 2025 • 59
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published Jul 11, 2025 • 81