view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23 β’ 62
view article Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn Sep 4 β’ 28
view article Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning Aug 9 β’ 12
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLateβs ColBERT architecture and tuned for seven European languages. β’ 7 items β’ Updated Aug 3 β’ 20
NER ITA Collection This collection presents my best models tailored for Named Entity Recognition (NER) tasks, exclusively designed for the Italian language. β’ 3 items β’ Updated Jul 20 β’ 3
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 β’ 177
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 β’ 222
view article Article **Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs** Dec 19, 2024 β’ 4
view article Article SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive Nov 9, 2024 β’ 9