Advancing Polish Language Modeling through Tokenizer Optimization in the Bielik v3 7B and 11B Series Paper • 2604.10799 • Published 20 days ago • 6
speakleash/Bielik-Minitron-7B-v3.0-Instruct-FP8-Dynamic Text Generation • 7B • Updated Mar 30 • 1.88k • 1
speakleash/Bielik-Minitron-7B-v3.0-Instruct-FP8-Dynamic Text Generation • 7B • Updated Mar 30 • 1.88k • 1
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30, 2025 • 550