BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop Paper • 2602.20092 • Published 4 days ago
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data Paper • 2510.10159 • Published Oct 11, 2025
BERnaT: Basque Encoders for Representing Natural Textual Diversity Paper • 2512.03903 • Published Dec 3, 2025
Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts Paper • 2510.24541 • Published Oct 28, 2025