Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Paper • 2504.04152 • Published • 1
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources