The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...
-
allenai/Olmo-3.1-32B-Think
Text Generation β’ 32B β’ Updated β’ 2.25k β’ β’ 57 -
allenai/Olmo-3.1-32B-Instruct-SFT
32B β’ Updated β’ 1.83k β’ 5 -
allenai/Olmo-3.1-32B-Instruct-DPO
Text Generation β’ 32B β’ Updated β’ 723 β’ 4 -
allenai/Olmo-3.1-32B-Instruct
Text Generation β’ 32B β’ Updated β’ 3.53k β’ β’ 35