The *RLMT* collection. Coming soon!
Princeton NLP group
princeton-nlp
AI & ML interests
None yet
Organizations
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 498 • • 172 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 28 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 43 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 810 •
RLMT Experiments
The *RLMT* collection. Coming soon!
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 498 • • 172 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 28 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 43 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 810 •
models 306
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B-Instruct
8B • Updated • 1
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B-Instruct
8B • Updated • 1
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B
8B • Updated • 1
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B
8B • Updated • 4
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B-Instruct
8B • Updated • 7
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B-Instruct
8B • Updated • 3
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B
8B • Updated • 1
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B
8B • Updated • 3
princeton-nlp/zero__grpo__nothink__Qwen2.5-7B
8B • Updated • 1
princeton-nlp/zero__grpo__nothink__Llama-3.1-8B
8B • Updated • 2
datasets 47
princeton-nlp/rl_tulu3_wildchat-if_prompts
Viewer • Updated • 7.79k • 61 • 5
princeton-nlp/gemini_2.5_flash_0417_sft-data
Viewer • Updated • 6k • 11 • 1
princeton-nlp/prolong-data-512K
Updated • 6.36k • 11
princeton-nlp/SWE-bench_Lite
Viewer • Updated • 323 • 85.1k • 57
princeton-nlp/SWE-bench
Viewer • Updated • 21.5k • 24.4k • 135
princeton-nlp/SWE-bench_Verified
Viewer • Updated • 500 • 699k • 331
princeton-nlp/TextbooksBySubject
Viewer • Updated • 129 • 17 • 1
princeton-nlp/TextbookChapters
Viewer • Updated • 77.9k • 39 • 12
princeton-nlp/SWE-bench_Multimodal
Viewer • Updated • 612 • 3.12k • 21
princeton-nlp/fineweb_edu-swahili-translated
Viewer • Updated • 137k • 15 • 2