AI & ML interests
None yet
Organizations
sohyunan/DeepSeek-R1-Distill-Qwen-1.5B-sft-lora
Updated
sohyunan/DeepSeek-R1-Distill-Qwen-1.5B-sft-full
Updated
sohyunan/gemma-2-2b-it-maze-sft-sys0.0
Text Generation
•
3B
•
Updated
•
9
•
sohyunan/gemma-2-2b-it-maze-sft-ctrl-sys0.5-a_star
Text Generation
•
3B
•
Updated
•
4
•
sohyunan/gemma-2-2b-it-maze-sft-sys1.0-a_star
Text Generation
•
3B
•
Updated
•
3
sohyunan/gemma-2-2b-it_controller_sft_random_grpo
Text Generation
•
3B
•
Updated
•
4
Text Generation
•
Updated
•
9
sohyunan/gemma-2-2b-it_controller_sft_random_grpo_lora
Updated
sohyunan/gemma-2-2b-it_controller_grpo_lora
Updated
sohyunan/gemma-2-2b-it_controller_sft_random
Text Generation
•
3B
•
Updated
•
7
sohyunan/gemma-2-2b-it_controller-grpo
Text Generation
•
3B
•
Updated
•
4
sohyunan/Mistral-7B-Instruct-v0.2_controller
Text Generation
•
7B
•
Updated
•
2
sohyunan/gemma-2-2b-it_controller
Text Generation
•
3B
•
Updated
•
2
sohyunan/Mistral-7B-Instruct-v0.2_system1
Text Generation
•
7B
•
Updated
•
2
sohyunan/Mistral-7B-Instruct-v0.2_a_star_obstacles_system2
Text Generation
•
7B
•
Updated
•
2
sohyunan/Qwen2.5-1.5B-Open-R1-GRPO
Updated