AmberYifan/Llama-3.1-8B-Instruct-wildfeedback-DRIFT-iter2 Text Generation • 8B • Updated Jul 19, 2025
AmberYifan/Llama-3.1-8B-Instruct-wildfeedback-iterDPO-iter1 Text Generation • 8B • Updated Jul 18, 2025 • 3
AmberYifan/Llama-3.1-8B-Instruct-wildfeedback-SPIN-iter1 Text Generation • 8B • Updated Jul 18, 2025 • 1
AmberYifan/Llama-3.1-8B-Instruct-wildfeedback-DRIFT-iter1 Text Generation • 8B • Updated Jul 18, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter3 Text Generation • 8B • Updated Jul 11, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter3 Text Generation • 8B • Updated Jul 11, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-spin-iter3 Text Generation • 8B • Updated Jul 10, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-nspin-iter3 Text Generation • 8B • Updated Jul 10, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-iterdpo-iter3 Text Generation • 8B • Updated Jul 10, 2025 • 1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-sft Text Generation • 8B • Updated Jul 3, 2025 • 1
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt-sft Text Generation • 8B • Updated Jul 3, 2025 • 4
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-gpt-sft Text Generation • 8B • Updated Jul 3, 2025 • 2
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-gpt-sft Text Generation • 8B • Updated Jul 3, 2025 • 3
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-sft Text Generation • 8B • Updated Jul 3, 2025 • 4
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt Text Generation • 8B • Updated Jul 3, 2025 • 6
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt Text Generation • 8B • Updated Jul 3, 2025 • 1
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-gpt Text Generation • 8B • Updated Jul 3, 2025 • 3
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-gpt Text Generation • 8B • Updated Jul 3, 2025
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-gpt Text Generation • 8B • Updated Jul 3, 2025 • 3
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-iterDPO-iter2 Text Generation • 8B • Updated Jun 30, 2025 • 1 • 1