rhinosaur0/gsm8k-1.5B-rl-base-4 • 2B • Updated • 1
rhinosaur0/gsm8k-1.5B-rl-base-3 • 2B • Updated • 2
rhinosaur0/gsm8k-1.5B-rl-base-2 • 2B • Updated • 2
rhinosaur0/tensorstax-32b-22000-lora-32-2e-5-format-32b-no-bird-1704-gspo-1000 • 33B • Updated • 1
rhinosaur0/tensorstax-14b-tool-call-sft • Text Generation • 15B • Updated
rhinosaur0/gsm8k-1.5B-rl-split-450 • 2B • Updated • 1
rhinosaur0/gsm8k-1.5B-rl-split-200 • 2B • Updated • 1
rhinosaur0/gsm8k-1.5B-rl-base-479 • 2B • Updated • 2
rhinosaur0/tensorstax-14b-tool-call-light-sft • Text Generation • 15B • Updated • 1
rhinosaur0/gsm8k-1.5B-rl-base • 2B • Updated • 1
rhinosaur0/tensorstax-32b-lora-22000-dapo-630 • 33B • Updated • 1
rhinosaur0/tensorstax-32b-plan-only-sft-2400 • Text Generation • 33B • Updated
rhinosaur0/tensorstax-32b-plan-mask-sft-2400 • Text Generation • 33B • Updated
rhinosaur0/tensorstax-32b-plan-mask-sft-1200 • Text Generation • 33B • Updated • 1
rhinosaur0/tensorstax-32b-plan-mask-sft-1000 • Text Generation • 33B • Updated
rhinosaur0/tensorstax-32b-lora-22000-dapo-440 • 33B • Updated • 4
rhinosaur0/tensorstax-32b-plan-mask • Text Generation • 33B • Updated
rhinosaur0/TensorStax-8B-dapo-step-300
rhinosaur0/TensorStax-8B-dapo-step-650
rhinosaur0/TensorStax-8B-dapo-step-400