Reinforcement Learning
Transformers
Safetensors
English
qwen2
text-generation
Search
QuestionAnswering
text-generation-inference
Instructions to use SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base")
model = AutoModelForCausalLM.from_pretrained("SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base")

- Notebooks
- Google Colab
- Kaggle
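
The Transformers snippet above only loads the tokenizer and model. As a minimal sketch of how a loaded checkpoint could then be used for text generation, the example below wraps loading and a greedy `generate` call in one function. The helper names `build_prompt` and `generate_answer`, the "Question: ... Answer:" prompt layout, and the `max_new_tokens` default are illustrative assumptions; this page does not document a prompt template or recommended generation settings for this model.

```python
MODEL_ID = "SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base"


def build_prompt(question: str) -> str:
    # Hypothetical plain-text prompt layout (an assumption, not taken
    # from the model card); adjust to whatever format the model expects.
    return f"Question: {question.strip()}\nAnswer:"


def generate_answer(question: str, model_id: str = MODEL_ID,
                    max_new_tokens: int = 64) -> str:
    # Heavy dependencies are imported lazily so build_prompt can be used
    # without torch or transformers installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    with torch.no_grad():
        # Greedy decoding; max_new_tokens is an illustrative default.
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )

    # Slice off the prompt tokens so only the continuation is decoded.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the model weights on first use):
# print(generate_answer("What is the capital of France?"))
```

Calling `generate_answer` downloads the checkpoint on first use and runs on CPU by default; moving the model and inputs to a GPU with `.to("cuda")` is a common adjustment when one is available.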