Curated SFT datasets for instruction-following and conversational fine-tuning
Behrooz Azarkhalili
ermiaazarkhalili
AI & ML interests
LLMs, VLMs, PEFT, RL for LLMs and VLMs.
Recent Activity
published a model about 1 hour ago: ermiaazarkhalili/granite-4.0-micro-GRPO-NuminaMath-10K
published a model about 1 hour ago: ermiaazarkhalili/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-DAPO-Math-17k-Processed-10K
published a model about 1 hour ago: ermiaazarkhalili/LFM2.5-1.2B-Instruct-GRPO-NuminaMath-10K