Ray121381/eveo_anchor_advantage_independent-qwen2.5-7b-sciworld-self-sum-self-gen-maxlen-2048 Updated 11 days ago
Ray121381/eveo_anchor_advantage_independent-qwen2.5-7b-sciworld-self-sum-self-gen-maxlen-2048 Updated 11 days ago
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 50