ZIQI ZHANG's picture

3

ZIQI ZHANG

forencegan

·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 5 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 19

upvoted 2 articles 5 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

78

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

272