HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs • Paper 2509.23967 • Published Sep 28, 2025