Liouville

M-best

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 29 days ago

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published about 1 month ago • 88

upvoted a paper about 1 month ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published May 19 • 59

upvoted a paper 6 months ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 113

upvoted a paper 7 months ago

Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning

Paper • 2512.05591 • Published Dec 5, 2025 • 17