Libero
updated
RLinf/RLinf-OpenSora-LIBERO-Spatial
RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning
• 8B • Updated • 245
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-object
Reinforcement Learning
• 8B • Updated • 5
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-goal
Reinforcement Learning
• 8B • Updated • 3
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-90
Reinforcement Learning
• 8B • Updated • 2
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora
Reinforcement Learning
• 8B • Updated • 206
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-spatial
Reinforcement Learning
• 8B • Updated • 17
RLinf/RLinf-OpenVLAOFT-GRPO-LIBERO-long
Reinforcement Learning
• 8B • Updated • 69
RLinf/RLinf-OpenVLAOFT-LIBERO-130
Reinforcement Learning
• 8B • Updated • 739
• 3
RLinf/RLinf-Gr00t-RL-Spatial-Step400
RLinf/RLinf-Gr00t-RL-Goal-Step500
Updated
RLinf/RLinf-Gr00t-RL-Long-Step300
RLinf/RLinf-Gr00t-RL-Object-Step400
Updated
RLinf/RLinf-Gr00t-SFT-Object
3B • Updated • 4
RLinf/RLinf-Gr00t-SFT-Goal
3B • Updated • 5
RLinf/RLinf-Gr00t-SFT-Long
3B • Updated • 5
RLinf/RLinf-Gr00t-SFT-Spatial
3B • Updated • 31
• 1
RLinf/RLinf-Pi05-PPO-LIBERO-130
Updated
RLinf/RLinf-Pi05-LIBERO-130-fullshot-SFT
RLinf/RLinf-Pi0-LIBERO-130-fullshot-SFT
RLinf/RLinf-Pi05-LIBERO-SFT
Robotics
• 4B • Updated • 2
RLinf/RLinf-Pi0-LIBERO-Spatial-Object-Goal-SFT
4B • Updated
RLinf/RLinf-Pi0-LIBERO-Long-SFT
4B • Updated
π_RL: Online RL Fine-tuning for Flow-based
Vision-Language-Action Models
Paper
• 2510.25889
• Published • 66