Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

updated 2 datasets about 15 hours ago

OpenRLHF/aime-2024

Viewer • Updated about 15 hours ago • 30 • 32

OpenRLHF/dapo-math-17k

Viewer • Updated about 15 hours ago • 17.4k • 30

authored a paper 3 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 7 days ago • 80

published 2 datasets 4 days ago

OpenRLHF/aime-2024

Viewer • Updated about 15 hours ago • 30 • 32

OpenRLHF/dapo-math-17k

Viewer • Updated about 15 hours ago • 17.4k • 30

upvoted a paper 4 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 7 days ago • 80

upvoted 2 papers 4 months ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16, 2025 • 16

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1, 2025 • 19

liked 2 models 5 months ago

moonshotai/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated 8 days ago • 12.3k • 666

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • 12B • Updated Nov 25, 2025 • 92k • 152

updated a dataset 5 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 5 • 1

published a dataset 5 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 5 • 1

New activity in nvidia/NVIDIA-Nemotron-Nano-9B-v2 5 months ago

A problem when I asked the model: 你是谁？ ("Who are you?")

#8 opened 6 months ago by wenzel94

upvoted a paper 6 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11, 2025 • 50

liked 2 models 6 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 5.98M • 4.31k

mistralai/Devstral-Small-2505

24B • Updated Aug 18, 2025 • 29.3k • 860

liked a dataset 6 months ago

MegaScience/MegaScience

Viewer • Updated Jul 24, 2025 • 1.25M • 8.73k • 123

updated a model 6 months ago

OpenRLHF/Llama-3-8b-rm-700k

Text Ranking • 8B • Updated Jul 28, 2025 • 375 • 3

liked 2 datasets 7 months ago

newfacade/LeetCodeDataset

Viewer • Updated May 29, 2025 • 2.87k • 1.46k • 57

JustinTX/WildSci

Viewer • Updated 25 days ago • 56.8k • 87 • 11
