Lin Nianyi's picture

4

Lin Nianyi

linny2002

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

upvoted a paper 3 months ago

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

updated a model 3 months ago

THU-KEG/LLaDA-8B-BGPO-sudoku

View all activity

Organizations

upvoted a paper 4 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 6 days ago • 38

upvoted a paper 3 months ago

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

Paper • 2510.11683 • Published Oct 13, 2025 • 14

updated 4 models 3 months ago

THU-KEG/LLaDA-8B-BGPO-sudoku

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 4 • 1

THU-KEG/LLaDA-8B-BGPO-countdown

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 2 • 1

THU-KEG/LLaDA-8B-BGPO-code

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 3 • 1

THU-KEG/LLaDA-8B-BGPO-math

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 3 • 1

upvoted a collection 3 months ago

LLaDA-8B-BGPO

Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models • 4 items • Updated Oct 11, 2025 • 4

authored a paper 8 months ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83

upvoted a paper 8 months ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83