1 8 3

Yibo Li

liushiliushi

https://liushiliushi.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

upvoted a paper 27 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 27 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

View all activity

Organizations

None yet

upvoted a paper 1 day ago

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Paper • 2602.07035 • Published 9 days ago • 27

upvoted 2 papers 27 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 30 days ago • 147

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 29 days ago • 89

upvoted a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 187

liked 2 models 5 months ago

liushiliushi/ConfTuner-Qwen

8B • Updated Sep 19, 2025 • 3 • 2

liushiliushi/ConfTuner-Ministral

Text Generation • 8B • Updated Sep 20, 2025 • 5 • 3

updated a model 5 months ago

liushiliushi/ConfTuner-Ministral

Text Generation • 8B • Updated Sep 20, 2025 • 5 • 3

New activity in liushiliushi/ConfTuner-Ministral 5 months ago

Improve model card: Add pipeline tag, library, description, and usage instructions

#1 opened 5 months ago by

nielsr

authored 3 papers 5 months ago

updated 2 models 5 months ago

liushiliushi/ConfTuner-LLaMA

8B • Updated Sep 19, 2025 • 226

liushiliushi/ConfTuner-Qwen

8B • Updated Sep 19, 2025 • 3 • 2

upvoted a paper 5 months ago

ConfTuner: Training Large Language Models to Express Their Confidence Verbally

Paper • 2508.18847 • Published Aug 26, 2025 • 2

updated a model 8 months ago

liushiliushi/Qwen2.5-7B-Instruct_gpt

8B • Updated Jun 18, 2025

published 2 models 8 months ago

liushiliushi/ConfTuner-LLaMA

8B • Updated Sep 19, 2025 • 226

liushiliushi/Qwen2.5-7B-Instruct_gpt

8B • Updated Jun 18, 2025

updated 2 models 8 months ago

liushiliushi/Llama-3.1-8B-Instruct_gpt

8B • Updated Jun 18, 2025

liushiliushi/llama-uncertainty

8B • Updated Jun 18, 2025 • 1

published a model 8 months ago

liushiliushi/Llama-3.1-8B-Instruct_gpt

8B • Updated Jun 18, 2025

Yibo Li

AI & ML interests

Recent Activity

Organizations

liushiliushi's activity

Improve model card: Add pipeline tag, library, description, and usage instructions