kaneziki

xuanxianyou

AI & ML interests

None yet

Recent Activity

liked a model 16 days ago

openbmb/AgentCPM-Report

liked a Space 2 months ago

nanotron/ultrascale-playbook

liked a Space 2 months ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

None yet

liked a model 16 days ago

openbmb/AgentCPM-Report

8B • Updated 11 days ago • 1.25k • 291

liked 2 Spaces 2 months ago

The Ultra-Scale Playbook

🌌

3.67k

The ultimate guide to training LLM on large GPU Clusters

The Smol Training Playbook

📚

2.95k

The secrets to building world-class LLMs

liked 2 datasets 3 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 204k • 2.64k

HuggingFaceFW/finepdfs

Viewer • Updated 28 days ago • 476M • 37.7k • 815

liked 2 models 5 months ago

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 1.07k • 787

openbmb/MiniCPM4.1-8B

Text Generation • 8B • Updated Oct 24, 2025 • 15.9k • 383

liked a model 6 months ago

openbmb/MiniCPM-V-4

Image-Text-to-Text • 4B • Updated Sep 15, 2025 • 58.3k • 462

liked a model 11 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 449k • • 13k

upvoted a paper 12 months ago

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information

Paper • 2502.02095 • Published Feb 4, 2025 • 4

upvoted a paper about 1 year ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 61

liked a model about 1 year ago

openbmb/MiniCPM3-4B

Text Generation • Updated Feb 27, 2025 • 10.9k • 417

upvoted a paper about 1 year ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 34

upvoted a paper over 1 year ago

LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models

Paper • 2410.09342 • Published Oct 12, 2024 • 39

liked a model over 1 year ago

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • 9B • Updated Jan 15, 2025 • 51.1k • 1.41k

liked 2 models almost 2 years ago

berkeley-nest/Starling-LM-7B-alpha

Text Generation • 7B • Updated Mar 20, 2024 • 1.21k • 557

R0k1e/UltraLink-LM

Text Generation • 13B • Updated Feb 22, 2024 • 12 • 5