Yiping Wang
ypwang61
5 followers · 11 following
https://ypwang61.github.io/
AI & ML interests
machine learning
Recent Activity
liked a dataset 10 days ago: siegelz/core-bench
upvoted a paper 24 days ago: Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted a paper about 1 month ago: DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Organizations
None yet
ypwang61's models (26)
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1_pi2 • 2B • Updated Sep 2 • 6
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1209 • 2B • Updated Sep 2 • 6
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi2 • 2B • Updated Sep 2 • 7
ypwang61/One-Shot-RLVR-Llama3.2-3B-Instruct-pi1_pi13 • 4B • Updated Sep 2 • 7
ypwang61/One-Shot-RLVR-Llama3.2-3B-Instruct-1.2k-dsr-sub • 4B • Updated Sep 2 • 6
ypwang61/One-Shot-RLVR-Llama3.2-3B-Instruct-pi1 • 4B • Updated Sep 2 • 6
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-1.2k-dsr-sub • Text Generation • 2B • Updated Aug 27 • 14
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-16-shot • Text Generation • 2B • Updated Aug 27 • 14
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-4-shot • Text Generation • 2B • Updated Aug 27 • 14
ypwang61/One-Shot-RLVR-R1-Distill-1.5B-pi1 • Text Generation • 2B • Updated Aug 27 • 9
ypwang61/One-Shot-RLVR-Qwen2.5-7B-1.2k-dsr-sub • Text Generation • 8B • Updated Aug 27 • 7
ypwang61/One-Shot-RLVR-Qwen2.5-7B-pi1 • Text Generation • 8B • Updated Aug 27 • 13
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-1.2k-dsr-sub • Text Generation • 8B • Updated Aug 27 • 8
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-7.5k-MATH • Text Generation • 2B • Updated Aug 27 • 291
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-1.2k-dsr-sub • Text Generation • 2B • Updated Aug 27 • 10
ypwang61/intermediate-qwen25-7b-step300 • 8B • Updated Jun 12 • 6
ypwang61/sharp_s180 • 8B • Updated Jun 3 • 8
ypwang61/sharp_s1560 • 2B • Updated Jun 3 • 7
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1_pi13 • Text Generation • 2B • Updated May 19 • 8
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi1 • Text Generation • 2B • Updated May 19 • 73
ypwang61/One-Shot-RLVR-Qwen2.5-Math-1.5B-pi13 • Text Generation • 2B • Updated May 19 • 10
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1_pi13 • Text Generation • 8B • Updated May 19 • 16
ypwang61/One-Shot-RLVR-Qwen2.5-Math-7B-pi1 • Text Generation • 8B • Updated May 19 • 11
ypwang61/sharp_warmup • 7B • Updated Dec 23, 2024 • 8 • 1
ypwang61/xnnpack_test • Updated Sep 22, 2024
ypwang61/negCLIPLoss_NormSim • Updated Jun 22, 2024 • 1