Stella Li's picture

2 4

Stella Li PRO

stellalisy

·

https://stellalisy.com/

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

stellalisy/DeepScaleR-qwen3_1.7b_gs_lr1e-5_ep2_step218

published a model 5 days ago

stellalisy/DeepScaleR-qwen3_1.7b_gs_lr1e-5_ep2_step218

updated a dataset 11 days ago

stellalisy/cognitive_foundations

View all activity

Organizations

Collections 2

Papers 10

arxiv:2510.00177

arxiv:2507.13541

arxiv:2506.10947

arxiv:2505.03054

models 31

stellalisy/DeepScaleR-qwen3_1.7b_gs_lr1e-5_ep2_step218

2B • Updated 5 days ago • 11

stellalisy/system_select_dpo-3b-lr1e-5-b0.1

Text Generation • 3B • Updated Aug 6 • 8

stellalisy/system_select_dpo-3b-lr1e-6-b0.1

Text Generation • 3B • Updated Aug 6 • 11

stellalisy/system_select_dpo-3b-lr1e-5-b0.0

Text Generation • 3B • Updated Aug 6 • 7

stellalisy/system_select_dpo-1b-lr1e-6-b0.1

Text Generation • 1B • Updated Aug 6 • 8

stellalisy/system_select_dpo-1b-lr1e-5-b0.1

Text Generation • 1B • Updated Aug 6 • 4

stellalisy/system_select_dpo-1b-lr1e-6-b0.0

Text Generation • 1B • Updated Aug 6 • 6

stellalisy/system_select_dpo-1b-lr1e-5-b0.0

Text Generation • 1B • Updated Aug 6 • 7

stellalisy/rethink_rlvr_reproduce-incorrect-qwen2.5_math_7b-lr5e-7-kl0.00-step150

Text Generation • 8B • Updated Jun 13 • 9

stellalisy/rethink_rlvr_reproduce-incorrect-qwen2.5_math_7b-lr5e-7-kl0.00-step100

Text Generation • 8B • Updated Jun 13 • 9

datasets 23

stellalisy/cognitive_foundations

Preview • Updated 11 days ago • 6

stellalisy/Dolci-RLZero-Math-7B_random

Viewer • Updated Nov 19 • 13.3k • 11

stellalisy/PrefPalette

Viewer • Updated Oct 29 • 2.01M • 6

stellalisy/HorizonPref_natural_0827

Viewer • Updated Oct 15 • 1.75k • 2

stellalisy/DAPO-Math-14k-Processed-RLVR_random

Viewer • Updated Sep 14 • 14.1k • 12

stellalisy/rlvr_orz_math_57k_collected_random

Viewer • Updated Aug 26 • 56.9k • 7

stellalisy/personalized_simpleqa

Preview • Updated Aug 26 • 1

stellalisy/personalized_socialiqa

Preview • Updated Aug 26 • 1

stellalisy/personalized_scienceqa

Preview • Updated Aug 26 • 1

stellalisy/personalized_mmlu

Preview • Updated Aug 26 • 1

View 23 datasets