3 12 10

Shawn Nie

shawn2333

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

allenai/Olmo-3-7B-Instruct

upvoted a collection about 2 months ago

Olmo 3

liked a model 3 months ago

microsoft/UserLM-8b

View all activity

Organizations

None yet

upvoted a collection about 2 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 9 items • Updated 13 days ago • 157

upvoted 2 papers 3 months ago

Flipping the Dialogue: Training and Evaluating User Language Models

Paper • 2510.06552 • Published Oct 8, 2025 • 1

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

upvoted 3 papers 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 58

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

Paper • 2509.02522 • Published Sep 2, 2025 • 25

upvoted a collection 6 months ago

SmolLM3 pretraining datasets

Collection

datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12, 2025 • 42

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

743

upvoted 3 papers 8 months ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published Mar 31, 2025 • 23

Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning

Paper • 2505.13866 • Published May 20, 2025 • 17

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20, 2025 • 24

upvoted a paper 9 months ago

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22, 2025 • 56

Shawn Nie

AI & ML interests

Recent Activity

Organizations

shawn2333's activity

SmolLM3: smol, multilingual, long-context reasoner