arxiv:2602.10693
floyed shen
floyed
AI & ML interests
None yet
Recent Activity
submitted a paper 25 days ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information upvoted a paper about 1 month ago
From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation