Zhi Zheng's picture

Zhi Zheng

zz1358m

·

https://zz1358m.github.io/zhizheng.github.io/

AI & ML interests

LLM reasoning, Trustworthy LLM, LLM application, Neural combinatorial optimization.

Recent Activity

upvoted a paper 12 days ago

Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR

upvoted a paper 13 days ago

Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents

upvoted a paper 15 days ago

Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

View all activity

Organizations

commented a paper 8 months ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 18 •