Submitted by
Yifan Zhang
AI & ML interests
reasoning
Recent Activity
Papers
Residual Stream Duality in Modern Transformer Architectures
FlashSampling: Fast and Memory-Efficient Exact Sampling