Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation Paper • 2512.21734 • Published 9 days ago • 3
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 5 days ago • 62
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing Paper • 2512.17909 • Published 15 days ago • 36
Scaling Behavior of Discrete Diffusion Language Models Paper • 2512.10858 • Published 23 days ago • 6
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis Paper • 2411.19509 • Published Nov 29, 2024 • 3
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion Paper • 2512.04926 • Published about 1 month ago • 41
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published Dec 3, 2025 • 74
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published about 1 month ago • 167
view article Article SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution Dec 1, 2025 • 3
Evaluating In Silico Creativity: An Expert Review of AI Chess Compositions Paper • 2510.23772 • Published Oct 27, 2025 • 2
Prompt-to-Prompt Image Editing with Cross Attention Control Paper • 2208.01626 • Published Aug 2, 2022 • 3
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale Paper • 2407.05282 • Published Jul 7, 2024 • 15
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech Paper • 2106.06103 • Published Jun 11, 2021 • 4