5 523 1

Literate Goggles

literate-goggles

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation

upvoted a paper 4 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

upvoted a paper 12 days ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

View all activity

Organizations

None yet

upvoted 2 papers 4 days ago

Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation

Paper • 2512.21734 • Published 9 days ago • 3

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 5 days ago • 62

upvoted a paper 12 days ago

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Paper • 2512.17909 • Published 15 days ago • 36

upvoted a paper 19 days ago

Scaling Behavior of Discrete Diffusion Language Models

Paper • 2512.10858 • Published 23 days ago • 6

upvoted 2 papers 20 days ago

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Paper • 2411.19509 • Published Nov 29, 2024 • 3

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published about 1 month ago • 41

upvoted a paper 25 days ago

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published Dec 3, 2025 • 74

upvoted a paper 29 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published about 1 month ago • 167

upvoted 3 articles about 1 month ago

Article

Continuous batching from first principles

Nov 25, 2025

•

291

Article

Diffusers welcomes FLUX-2

Nov 25, 2025

•

167

Article

SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution

Dec 1, 2025

•

upvoted 7 papers about 2 months ago

upvoted 2 papers 2 months ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27, 2025 • 58

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Paper • 2106.06103 • Published Jun 11, 2021 • 4

Literate Goggles

AI & ML interests

Recent Activity

Organizations

literate-goggles's activity

Continuous batching from first principles

Diffusers welcomes FLUX-2

SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution