Zhongrui Wang

zhongruiwang

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Self-Evaluation Unlocks Any-Step Text-to-Image Generation

upvoted a paper 13 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

upvoted a paper about 1 month ago

MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory

View all activity

Organizations

upvoted a paper 7 days ago

Self-Evaluation Unlocks Any-Step Text-to-Image Generation

Paper • 2512.22374 • Published 12 days ago • 16

upvoted a paper 13 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 15 days ago • 49

upvoted a paper about 1 month ago

MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory

Paper • 2511.22609 • Published Nov 27, 2025 • 48

upvoted a paper about 2 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 42

upvoted a paper 4 months ago

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Paper • 2509.12815 • Published Sep 16, 2025 • 40

upvoted 3 papers 6 months ago

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11, 2025 • 61

SeqTex: Generate Mesh Textures in Video Sequence

Paper • 2507.04285 • Published Jul 6, 2025 • 9

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

upvoted a paper 10 months ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27, 2025 • 30

upvoted a paper about 1 year ago

TEXGen: a Generative Diffusion Model for Mesh Textures

Paper • 2411.14740 • Published Nov 22, 2024 • 17

upvoted 2 papers over 1 year ago

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4, 2024 • 41

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22, 2024 • 45

upvoted 2 papers about 2 years ago

Text-to-3D with classifier score distillation

Paper • 2310.19415 • Published Oct 30, 2023 • 5

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 29

upvoted 2 papers over 2 years ago

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Paper • 2310.00426 • Published Sep 30, 2023 • 61

Conditional Diffusion Distillation

Paper • 2310.01407 • Published Oct 2, 2023 • 20

Zhongrui Wang

AI & ML interests

Recent Activity

Organizations

zhongruiwang's activity