Siteng Huang
huangsiteng
AI & ML interests
vision-language models
Recent Activity
upvoted a paper about 5 hours ago
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning
authored a paper about 1 month ago
Unicorn: Text-Only Data Synthesis for Vision Language Model Training
authored a paper about 1 month ago
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation