14 37 49

XuHao Hu

Foreshhh

AI & ML interests

NLP MM

Recent Activity

liked a dataset 3 days ago

agents-last-exam/agents-last-exam-data-archive

upvoted a paper 7 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

liked a dataset 11 days ago

wanlilll/WeaveBench

View all activity

Organizations

upvoted a paper 7 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 12 days ago • 101

upvoted a paper 14 days ago

PhoneWorld: Scaling Phone-Use Agent Environments

Paper • 2605.29486 • Published 23 days ago • 11

upvoted a paper 24 days ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Paper • 2605.26114 • Published 26 days ago • 64

upvoted a paper 26 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 29 days ago • 240

upvoted 2 papers about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Paper • 2605.12481 • Published May 12 • 28

upvoted 4 papers about 2 months ago

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 26

Synthetic Computers at Scale for Long-Horizon Productivity Simulation

Paper • 2604.28181 • Published Apr 30 • 20

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

Paper • 2604.24005 • Published Apr 27 • 9

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 86

upvoted 4 papers 2 months ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 122

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 327

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published Apr 13 • 143

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Paper • 2604.04215 • Published Apr 5 • 22

upvoted a collection 4 months ago

GUI-Owl-1.5

Collection

GUI-Owl-1.5 • 6 items • Updated May 14 • 9

upvoted a paper 4 months ago

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Paper • 2603.03198 • Published Mar 3 • 4

upvoted a collection 4 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.68k

upvoted a paper 4 months ago

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Paper • 2602.16855 • Published Feb 15 • 51

upvoted 2 papers 5 months ago

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Paper • 2601.19798 • Published Jan 27 • 44

Think3D: Thinking with Space for Spatial Reasoning

Paper • 2601.13029 • Published Jan 19 • 48

XuHao Hu

AI & ML interests

Recent Activity

Organizations

Foreshhh's activity