Xiaojie Jin

xjjin
https://scholar.google.com/citations?view_op=list_works&hl=en&hl=en&user=OEZ816YAAAAJ
  • XiaojieJin
  • xiaojie-jin-b10513121

AI & ML interests

Multimodal Reasoning & Decision, GenAI, Computer Vision

Organizations

None yet

Recent Activity

upvoted a paper 25 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published 27 days ago • 71
upvoted a paper 9 months ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21, 2025 • 23
upvoted a paper 10 months ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published Jan 16, 2025 • 27
upvoted 2 papers over 1 year ago

Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

Paper • 2406.08085 • Published Jun 12, 2024 • 17

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12, 2024 • 30