JPShi

SJP2022

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 1 day ago

FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance

Paper • 2603.12146 • Published 6 days ago • 4

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published 9 days ago • 36

upvoted a paper 2 days ago

WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing

Paper • 2603.11593 • Published 6 days ago • 24

upvoted a paper 6 days ago

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

Paper • 2603.06449 • Published 12 days ago • 6

liked a dataset 22 days ago

mxxxxxxxxxxxxxxxxx/ChronusAV

Updated 29 days ago • 198 • 1

liked a dataset about 1 month ago

CinematicT2vData/raw_videos_batched

Viewer • Updated Aug 17, 2025 • 9.26k • 42 • 1

submitted a paper to Daily Papers 2 months ago

VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding

Paper • 2601.07290 • Published Jan 12 • 7

authored a paper 2 months ago

VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding

Paper • 2601.07290 • Published Jan 12 • 7

updated a model 2 months ago

JPShi/VideoLoom-8B

8B • Updated Jan 13 • 41

upvoted a paper 2 months ago

VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding

Paper • 2601.07290 • Published Jan 12 • 7

published 2 models 2 months ago

JPShi/VideoLoom-8B

8B • Updated Jan 13 • 41

JPShi/VideoLoom-4B

4B • Updated Jan 13 • 1

updated a collection 2 months ago

VideoLoom

Collection

Model Zoo for VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding • 3 items • Updated Jan 13 • 1

updated a model 2 months ago

JPShi/VideoLoom-4B

4B • Updated Jan 13 • 1

updated a collection 2 months ago

VideoLoom

Collection

Model Zoo for VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding • 3 items • Updated Jan 13 • 1

upvoted a collection 2 months ago

VideoLoom

Collection

Model Zoo for VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding • 3 items • Updated Jan 13 • 1

liked a dataset 5 months ago

tomg-group-umd/cinepile

Viewer • Updated Oct 23, 2024 • 608k • 326 • 91

liked a dataset 6 months ago

Chat-UniVi/Chat-UniVi-Instruct

Preview • Updated May 27, 2024 • 345 • 8

updated a dataset 6 months ago

JPShi/Pascal

Viewer • Updated Sep 11, 2025 • 1 • 14
