InSight-o3 - a m-Just Collection

InSight-o3

Updated Jan 15

Empowering Multimodal Foundation Models with Generalized Visual Search

m-Just/O3-Bench

Viewer • Updated 29 days ago • 345 • 383 • 16

Note: Can your AI agent truly "think with images"? Test it out on O3-Bench!

m-Just/InSight-o3-vS

Image-Text-to-Text • 8B • Updated 27 days ago • 3

Note: This is the vSearcher model introduced in our work.

m-Just/VisCoT_VStar_Collage

Viewer • Updated 27 days ago • 15.3k • 6.84k • 2

Note: In-loop RL training data for vSearcher.

m-Just/InfoVQA_RegionLocalization

Viewer • Updated 27 days ago • 10.2k • 31 • 1

Note: Out-of-loop RL training data for vSearcher.