xiaofang ru
Youhatang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
25 days ago
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand
Audio-Visual Information?
upvoted
a
paper
25 days ago
CaptionQA: Is Your Caption as Useful as the Image Itself?
upvoted
a
paper
over 1 year ago
Law of Vision Representation in MLLMs
Organizations
None yet