EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding Paper • 2309.08816 • Published Sep 15, 2023
Exploring Open-Vocabulary Semantic Segmentation without Human Labels Paper • 2306.00450 • Published Jun 1, 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything Paper • 2312.00863 • Published Dec 1, 2023 • 4
SqueezeSAM: User friendly mobile interactive segmentation Paper • 2312.06736 • Published Dec 11, 2023
Enhanced Training of Query-Based Object Detection via Selective Query Recollection Paper • 2212.07593 • Published Dec 15, 2022
Feature Selective Anchor-Free Module for Single-Shot Object Detection Paper • 1903.00621 • Published Mar 2, 2019
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 27
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published 4 days ago • 29
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 27