5:16CVPR 2026 (Oral) - Understanding Task Transfer in Vision-Language ModelsKaran Uppal18 viewsView & Download
4:50[CVPR 2026] Training-free Detection of Generated Videos via Spatial-Temporal LikelihoodsOmer B.H25 viewsView & Download
6:12[CVPR 2026] GenMatter: Perceiving Physical Objects with Generative Matter ModelsEric Li1 viewsView & Download
6:20[CVPR 2026] iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive PerceptionSarthak Mehrotra14 viewsView & Download
8:02[CVPR 2026] Act2See: Emergent Active Visual Perception for Video ReasoningMartin Ma4 viewsView & Download
4:54[CVPR 2026] Aligning What Vision-Language Models See and Perceive with Adaptive Information Flowcx liu9 viewsView & Download
5:50[CVPR 2026] Scene-Centric Unsupervised Video Panoptic SegmentationVisual Inference64 viewsView & Download
5:00CVPR 2026 Main Paper DEVA: Fine-tuning Multimodal Large Language Models for Visual Perception TasksDebasmit Das39 viewsView & Download