VLMs - a rivasmig Collection

VLMs

updated Apr 19, 2025

Task Vectors are Cross-Modal

Paper • 2410.22330 • Published Oct 29, 2024 • 11

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3, 2025 • 39

DASH: Detection and Assessment of Systematic Hallucinations of VLMs

Paper • 2503.23573 • Published Mar 30, 2025 • 12

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10, 2025 • 142

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 208

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79