Models: 301

Active filters: VLM

nvidia/NVIDIA-Nemotron-Parse-v1.2

Image-Text-to-Text • 0.9B • Updated May 5 • 231k • 65

omlab/VLX-Seek-1.5-10B

Image-Text-to-Text • 10B • Updated 5 days ago • 56 • 4

lumimate/PhoneUIAnchor-829M

Image-Text-to-Text • 0.8B • Updated 16 days ago • 105 • 12

xlangai/OpenCUA-7B

Image-Text-to-Text • 8B • Updated Feb 1 • 6.93k • 33

xlangai/OpenCUA-32B

Image-Text-to-Text • 33B • Updated Jan 24 • 321 • 29

reece124/OpenCUA-7B-converted

Image-Text-to-Text • 8B • Updated Aug 19, 2025 • 21 • 1

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated Dec 2, 2025 • 177k • 88

mradermacher/Garnet-OCR-3B-0422-i1-GGUF

Image-to-Text • 3B • Updated 21 days ago • 20.9k • 3

HongxinLi/GoClick-Large

Image-Text-to-Text • 0.8B • Updated May 10 • 158 • 3

NemoStation/Marlin-2B

Video-Text-to-Text • 2B • Updated May 30 • 6.47k • 570

prasannaJagadesh/marlin-2B-GPTQ-4BITS

Video-Text-to-Text • 2B • Updated 30 days ago • 249 • 3

Efficient-Large-Model/VILA-13b

Text Generation • 13B • Updated Mar 4, 2024 • 8 • 20

Efficient-Large-Model/VILA-7b

Text Generation • 7B • Updated Mar 4, 2024 • 258 • 27

Efficient-Large-Model/VILA-7b-4bit-awq

Text Generation • Updated Mar 4, 2024 • 5 • 2

Efficient-Large-Model/VILA-13b-4bit-awq

Text Generation • Updated Mar 4, 2024 • 4 • 2

Efficient-Large-Model/VILA-2.7b

Text Generation • 3B • Updated Mar 4, 2024 • 32 • 15

TIGER-Lab/Mantis-bakllava-7b

Image-Text-to-Text • 8B • Updated May 18, 2024 • 16 • 5

TIGER-Lab/Mantis-llava-7b

Image-Text-to-Text • 7B • Updated May 18, 2024 • 18 • 16

Efficient-Large-Model/VILA1.5-3b

Text Generation • Updated Jul 18, 2024 • 24.5k • 35

Efficient-Large-Model/VILA1.5-13b

Text Generation • Updated Jul 18, 2024 • 208 • 5

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 240 • 37

Efficient-Large-Model/VILA1.5-40b

Text Generation • Updated Jul 18, 2024 • 69 • 17

Efficient-Large-Model/VILA1.5-3b-s2

Text Generation • Updated Jul 18, 2024 • 161 • 2

Efficient-Large-Model/VILA1.5-3b-AWQ

Text Generation • Updated Jul 18, 2024 • 16 • 7

Efficient-Large-Model/VILA1.5-3b-s2-AWQ

Text Generation • Updated Jul 18, 2024 • 4 • 2

Efficient-Large-Model/Llama-3-VILA1.5-8b-AWQ

Text Generation • Updated Jul 18, 2024 • 8 • 7

Efficient-Large-Model/VILA1.5-13b-AWQ

Text Generation • Updated Jul 18, 2024 • 8 • 3

Efficient-Large-Model/VILA1.5-40b-AWQ

Text Generation • Updated Jul 18, 2024 • 5 • 3

RussRobin/SpatialBot-3B-LoRA

Visual Question Answering • Updated Sep 5, 2024 • 4

RussRobin/SpatialBot-3B

Visual Question Answering • 3B • Updated Sep 10, 2024 • 51 • 20