Inference Providers
Active filters: VLM
Video-Text-to-Text
• 2B • Updated • 20.7k
• 522
Image-Text-to-Text
• 8B • Updated • 121k
• 43
numind/NuMarkdown-8B-Thinking
Image-to-Text
• 8B • Updated • 30.6k
• 474
nvidia/NVIDIA-Nemotron-Parse-v1.2
Image-Text-to-Text
• 0.9B • Updated • 208k
• 39
Efficient-Large-Model/VILA1.5-3b
Text Generation
• Updated • 3.54k
• 35
One-RL-to-See-Them-All/Orsta-7B
Image-Text-to-Text
• 8B • Updated • 23
• 13
One-RL-to-See-Them-All/Orsta-32B-0321
Image-Text-to-Text
• 33B • Updated • 14
• 3
One-RL-to-See-Them-All/Orsta-32B-0326
Image-Text-to-Text
• 33B • Updated • 15
• 9
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
• 9B • Updated • 1.18M
• 180
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-mcore
Image-Text-to-Text
• Updated • 3
Image-Text-to-Text
• 8B • Updated • 202
• 9
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD
Image-Text-to-Text
• 6B • Updated • 837
• 15
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16
Image-Text-to-Text
• 13B • Updated • 134k
• 84
nvidia/NVIDIA-Nemotron-Parse-v1.1
Image-Text-to-Text
• 1.0B • Updated • 431k
• 169
Video-Text-to-Text
• 2B • Updated • 6.19k
• 6
Subh775/Perception-moondream2
Image-Text-to-Text
• 2B • Updated • 129
• 1
Efficient-Large-Model/VILA-13b
Text Generation
• 13B • Updated • 21
• 20
Efficient-Large-Model/VILA-7b
Text Generation
• 7B • Updated • 108
• 27
Efficient-Large-Model/VILA-7b-4bit-awq
Text Generation
• Updated • 13
• 2
Efficient-Large-Model/VILA-13b-4bit-awq
Text Generation
• Updated • 15
• 2
Efficient-Large-Model/VILA-2.7b
Text Generation
• 3B • Updated • 81
• 15
TIGER-Lab/Mantis-bakllava-7b
Image-Text-to-Text
• 8B • Updated • 52
• 5
TIGER-Lab/Mantis-llava-7b
Image-Text-to-Text
• 7B • Updated • 36
• 16
Efficient-Large-Model/VILA1.5-13b
Text Generation
• Updated • 78
• 5
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation
• Updated • 440
• 37
Efficient-Large-Model/VILA1.5-40b
Text Generation
• Updated • 20
• 17
Efficient-Large-Model/VILA1.5-3b-s2
Text Generation
• Updated • 24
• 2
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation
• Updated • 24
• 7
Efficient-Large-Model/VILA1.5-3b-s2-AWQ
Text Generation
• Updated • 11
• 2
Efficient-Large-Model/Llama-3-VILA1.5-8b-AWQ
Text Generation
• Updated • 15
• 7