Active filters: dapo
yannabadie/sage-topology-policy-v2
Reinforcement Learning
• Updated • 1
Text Generation
• Updated • 4
• 3
mradermacher/DAPO-Coding-Qwen2.5-1.5B-Instruct-GGUF
2B • Updated • 179
• 2
Text Generation
• 2B • Updated • 11
• • 1
Text Generation
• 2B • Updated • 5
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 6
Text Generation
• 8B • Updated • 2
• • 1
mradermacher/DAPO-No-DS-GGUF
2B • Updated • 62
mradermacher/MMR-DAPO-8B-GGUF
8B • Updated • 64
• 1
mradermacher/MMR-DAPO-7B-GGUF
8B • Updated • 62
mradermacher/DAPO-No-DS-8B-GGUF
8B • Updated • 51
mradermacher/DAPO-No-DS-7B-GGUF
8B • Updated • 62
Text Generation
• 2B • Updated • 11
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 6
• • 1
mradermacher/DAPO-7B-GGUF
8B • Updated • 82
• 1
mradermacher/DAPO-8B-GGUF
8B • Updated • 54
srallabandi0225/inframind-0.5b-grpo
Text Generation
• 0.5B • Updated • 4
• • 7
srallabandi0225/inframind-0.5b-dapo
Text Generation
• 0.5B • Updated • 4
AmirhoseinGH/Gnosis-Qwen3-1.7B-Hybrid
Text Classification
• 2B • Updated • 5
AmirhoseinGH/Gnosis-Qwen3-4B-Instruct-2507
Text Classification
• 4B • Updated • 24
AmirhoseinGH/Gnosis-Qwen3-4B-Thinking-2507
Text Classification
• 4B • Updated • 3
AmirhoseinGH/Gnosis-Qwen3-8B
Text Classification
• 8B • Updated • 3
mradermacher/inframind-0.5b-dapo-GGUF
Reinforcement Learning
• 0.5B • Updated • 57
mradermacher/inframind-0.5b-grpo-GGUF
Reinforcement Learning
• 0.5B • Updated • 39
kangdawei/MMR-Sigmoid-DAPO
Text Generation
• 2B • Updated • 3
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation
• 8B • Updated • 10
kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation
• 8B • Updated • 174