An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex
- MOSS Audio 8B Thinking
  🐢 26 • Generate answers to audio or video prompts
- OpenMOSS-Team/MOSS-Audio-4B-Instruct
  Audio-Text-to-Text • 5B • Updated • 3.3k • 72
- OpenMOSS-Team/MOSS-Audio-4B-Thinking
  Audio-Text-to-Text • 5B • Updated • 669 • 31
- OpenMOSS-Team/MOSS-Audio-8B-Instruct
  Audio-Text-to-Text • 9B • Updated • 1.7k • 43