Automatic Speech Recognition
NeMo
PyTorch
speech-recognition
cache-aware ASR
streaming-asr
multilingual
speech
audio
FastConformer
RNNT
Parakeet
ASR
NeMo
Eval Results (legacy)
Instructions to use nvidia/nemotron-3.5-asr-streaming-0.6b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/nemotron-3.5-asr-streaming-0.6b with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/nemotron-3.5-asr-streaming-0.6b") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
Speaker detection / diarisation?
#5
by smcleod - opened
Does this model have, or integrate well with any form of speaker detection?
Yes : https://huggingface.co/nvidia/diar_streaming_sortformer_4spk-v2
And an example with the same RNNT architecture: https://huggingface.co/nvidia/multitalker-parakeet-streaming-0.6b-v1
smcleod changed discussion status to closed