Automatic Speech Recognition
NeMo
PyTorch
English
speech
audio
Transducer
TDT
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Eval Results (legacy)
Eval Results
Instructions to use nvidia/parakeet-tdt-1.1b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/parakeet-tdt-1.1b with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/parakeet-tdt-1.1b") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
Word timing
#2
by Amin24 - opened
How do I get word timing?
Currently we don;t support word timing with tdt models, we are working on it. However for word stamps you could use NeMo NFA.
smajumdar94 changed discussion status to closed