WhissleAI/indicvoices_pa_tagged_transcripts
Viewer • Updated • 5.76k • 213
How to use WhissleAI/stt_hi_conformer_ctc_large_with_meta with NeMo:
# tag did not correspond to a valid NeMo domain.
This speech tagger performs transcription for 5 Indian Languages: Hindi, Punjabi, Marathi, Bengali and Gujarati. It annotates key entities, predicts speaker age, dialect and intent.
import nemo.collections.asr as nemo_asr
# Load model
asr_model = nemo_asr.models.EncDecCTCModel.from_pretrained('WhissleAI/speech-tagger_indo-aryan_ctc_meta')
# Transcribe audio
transcription = asr_model.transcribe(['path/to/audio.wav'])
print(transcription[0])
Please cite AI4Bharat when using this model: https://indicvoices.ai4bharat.org/
Base model
parthiv11/stt_hi_conformer_ctc_large_v2