anzorq/kbd_speech
Viewer • Updated • 20.6k • 152 • 2
How to use anzorq/mms_finetune_kbd_murat with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="anzorq/mms_finetune_kbd_murat") # Load model directly
from transformers import AutoTokenizer, AutoModelForTextToWaveform
tokenizer = AutoTokenizer.from_pretrained("anzorq/mms_finetune_kbd_murat")
model = AutoModelForTextToWaveform.from_pretrained("anzorq/mms_finetune_kbd_murat")This repository contains a fine-tuned version of Facebook's MMS-TTS model, adapted for generating speech in the Kabardian language. The model is trained on a dataset of audio recordings by the speaker Sokhov Murat.
To use this model for text-to-speech generation, you can leverage the pipeline functionality from the Transformers library. Here's an example:
from transformers import pipeline
import scipy
model_id = "anzorq/mms_finetune_kbd_murat"
synthesiser = pipeline("text-to-speech", model_id, device=0) # add device=0 if you want to use a GPU
text = "дауэ ущыт?"
speech = synthesiser(text)
# Save the generated audio to a file
scipy.io.wavfile.write("finetuned_output.wav", rate=speech["sampling_rate"], data=speech["audio"][0])
This code will generate an audio file finetuned_output.wav containing the speech synthesis for the provided Kabardian text.
The original MMS-TTS model by Meta is licensed under the CC-BY-NC-4.0 License. This fine-tuned version inherits the same license.