tarteel-ai/everyayah
Viewer β’ Updated β’ 127k β’ 1.49k β’ 34
This model is a fine-tuned version of OpenAIβs Whisper Large V3 model, adapted specifically for Arabic Quranic speech recognition using the Tarteel AI Everyayah Dataset. It is optimized to transcribe Quranic recitations with improved accuracy on this specialized dataset.
use_cache=True being incompatible with gradient checkpointing, which was automatically handled by disabling use_cache. from transformers import WhisperProcessor, WhisperForConditionalGeneration
import torch
import librosa
model_name = "ijyad/whisper-large-v3-Tarteel"
processor = WhisperProcessor.from_pretrained(model_name)
model = WhisperForConditionalGeneration.from_pretrained(model_name)
# Load audio (replace with your audio file)
audio, rate = librosa.load("path_to_quran_audio.wav", sr=16000)
input_features = processor(audio, sampling_rate=rate, return_tensors="pt").input_features
# Generate transcription
predicted_ids = model.generate(input_features)
transcription = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
print(transcription)
Base model
openai/whisper-large-v3