bosonai/higgs-audio-v3-8b-stt-v2 Automatic Speech Recognition • 9B • Updated 30 days ago • 1.84k • 13
mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition • 4B • Updated Mar 11 • 1.57M • 886
Running on Zero Agents Featured 2.88k F5-TTS 🗣 2.88k F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders Paper • 2602.05027 • Published Feb 4 • 63
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published Jan 31 • 325