Antigonish's Collections

SPEECH TO TEXT

updated Jan 25

  • Qwen3 ASR Demo 👀

    Running • Featured • 258

    Transcribe audio files to text with language detection

  • Whisper 📉

    Running on Zero • Featured • 2.74k

    Transcribe audio files and YouTube videos into text

  • openai/whisper-large-v3

    Automatic Speech Recognition • Updated Aug 12, 2024 • 5.32M • 5.48k

  • Qwen3 Omni Captioner Demo 🐠

    Running • 60

    Generate captions from audio

  • Qwen/Qwen3-Omni-30B-A3B-Captioner

    Any-to-Any • 32B • Updated Sep 22, 2025 • 5.51k • 207

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • Updated Nov 27, 2025 • 204k • 717

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • Updated Jan 23 • 169 • 345

  • Whisper Web 🎤

    Running • Featured • 1.24k

    Transcribe spoken audio into written text

  • microsoft/VibeVoice-ASR

    Automatic Speech Recognition • 9B • Updated Jan 27 • 614k • 915