End-to-End Joint ASR and Speaker Role Diarization with Child-Adult Interactions Paper • 2601.17640 • Published 7 days ago • 5
Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis Paper • 2601.14417 • Published 11 days ago • 5
VoxCog: Towards End-to-End Multilingual Cognitive Impairment Classification through Dialectal Knowledge Paper • 2601.07999 • Published 19 days ago • 1
tiantiaf/whisper-large-v3-msp-podcast-emotion-dim Audio Classification • 2B • Updated Aug 10, 2025 • 3.31k • 1
tiantiaf/whisper-large-v3-msp-podcast-emotion Audio Classification • 2B • Updated Aug 10, 2025 • 3.15k • 5
Vox-Profile Collection This collection includes the implementation of models described in the Vox-Profile benchmark (https://arxiv.org/pdf/2505.14648). • 14 items • Updated Dec 2, 2025 • 2
tiantiaf/voxlect-english-dialect-whisper-small Audio Classification • 90.4M • Updated Aug 10, 2025 • 2 • 2