microsoft
/

VibeVoice-Realtime-0.5B

vibevoice_streaming

Streaming text input

Long-form speech generation

Model card Files Files and versions

Resources

View closed (7)

Working on DGX Spark (ARM64 + CUDA 13) - Setup Notes

#23 opened 23 minutes ago by

great model

#22 opened 13 days ago by

For those who need a simplified execution on NVIDIA GPU

#21 opened 14 days ago by

How can we access the acoustic encoder and semantics encoder?

#20 opened 18 days ago by

the stream input works great

#18 opened 23 days ago by

Local implementation (tested on macos m4)

#17 opened 23 days ago by

Terrible Quality!

#16 opened 25 days ago by

How can we get the position of text in the generated audio?

#12 opened 27 days ago by

finetune guide

#10 opened 28 days ago by

Music played at the start of the 0.5model

#9 opened 29 days ago by

Tried to use this to generate chinese, sounds very foreign...

#8 opened 29 days ago by

Safety or the joke there in

#7 opened 30 days ago by

Tom-Neverwinter

no example code ?

#6 opened about 1 month ago by

Update README.md

#5 opened about 1 month ago by

English only?

#2 opened about 1 month ago by

PeacePeacepPeace