VibeVoice
PulseAugur coverage of VibeVoice — every cluster mentioning VibeVoice across labs, papers, and developer communities, ranked by signal.
-
Google's Gemma 4 adds MTP for faster local inference, VibeVoice ported to C++, Ollama gets desktop layer
Google has released Gemma 4 with Multi-Token Prediction (MTP), a feature that allows the model to predict multiple tokens simultaneously, significantly speeding up local inference. Additionally, a C++ port of Microsoft'…
-
Microsoft open-sources VibeVoice for long-form speech AI
Microsoft has open-sourced VibeVoice, a suite of advanced voice AI models. The VibeVoice family includes both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) capabilities. A key innovation is the use of cont…
-
Microsoft releases VibeVoice, an open-source speech-to-text AI model
Microsoft has released VibeVoice, an open-source speech-to-text model with built-in speaker diarization. The MIT-licensed model is available for local deployment, meaning audio data does not need to be sent to an API. O…