LeeBoonstra.dev
Getting Audio Data From Text (Text to Speech) and Play It in Your Browser. (Part IV)

Blog: Learn Machine Learning from a Google AI Engineer

Getting Audio Data From Text (Text to Speech) and Play It in Your Browser. (Part IV)

This is the fourth blog in the series:
A best practice for streaming audio from a browser microphone to Dialogflow & Google Cloud Speech To Text.

In case you haven’t read the other blogs, I recommend to browse back to these blogs:

In the next blog of this series, I will take text (or Dialogflow QueryResult text data) that’s currently available on the server-side, pass it to the Text to Speech API (to synthesize the text) and return the audio bytes back to the client app, to play it in the browser. It has to play the audio bytes automatically.

Building Your Own Conversational Voice AI Which Streams Audio From a Browser Microphone to a Server (Part III)

This is the third blog in the series:A best practice for streaming audio from a browser microphon...
Building Your Own Conversational Voice AI Which Streams Audio From a Browser Microphone to a Server (Part II)

This is the second blog in the series:A best practice for streaming audio from a browser micropho...
Building Your Own Conversational Voice AI With Dialogflow & Speech to Text in Web Apps. (Part I)

This is the first blog in the series: A best practice for streaming audio from a browser micropho...
Orchestrate Multiple Sub Chatbots From One Chat Interface by Using the Mega Agent Feature in Dialogflow

Dialogflow has the Mega Agent feature. (At the time of writing, this feature is still in beta but...
Disclaimer: The opinions stated here are my own, not those of my company. - 2025 ® Lee Boonstra