Chatbots Jan 4th 2021

Getting Audio Data From Text (Text to Speech) and Playing It in Your Browser (Part IV)

This is the fourth blog in the series:
A best practice for streaming audio from a browser microphone to Dialogflow & Google Cloud Speech To Text.

In case you haven’t read the other blogs, I recommend going back to them first:

Blog 1: Introduction to the GCP conversational AI components, and integrating your own voice AI in a web app.
Blog 2: Building a client-side web application which streams audio from a browser microphone to a server.
Blog 3: Building a web server which receives a browser microphone stream and uses Dialogflow or the Speech to Text API for retrieving text results.

In this next blog of the series, I will take the text (or the Dialogflow QueryResult text data) that’s currently available on the server side, pass it to the Text to Speech API to synthesize it, and return the audio bytes to the client app so they can be played in the browser. The browser has to play the audio automatically, without the user pressing a play button.
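To make that flow concrete, here is a minimal TypeScript sketch of the three steps: building a synthesize request, decoding the audio that comes back, and playing it. It is not the code from this series. The function names (buildSynthesizeRequest, decodeAudioContent, playAudioBytes) are hypothetical, and it assumes the audio travels as a base64 string in an audioContent field, which is how the Text to Speech REST endpoint (text:synthesize) returns it. Authentication and the transport between server and browser are left out.

```typescript
// Shape of a Text to Speech REST request body (text:synthesize).
interface SynthesizeRequest {
  input: { text: string };
  voice: { languageCode: string; ssmlGender: "NEUTRAL" | "MALE" | "FEMALE" };
  audioConfig: { audioEncoding: "MP3" | "LINEAR16" | "OGG_OPUS" };
}

// Hypothetical helper: wraps plain text (for example the Dialogflow
// fulfillment text) in a synthesize request. Defaults are assumptions.
function buildSynthesizeRequest(
  text: string,
  languageCode: string = "en-US"
): SynthesizeRequest {
  return {
    input: { text: text },
    voice: { languageCode: languageCode, ssmlGender: "NEUTRAL" },
    audioConfig: { audioEncoding: "MP3" },
  };
}

// Hypothetical helper: turns the base64 audioContent string into raw bytes.
function decodeAudioContent(base64: string): Uint8Array {
  const binary = atob(base64);
  const bytes = new Uint8Array(binary.length);
  for (let i = 0; i < binary.length; i++) {
    bytes[i] = binary.charCodeAt(i);
  }
  return bytes;
}

// Browser only: decodes the audio bytes with the Web Audio API and starts
// playback immediately. Not called here, because AudioContext does not
// exist outside a browser.
function playAudioBytes(bytes: Uint8Array): Promise<void> {
  const context = new AudioContext();
  // Copy into a fresh ArrayBuffer, since decodeAudioData takes ownership of it.
  const copy = new ArrayBuffer(bytes.byteLength);
  new Uint8Array(copy).set(bytes);
  return context.decodeAudioData(copy).then((audioBuffer) => {
    const source = context.createBufferSource();
    source.buffer = audioBuffer;
    source.connect(context.destination);
    source.start();
  });
}
```

On the client, the received audioContent would go through decodeAudioContent and then playAudioBytes. Browser autoplay policies can leave an AudioContext suspended until the user has interacted with the page, so automatic playback may only work after a gesture such as the click that starts the microphone.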

Lee Boonstra

Lee Boonstra (they/them) has been a presence in the tech world since 2007, wearing many hats from AI software engineer to prompt engineer, web developer to technical trainer, and developer advocate.

With eight years of experience at Google (and AI) under their belt, they now hold the role of AI Software Engineer (SWE) at the Google Cloud office of the CTO. Leading innovation projects, Lee aims to disrupt markets and foster collaboration globally. Their expertise in Conversational and Voice technology, alongside (Generative) AI, has led to recognition as a respected public keynote speaker and published author for O’Reilly and Apress. Lee eases tech headaches and celebrates those light bulb moments.

Lee has written the Google/Kaggle Prompt Engineering whitepaper and two books: "Hands-on Sencha Touch 2" (O'Reilly) and their more recent work, "The Definitive Guide to Conversational AI with Dialogflow and Google Cloud" (Apress).