LeeBoonstra.dev
Building Your Own Conversational Voice AI Which Streams Audio From a Browser Microphone to a Server (Part II)

Building Your Own Conversational Voice AI ...

This is the second blog in the series:
A best practice for streaming audio from a browser microphone to Dialogflow & Google Cloud Speech To Text.

In this first blog, I have introduced all the conversational components, and I addressed why customers would integrate their own conversational AI compared to building for the Google Assistant.

Today, I will make a start by building a client-side web application which uses a HTML5 Microphone with WebRTC, streaming the audio bytes to a Node.js backend.

Lee Boonstra

Lee Boonstra

Lee Boonstra (they/them) has been a presence in the tech world since 2007, wearing many hats from software engineer to prompt engineer, web developer to technical trainer, and developer advocate.

With eight years of experience at Google under their belt, they now hold the role of SWE Tech Lead at the Google Cloud office of the CTO. Leading innovation projects, Lee aims to disrupt markets and foster collaboration globally. Their expertise in Conversational and Voice technology, alongside (Generative) AI, has led to recognition as a respected public keynote speaker and published author for O’Reilly and Apress. Lee eases tech headaches and celebrates those light bulb moments.

Lee Boonstra

Lee wrote a book for O’Reilly: Hands-on Sencha Touch 2 and lately: the Definitive Guide to Conversational AI with Dialogflow and Google Cloud for Apress. And the Google Prompt Engineering whitepaper.

My book My book My book
Building Your Own Conversational Voice AI With Dialogflow & Speech to Text in Web Apps. (Part I)

Building Your Own Conversational Voice AI ...

This is the first blog in the series:

A best practice for streaming audio from a browse...

Orchestrate Multiple Sub Chatbots From One Chat Interface by Using the Mega Agent Feature in Dialogflow

Orchestrate Multiple Sub Chatbots From One...

Dialogflow has the Mega Agent feature. (At the time of writing, this feature is still in beta ...

How to Set Up Your Conversational Analytics

Type: Keynote / Breakout Talk
Time: 20min - 50min

Work Smarter With Chat and Conversational AI

Work smarter with Chat and Conversational AI for Google Cloud On Air

Disclaimer: The opinions stated here are my own, not those of my company. - 2025 ® Lee Boonstra