AI & Machine Learning

Build Voice AI into your apps with our top 3 Speech API codelabs

Anu Srivastava

Senior Developer Programs Engineer

April 20, 2022

With voice-controlled touchpoints increasingly the norm in human-computer interactions, our Speech-to-Text (STT) API is a great option for developers looking to build voice into their applications. The API processes over 1 billion minutes of speech each month, enough to transcribe all Presidential inauguration speeches in U.S. history over 1 million times. Our customers use STT for everything from auto-generating captions, to generating insights to improve sales calls, to powering robots that help with childhood development.

With Speech-to-Text, you can accurately convert speech into text with several adaptations including:

Model Customization - customize for domain-specific terms
Speech Adaptation - provide context to influence results and formatting
Diarization - separate speakers on different channels or automatically detect when speakers change
Profanity Filtering - configure your request to detect profane words and edit them out of the transcript
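
As a rough illustration of how some of these options could be combined in one request, the sketch below builds a recognition configuration as a plain Python dictionary. The key names are meant to mirror fields of the v1 `RecognitionConfig` message (`speech_contexts`, `diarization_config`, `profanity_filter`); the helper name `build_recognition_config`, its parameters, and the sample phrases are hypothetical and only for illustration.

```python
from typing import Any, Dict, Iterable


def build_recognition_config(
    phrases: Iterable[str],
    min_speakers: int = 2,
    max_speakers: int = 2,
    filter_profanity: bool = True,
    language_code: str = "en-US",
) -> Dict[str, Any]:
    """Build a config dict (hypothetical helper, not part of any library).

    The keys are assumed to match v1 RecognitionConfig field names.
    """
    return {
        "language_code": language_code,
        # Speech adaptation: phrase hints that bias recognition
        # toward domain-specific terms.
        "speech_contexts": [{"phrases": list(phrases)}],
        # Diarization: ask the service to label which speaker said what.
        "diarization_config": {
            "enable_speaker_diarization": True,
            "min_speaker_count": min_speakers,
            "max_speaker_count": max_speakers,
        },
        # Profanity filtering: ask the service to filter profane words.
        "profanity_filter": filter_profanity,
    }


config = build_recognition_config(["Codelab", "Cloud Shell"])
print(config["speech_contexts"])
```

With the `google-cloud-speech` client library installed, a dictionary like this should be usable to construct the request configuration, for example via `speech.RecognitionConfig(**config)`, though it is worth checking the field names against the API reference for the version you use.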

Whether you’re using our pre-trained APIs for the first time or you’re a seasoned AI veteran, our codelabs are great resources for practicing and getting more comfortable with our pre-trained models. In addition to helping you brush up on your skills, the codelabs provide step-by-step instructions for setting up your GCP project and getting a $300 credit if you need it. They also walk you through everything else you need to get your sample up and running, such as authenticating, installing the client libraries, and using tooling like the Cloud Shell Editor.

That’s why we’ve rounded up some of our top Speech codelabs to help you get the most out of our Speech-to-Text API, and our Text-to-Speech API as well:

1. Using the Speech-to-Text API with Python lab and C# lab

Speech-to-Text is easy to get started with: all you need is the client library, an audio file, and a few lines of code to create a transcript.
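
A minimal sketch of what those few lines might look like in Python, assuming the `google-cloud-speech` client library is installed and application credentials are already configured. The function names `transcribe_gcs` and `extract_transcripts` are hypothetical helpers, and the bucket path in the usage note is a placeholder, not a real file.

```python
from typing import List


def extract_transcripts(response) -> List[str]:
    """Collect the top transcript from each result in a recognize response."""
    return [
        result.alternatives[0].transcript
        for result in response.results
        if result.alternatives  # skip results with no alternatives
    ]


def transcribe_gcs(gcs_uri: str, language_code: str = "en-US") -> List[str]:
    """Transcribe a short audio file stored in Cloud Storage."""
    # Imported here so this module still loads without the library installed.
    from google.cloud import speech

    client = speech.SpeechClient()
    audio = speech.RecognitionAudio(uri=gcs_uri)
    # Assumption: the file is FLAC or WAV, so the encoding and sample rate
    # can be read from the file header rather than set explicitly.
    config = speech.RecognitionConfig(language_code=language_code)
    response = client.recognize(config=config, audio=audio)
    return extract_transcripts(response)
```

Calling `transcribe_gcs("gs://your-bucket/your-audio.flac")` with a real audio file of your own would return a list of transcript strings, one per recognized segment.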
