Google Builds Open-Source Voice Kit For AI Devices

A look at projects by Mozilla and Google to make large, open-source datasets containing crowdsourced voice samples available to developers

Google: At Google, we’re often asked how to get started using deep learning for speech and other audio recognition problems, like detecting keywords or commands. And while there are some great open source speech recognition systems like Kaldi that can use neural networks as a component, their sophistication makes them tough to use as a guide to simpler tasks. Perhaps more importantly, there aren’t many free and openly available datasets that are ready to be used for a beginner’s tutorial (many require preprocessing before a neural network model can be built on them) or that are well suited to simple keyword detection.

To solve these problems, the TensorFlow and AIY teams have created the Speech Commands Dataset, and used it to add training and inference sample code to TensorFlow. The dataset has 65,000 one-second-long utterances of 30 short words, by thousands of different people, contributed by members of the public through the AIY website. It’s released under a Creative Commons BY 4.0 license, and will continue to grow in future releases as more contributions are received. The dataset is designed to let you build basic but useful voice interfaces for applications, with common words like “Yes”, “No”, digits, and directions included. The infrastructure we used to create the data has been open sourced too, and we hope to see it used by the wider community to create their own versions, especially to cover underserved languages and applications.
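For readers who want a feel for what working with a dataset of this shape involves, here is a minimal Python sketch. It is not the TensorFlow sample code mentioned above. It assumes the clips are stored as one folder per word (`<root>/<word>/<clip>.wav`) and sampled at 16,000 samples per second; neither detail is stated in the article, and the helper names `fix_length`, `index_clips`, and `label_map` are hypothetical.

```python
from pathlib import Path

SAMPLE_RATE = 16_000  # assumption: samples per second, not stated in the article
CLIP_SECONDS = 1      # the article describes one-second utterances


def fix_length(samples, target_len=SAMPLE_RATE * CLIP_SECONDS):
    """Trim or zero-pad a sequence of audio samples to exactly target_len."""
    samples = list(samples)[:target_len]
    return samples + [0] * (target_len - len(samples))


def index_clips(root):
    """Return sorted (path, label) pairs.

    Assumes a <root>/<word>/<clip>.wav layout, where the folder name
    is the spoken word.
    """
    root = Path(root)
    return sorted((str(p), p.parent.name) for p in root.glob("*/*.wav"))


def label_map(pairs):
    """Map each distinct word to an integer class id, in alphabetical order."""
    words = sorted({label for _, label in pairs})
    return {word: i for i, word in enumerate(words)}
```

A keyword detector would typically read each indexed file, pass its samples through something like `fix_length` so every example has the same shape, and train a classifier against the integer ids from `label_map`.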

Image source: theusbport.com
