NVIDIA GTC 2019: SoundHound releases hybrid voice AI and natural language understanding system for cars

SoundHound Inc., provider of voice enabled AI and conversational intelligence technologies, unveiled on Monday its large vocabulary, hybrid voice and natural language understanding interface for in-vehicle infotainment systems at the NVIDIA GPU Technology Conference (GTC) 2019. The event marks the first time the technology has been shown to the public, and highlights the NVIDIA DRIVE ecosystem collaboration between SoundHound and NVIDIA.

SoundHound’s Houndify technology is already being utilized by manufacturers including Mercedes-Benz, Groupe PSA, Hyundai, Honda, and others.

Leveraging the patented Speech-to-Meaning and Deep Meaning Understanding technologies from SoundHound’s Houndify Voice AI platform, running on NVIDIA DRIVE IX, the solution enables real-time responses to voice queries in vehicles, even without Internet connectivity. This is achieved with high speed and accuracy through a hybrid speech recognition system that processes voice requests both in the cloud and locally on the embedded system (for when an internet connection is not available) to return fast responses.

The embedded system also enables drivers to control their car’s functions when a connection to the cloud is unavailable including the car’s climate control, window controls, radio, navigation, and more.

NVIDIA DRIVE AGX integrates the high-performance, energy-efficient compute of the NVIDIA Xavier system-on-a-chip (SoC) and full stack AV software to monitor surroundings and the driver, localize to an HD map, and plan a safe path forward.

Within DRIVE software, NVIDIA DRIVE IX is a framework for the full cockpit experience. It combines the system, tools, and algorithms to enhance the driver’s situational awareness, assist in driving functions and provide intelligent interactions between the vehicle and its occupants. This is ideal for integrating voice technology that Houndify can provide, enabling the vehicle to seamlessly respond to human voice commands.

With Houndify, drivers can now interact with hundreds of domains—programs that provide users with relevant information or actions related to their queries. These include: navigation, weather, stock prices, sports scores, flight status, local business searches, and hotel searches with complex criteria, among others.

“The NVIDIA DRIVE platform has enabled us to create an embedded solution for interacting with cars using voice and natural language,” said Keyvan Mohajer, Founder and CEO, SoundHound. “By using NVIDIA GPUs for deep learning training, and the DRIVE IX platform for embedded computation using the GPU inside the Xavier SoC, we are able to scale to large vocabulary in natural language with the Houndify platform, maintaining speed and accuracy, even without a cloud connection.”

“Low-latency speech recognition is an important aspect of intelligent experiences in the vehicle,” said Danny Shapiro, senior director of automotive at NVIDIA. “SoundHound’s innovative solution on our open DRIVE IX platform will allow carmakers to offer systems that have an enormous vocabulary, understand a wide range of topics, and respond conversationally.”


IoT Innovator Newsletter

Get the latest updates and industry news in your inbox! Enter your email address and name below to be the first to know.

Name