If LLMs Could Talk: Attaching Voice Models to LLMs

by Dr. Phil Winder, CEO


When: Wed May 22, 2024 at 16:30 +0100

Large language models (LLMs) are an established consumer-facing technology, providing chat-like functionality with artificial agents. However, many companies are now considering adding vocal capabilities, primarily to handle phone calls.

In this panel discussion, join Luke Marsden, Phil Winder, and other friends to talk about connecting voice models to LLMs. We’ll discuss the idea, the complications, and the downstream effects.

Of particular interest is the unspoken expectation that people who call a phone line will speak to a human. Even with the prevalence of script-based phone systems, do callers still expect this? Is it ethical to use an LLM in a human's place?

At the end of the discussion you will have a more grounded understanding of some of the key challenges involved in using voice models. This presentation will be high-level, although we will discuss architectural concerns. It is suitable for all experience levels.
