Artwork

Innhold levert av Kane Simms. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av Kane Simms eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.
Player FM - Podcast-app
Gå frakoblet med Player FM -appen!

Bringing human-like performance to AI custom voices with Zohaib Ahmed

1:03:00
 
Del
 

Manage episode 283823057 series 2093893
Innhold levert av Kane Simms. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av Kane Simms eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.

Resemble.ai is a cutting edge synthetic voice (text-to-speech) provider that allows anyone or any brand to create their own, customer TTS solution to use across any and all channels. From IVR, Alexa and Google Assistant to narrating videos and marketing materials and even voicing game characters. With just 50 lines of speech, you could have your own dedicated brand voice that brings a consistent and elevated customer experience to your conversational applications. With quicker-than-real-time processing, the low latency capabilities makes having a conversation with a Resemble.ai-voiced assistant as natural as can be.

In this episode of VUX World, we're joined by founder and CEO, Zohaib Ahmed, to discuss the work Resemble is doing to help brands create stand-out conversational experiences. We discuss what Resemble is and how it works, the value of having your own custom voice, the use cases and applications it's being used for, the process for creating, the editing and publishing tools for customising intonation and emotion, as well as the ethical considerations around voice cloning, fraud and deep fakes.


00:00 Intro

02:55 About Zohaib Ahmed and Resemble.ai

04:47 What are people using Resemble.ai for today?

07:37 How much dialogue is needed to create a TTS voice?

08:27 Voice Talent Pool: what's that?

11:50 Bondad: How far off are we from high quality rendering at run time with low latency?

14:00 What's the process for creating a custom brand voice?

17:00 Are call centres cloning staff voices?

19:00 An alternative approach from a branded assistant

21:10 Ethics of voice cloning

24:20 Resemblyzer and tackling speaker verification

27:40 Matthew James-Dewstowe: Can you tweak the voice once its created?

31:40 Jay Ruparel: How do you handle dynamic speech and keep the voice integrity?

37:40 Sean Thornton: Predicting emotion

41:00 Heidi Cohen: Regionalisation and localisation

44:45 Real time translation of speech

48:20 Jay Ruparel: Thoughts on filler words like 'um', 'ah' etc

51:37 Proving the value and ROI

01:00:40 Where can people find out more about Resemble.ai

Links

Visit Resemble.ai

Follow Resemble.ai on Twitter

Zohaib Ahmed on Twitter and LinkedIn



Hosted on Acast. See acast.com/privacy for more information.

  continue reading

306 episoder

Artwork
iconDel
 
Manage episode 283823057 series 2093893
Innhold levert av Kane Simms. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av Kane Simms eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.

Resemble.ai is a cutting edge synthetic voice (text-to-speech) provider that allows anyone or any brand to create their own, customer TTS solution to use across any and all channels. From IVR, Alexa and Google Assistant to narrating videos and marketing materials and even voicing game characters. With just 50 lines of speech, you could have your own dedicated brand voice that brings a consistent and elevated customer experience to your conversational applications. With quicker-than-real-time processing, the low latency capabilities makes having a conversation with a Resemble.ai-voiced assistant as natural as can be.

In this episode of VUX World, we're joined by founder and CEO, Zohaib Ahmed, to discuss the work Resemble is doing to help brands create stand-out conversational experiences. We discuss what Resemble is and how it works, the value of having your own custom voice, the use cases and applications it's being used for, the process for creating, the editing and publishing tools for customising intonation and emotion, as well as the ethical considerations around voice cloning, fraud and deep fakes.


00:00 Intro

02:55 About Zohaib Ahmed and Resemble.ai

04:47 What are people using Resemble.ai for today?

07:37 How much dialogue is needed to create a TTS voice?

08:27 Voice Talent Pool: what's that?

11:50 Bondad: How far off are we from high quality rendering at run time with low latency?

14:00 What's the process for creating a custom brand voice?

17:00 Are call centres cloning staff voices?

19:00 An alternative approach from a branded assistant

21:10 Ethics of voice cloning

24:20 Resemblyzer and tackling speaker verification

27:40 Matthew James-Dewstowe: Can you tweak the voice once its created?

31:40 Jay Ruparel: How do you handle dynamic speech and keep the voice integrity?

37:40 Sean Thornton: Predicting emotion

41:00 Heidi Cohen: Regionalisation and localisation

44:45 Real time translation of speech

48:20 Jay Ruparel: Thoughts on filler words like 'um', 'ah' etc

51:37 Proving the value and ROI

01:00:40 Where can people find out more about Resemble.ai

Links

Visit Resemble.ai

Follow Resemble.ai on Twitter

Zohaib Ahmed on Twitter and LinkedIn



Hosted on Acast. See acast.com/privacy for more information.

  continue reading

306 episoder

Alle Folgen

×
 
Loading …

Velkommen til Player FM!

Player FM scanner netter for høykvalitets podcaster som du kan nyte nå. Det er den beste podcastappen og fungerer på Android, iPhone og internett. Registrer deg for å synkronisere abonnement på flere enheter.

 

Hurtigreferanseguide

Copyright 2024 | Sitemap | Personvern | Vilkår for bruk | | opphavsrett