By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
PulseReporterPulseReporter
  • Home
  • Entertainment
  • Lifestyle
  • Money
  • Tech
  • Travel
  • Investigations
Reading: Emotive voice AI startup Hume launches new EVI 3 mannequin with fast customized voice creation
Share
Notification Show More
Font ResizerAa
PulseReporterPulseReporter
Font ResizerAa
  • Home
  • Entertainment
  • Lifestyle
  • Money
  • Tech
  • Travel
  • Investigations
Have an existing account? Sign In
Follow US
  • Advertise
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
PulseReporter > Blog > Tech > Emotive voice AI startup Hume launches new EVI 3 mannequin with fast customized voice creation
Tech

Emotive voice AI startup Hume launches new EVI 3 mannequin with fast customized voice creation

Pulse Reporter
Last updated: May 30, 2025 5:20 am
Pulse Reporter 1 day ago
Share
Emotive voice AI startup Hume launches new EVI 3 mannequin with fast customized voice creation
SHARE

Be part of our each day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


New York-based AI startup Hume has unveiled its newest Empathic Voice Interface (EVI) conversational AI mannequin, EVI 3 (pronounced “Evee” Three, just like the Pokémon character), concentrating on every thing from powering buyer assist techniques and well being teaching to immersive storytelling and digital companionship.

EVI 3 lets customers create their very own voices by speaking to the mannequin (it’s voice-to-voice/speech-to-speech), and goals to set a brand new commonplace for naturalness, expressiveness, and “empathy” in accordance with Hume — that’s, how customers understand the mannequin’s understanding of their feelings and its capability to reflect or alter its personal responses, by way of tone and phrase alternative.

Designed for companies, builders, and creators, EVI 3 expands on Hume’s earlier voice fashions by providing extra refined customization, sooner responses, and enhanced emotional understanding.

Particular person customers can work together with it as we speak by Hume’s reside demo on its web site and iOS app, however developer entry by Hume’s proprietary software programming interface (API) is claimed to be made obtainable in “the approaching weeks,” as a weblog put up from the corporate states.

At that time, builders will have the ability to embed EVI 3 into their very own customer support techniques, artistic tasks, or digital assistants — for a worth (see under).

My very own utilization of the demo allowed me to create a brand new, customized artificial voice in seconds based mostly on qualities I described to it — a mixture of heat and assured, and a masculine tone. Chatting with it felt extra naturalistic and straightforward than different AI fashions and positively the inventory voices from legacy tech leaders such Apple with Siri and Amazon with Alexa.

What builders and companies ought to find out about EVI 3

Hume’s EVI 3 is constructed for a spread of makes use of—from customer support and in-app interactions to content material creation in audiobooks and gaming.

It permits customers to specify exact persona traits, vocal qualities, emotional tone, and dialog subjects.

This implies it could possibly produce something from a heat, empathetic information to a unusual, mischievous narrator—right down to requests like “a squeaky mouse whispering urgently in a French accent about its scheme to steal cheese from the kitchen.”

EVI 3’s core energy lies in its capability to combine emotional intelligence straight into voice-based experiences.

In contrast to conventional chatbots or voice assistants that rely closely on scripted or text-based interactions, EVI 3 adapts to how individuals naturally communicate — selecting up on pitch, prosody, pauses, and vocal bursts to create extra participating, humanlike conversations.

Nonetheless, one massive function Hume’s fashions at the moment lack — and which is obtainable by rivals open supply and proprietary, corresponding to ElevenLabs — is voice cloning, or the fast replication of a person’s or different voice, corresponding to an organization CEO.

But Hume has indicated it’s going to add such a functionality to its Octave text-to-speech mannequin, as it’s famous as “coming quickly” on Hume’s web site, and prior reporting by yours actually on the corporate discovered it’s going to permit customers to duplicate voices from as little as 5 seconds of audio.

Hume has acknowledged it’s prioritizing safeguards and moral issues earlier than making this function broadly obtainable. At the moment, this cloning functionality shouldn’t be obtainable in EVI itself, with Hume emphasizing versatile voice customization as a substitute.

Inside benchmarks present customers choose EVI 3 to OpenAI’s GPT-4o voice mannequin

In accordance with Hume’s personal checks with 1,720 customers, EVI 3 was most popular over OpenAI’s GPT-4o in each class evaluated: naturalness, expressiveness, empathy, interruption dealing with, response velocity, audio high quality, voice emotion/model modulation on request, and emotion understanding on request (the “on request” options are lined in “instruction following” seen under).

It additionally normally bested Google’s Gemini mannequin household and the brand new open supply AI mannequin agency Sesame from former Oculus co-creator Brendan Iribe.

It additionally boasts decrease latency (~300 milliseconds), strong multilingual assist (English and Spanish, with extra languages coming), and successfully limitless customized voices. As Hume writes on its web site (see screenshot instantly under):

Key capabilities embody:

  • Prosody technology and expressive text-to-speech with modulation.
  • Interruptibility, enabling dynamic conversational movement.
  • In-conversation voice customizability, so customers can alter talking model in actual time.
  • API-ready structure (coming quickly), so builders can combine EVI 3 straight into apps and companies.

Pricing and developer entry

Hume affords versatile, usage-based pricing throughout its EVI, Octave TTS, and Expression Measurement APIs.

Whereas EVI 3’s particular API pricing has not been introduced but (marked as TBA), the sample suggests it is going to be usage-based, with enterprise reductions obtainable for giant deployments.

For reference, EVI 2 is priced at $0.072 per minute — 30% decrease than its predecessor, EVI 1 ($0.102/minute).

For creators and builders working with text-to-speech tasks, Hume’s Octave TTS plans vary from a free tier (10,000 characters of speech, ~10 minutes of audio) to enterprise-level plans. Right here’s the breakdown:

  • Free: 10,000 characters, limitless customized voices, $0/month
  • Starter: 30,000 characters (~half-hour), 20 tasks, $3/month
  • Creator: 100,000 characters (~100 minutes), 1,000 tasks, usage-based overage ($0.20/1,000 characters), $10/month
  • Professional: 500,000 characters (~500 minutes), 3,000 tasks, $0.15/1,000 further, $50/month
  • Scale: 2,000,000 characters (~2,000 minutes), 10,000 tasks, $0.13/1,000 further, $150/month
  • Enterprise: 10,000,000 characters (~10,000 minutes), 20,000 tasks, $0.10/1,000 further, $900/month
  • Enterprise: Customized pricing and limitless utilization

For builders engaged on real-time voice interactions or emotional evaluation, Hume additionally affords a Pay as You Go plan with $20 in free credit and no upfront dedication. Excessive-volume enterprise clients can go for a devoted Enterprise plan that includes dataset licenses, on-premises options, customized integrations, and superior assist.

Hume’s historical past of emotive AI voice fashions

Based in 2021 by Alan Cowen, a former researcher at Google DeepMind, Hume goals to bridge the hole between human emotional nuance and AI interplay.

The corporate skilled its fashions on an expansive dataset drawn from lots of of 1000’s of individuals worldwide—capturing not simply speech and textual content, but in addition vocal bursts and facial expressions.

“Emotional intelligence consists of the power to deduce intentions and preferences from conduct. That’s the very core of what AI interfaces try to realize,” Cowen advised VentureBeat. Hume’s mission is to make AI interfaces extra responsive, humanlike, and in the end extra helpful—whether or not that’s serving to a buyer navigate an app or narrating a narrative with simply the best mix of drama and humor.

In early 2024, the corporate launched EVI 2, which provided 40% decrease latency and 30% lowered pricing in comparison with EVI 1, alongside new options like dynamic voice customization and in-conversation model prompts.

February 2025 noticed the debut of Octave, a text-to-speech engine for content material creators able to adjusting feelings on the sentence degree with textual content prompts.

With EVI 3 now obtainable for hands-on exploration and full API entry simply across the nook, Hume hopes to permit builders and creators to reimagine what’s potential with voice AI.

Day by day insights on enterprise use circumstances with VB Day by day

If you wish to impress your boss, VB Day by day has you lined. We provide the inside scoop on what corporations are doing with generative AI, from regulatory shifts to sensible deployments, so you’ll be able to share insights for max ROI.

Learn our Privateness Coverage

Thanks for subscribing. Try extra VB newsletters right here.

An error occured.


You Might Also Like

Benjamin Capital Companions acquires AR agency New Factor Co. to spice up cash moments

Manchester United vs. Actual Sociedad 2025 livestream: Watch Europa League free of charge

7 Greatest Bassinets (2024), Examined and Reviewed

What Brought on the European Energy Outage?

Popeye and Tintin are actually within the public area

Share This Article
Facebook Twitter Email Print
Previous Article Salesforce shares fall as software program maker reveals pockets of weak point Salesforce shares fall as software program maker reveals pockets of weak point
Next Article Would We Be Actual Mates Primarily based On The Films And Exhibits We've Each Seen? Would We Be Actual Mates Primarily based On The Films And Exhibits We've Each Seen?
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!

More News

Rank Motion pictures Blindly In Our Chaotic New Recreation
Rank Motion pictures Blindly In Our Chaotic New Recreation
23 minutes ago
The challenges of translating The Final of Us to the tv viewers | The DeanBeat
The challenges of translating The Final of Us to the tv viewers | The DeanBeat
42 minutes ago
Why evening owls see sooner charges of cognitive decline, in accordance with new research
Why evening owls see sooner charges of cognitive decline, in accordance with new research
48 minutes ago
17 Celebrities Who Bought Refreshingly Actual About Durations
17 Celebrities Who Bought Refreshingly Actual About Durations
1 hour ago
Is Utilizing a Stair Machine the Identical as Climbing Stairs?
Is Utilizing a Stair Machine the Identical as Climbing Stairs?
2 hours ago

About Us

about us

PulseReporter connects with and influences 20 million readers globally, establishing us as the leading destination for cutting-edge insights in entertainment, lifestyle, money, tech, travel, and investigative journalism.

Categories

  • Entertainment
  • Investigations
  • Lifestyle
  • Money
  • Tech
  • Travel

Trending

  • Rank Motion pictures Blindly In Our Chaotic New Recreation
  • The challenges of translating The Final of Us to the tv viewers | The DeanBeat
  • Why evening owls see sooner charges of cognitive decline, in accordance with new research

Quick Links

  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Disclaimer
2024 © Pulse Reporter. All Rights Reserved.
Welcome Back!

Sign in to your account