Tom's Guide

    I just had a conversation with Hume's new AI voice assistant — and I forgot it wasn't human

    By Ryan Morrison

    5 hours ago


    Hume EVI is an artificial intelligence speech-to-speech voice assistant, and with the most recent version 2 update, it may be more natural and intuitive than OpenAI’s GPT-4o Advanced Voice.

    The brainchild of Hume cofounder Alan Cowen and his team, EVI 2 builds on the previous generation model with a more natural-sounding voice and better emotional understanding.

    According to Hume: “EVI 2 can converse rapidly with users with sub-second response times, understand a user’s tone of voice, generate any tone of voice, and even respond to some more niche requests like changing its speaking rate or rapping.”

    My testing found it more natural than OpenAI’s Advanced Voice but slightly slower and with fewer capabilities. For example, EVI is more empathic in its vocal tone, but ChatGPT is better at laughing and conveying other sounds associated with the human voice.

    What is Hume EVI 2?

    EVI 2 is an empathic voice assistant, available, like ChatGPT Voice or Gemini Live, as a dedicated smartphone app, on the web, or as an API developers can use in their own projects.

    Hume's EVI 2 stands out from the crowd because of its flexibility. It is natively speech-to-speech and has its own LLM brain, but you can swap that for any other model, including GPT-4o or Gemini. You could even use EVI to give voice to Grok or Meta's Llama 3.1.
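In practice, that swap-the-brain design means the voice layer and the language model are configured separately. A minimal sketch of what such a session configuration might look like, assuming hypothetical names throughout (`EviSessionConfig`, `model_provider`, and the voice string are illustrative, not Hume's actual API schema):

```python
# Illustrative sketch of an EVI-style session config in which the speech
# layer (the voice) is decoupled from the "brain" (the LLM that writes
# the replies). All field names and values are assumptions, not Hume's
# real schema -- consult Hume's API documentation for the actual shape.
from dataclasses import dataclass


@dataclass
class EviSessionConfig:
    voice: str           # one of the preset voices
    model_provider: str  # e.g. "hume", "openai", "google"
    model_name: str      # the LLM that generates the replies


def with_swapped_brain(cfg: EviSessionConfig, provider: str, model: str) -> EviSessionConfig:
    """Return a new config that keeps the same voice but swaps the LLM."""
    return EviSessionConfig(voice=cfg.voice, model_provider=provider, model_name=model)


default = EviSessionConfig(voice="default-voice", model_provider="hume", model_name="evi-2")
gpt4o_backed = with_swapped_brain(default, "openai", "gpt-4o")
print(gpt4o_backed.voice, gpt4o_backed.model_name)  # default-voice gpt-4o
```

The point of the sketch is the separation of concerns: the same voice persists while the underlying model changes, which is what lets EVI lend its voice to GPT-4o, Gemini, Grok, or Llama.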

    We’re building systems that can adapt the voice to the user automatically including adopting the right accent, taking a more relaxed or formal personality, whatever works to help you engage with the AI

    Alan Cowen, Hume AI CEO

    I spoke to Dr. Cowen ahead of the release of EVI 2 and he said the goal is to “give developers the tools to build what they want,” explaining that the other players in the space are building ecosystems around themselves. “We train on top of open-source models to give them voice.”

    “The developer can take this model, and use whichever framework they want, we also enable voice modulation and personality voices,” he added. He also said in the future, there could be a small version of the model that could run on the edge, on a laptop or even on a smart speaker.

    Outside of the API and developer tools, the Hume AI app is an impressive experience, allowing you to hold a conversation, brainstorm ideas or even get something off your chest with a natural-sounding AI voice that detects your vocal tone and reacts accordingly.


    “We’re building systems that can adapt the voice to the user automatically including adopting the right accent, taking a more relaxed or formal personality, whatever works to help you engage with the AI,” Dr. Cowen told Tom’s Guide.

    As well as offering set voices developed by Hume, EVI 2 can also clone voices, but this feature has been restricted: users can set identity-related voice characteristics to create a custom voice without cloning a real person's voice directly.

    “GPT-4o is focused on the shiny capabilities, we’re focused on what the developer actually needs including the ability to modulate the voice without cloning,” Dr. Cowen told me during an interview before the launch of the new model.

    Hume's approach to voice development is prompt-based: users just type what they want the voice to sound like, and the AI does the work. “We came up with voice prompting and it can just follow that personality,” he said. It can also generate other languages and accents.
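Prompt-based voice design means a developer describes the desired voice in plain text rather than recording or cloning one. A minimal sketch of assembling such a description, with a hypothetical payload shape (`build_voice_prompt` and the `voice_prompt` field are assumptions for illustration, not Hume's actual API):

```python
# Illustrative sketch of prompt-based voice design: the developer
# describes the voice in plain language and the system synthesizes it.
# The function and payload structure below are assumptions, not Hume's
# real schema.

def build_voice_prompt(persona: str, accent: str, pace: str) -> dict:
    """Assemble a plain-text voice description from a few traits."""
    description = (
        f"A {persona} voice with a {accent} accent, "
        f"speaking at a {pace} pace."
    )
    return {"voice_prompt": description}


payload = build_voice_prompt("warm, reassuring", "soft Irish", "relaxed")
print(payload["voice_prompt"])
```

The appeal of this design is that the "identity" of the voice lives in an editable sentence, not in a recording of a real person, which is how voice variety is offered without direct cloning.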

    How well does EVI 2 work?

    I tried EVI 2 on the Hume AI website with several voices. I found it impressively natural sounding, and it could adapt its voice depending on how I spoke.

    It is also a good storyteller, able to convey the emotional depth of a character. While it matches or even exceeds the emotion mimicry of ChatGPT Voice, it lacks other features such as breathing sounds and the hesitation noises common in human speech. That said, I still got distracted enough during a conversation to forget it wasn’t human.

    For fun, I also had EVI 2 have a conversation with ChatGPT Advanced Voice. I’ve tried this with other AI models to limited effect, but it worked well here. They started chatting away like old friends, talking about recipes and hobbies.

    What makes EVI 2 an important step isn’t its capabilities; it is the company's wider approach. While you might use Advanced Voice in ChatGPT or Gemini Live on an Android handset, EVI could be built into any software or device — so it could be everywhere.

    Its ability to track emotional responses through vocal tone could also prove helpful in the care sector, giving medical robots a bedside manner. Or it could be used to replace the automated voice on call waiting, able to soothe you out of an angry state despite still being number five million in the queue. It's got to be better than the lie: “your call is important to us.”
