Tom's Guide

Lip-sync battle — I put 3 leading AI video tools to the test

By Ryan Morrison,

20 hours ago

One of the fastest-improving areas of artificial intelligence video is lip-sync — that is, being able to make an AI character speak and look like it is speaking the words it says.

There are a number of companies offering lip-synching including Pika Labs, Synchlabs and character-based platforms like Hey Gen and Synthesia. The latter two are potentially the best examples of lip-synching I’ve seen, but they are focused more on avatar than animation.

For this story, I’ve focused on platforms working in the AI video space, rather than avatar creation. Kling and Runway are the most similar, offering full video creation platforms with lip-sync as a feature. Hedra is currently focused on the character, but it's building a wider steerable video model that starts with the character. So I’ve picked those three for this test.

Designing the battle

This is going to be a five-round competition between the trio of models, three rounds using an image I’ve given them and two using their own image/video generation capabilities. (I'll explain how many rounds I ended up running at the end.)

We will use the same image with each tool but use their own built-in voices and the same monologue script. I’ve focused on 10-second snippets even though Hedra can go up to a minute. This is to keep consistency across all three models.

Hedra works slightly differently to Kling and Runway. The latter two begin with a video and map lip movement within the video; Hedra begins with an image. The final results are similar.

Round 1: The Static Face Test

https://img.particlenews.com/image.php?url=2rnA9d_0w5rm80P00

This should be the easiest. We’ve given Midjourney the prompt: “A neutral, close-up portrait of a person with minimal expression, well-lit in a natural studio setting, showing a front-facing view of the face. The background is a soft, blurred color gradient with no distractions. Skin tones should be natural, and the character should look calm, with no notable emotion.”

We’ve then picked a custom voice from each of the three models and decided to make it say “Hello, welcome to the future of AI video generation. I don’t really exist but can still speak to you thanks to the wonders of lip-synching".

This first test should have taken 20 minutes to run even with the added complexity of lip-syncing, but Kling, as good as it is in terms of visual and motion realism, is by far the slowest AI video model. Runway, thanks to Turbo is near real-time and Hedra is animating an image, so it's quick.

This was a close round between Hedra, with the more realistic voice and mouth movement and Kling for the more impressive movement. I wasn't convinced by the flickering, so I'm giving it to Hedra on this occasion.

Round 2: The Expression Challenge

https://img.particlenews.com/image.php?url=1Mya5v_0w5rm80P00

In this test, we’ve got an ultra close-up image made in Midjourney: “A close-up portrait of a person with an expressive, happy face, showing teeth in a wide smile. The lighting is bright and warm, creating a cheerful and energetic mood. The background is a soft, light pastel colour that doesn't distract from the facial expression.”

Each of the three models were asked to say the phrase: “Life can be odd sometimes, but it is a good odd, a happy way of being. Something to smile about.” This will test the ability to capture emotional context.

All three were nightmarish renders. It is clear that if you want a good lip-sync, you should start with a closed mouth. I can't crown a winner, but I will reluctantly give it Hedra for the least horrific mouth movement.

Round 3: The Action Scene

https://img.particlenews.com/image.php?url=3Qym4c_0w5rm80P00

Finally, we’re going to see how well each contender can animate the lips of someone mid-conversation and not facing the camera directly. We use the Midjourney prompt: “A mid-action shot of a person slightly turned to the side, speaking with a hand raised as if gesturing during an intense conversation. The face shows determination and focus. The background is a dynamic, slightly blurred urban street scene, with movement to suggest the person is speaking while in motion.”

I’ve given the character the script: “So I told him if he wants to buy the car he’ll have to come back with a better price. Never heard from him again.”

None of them were perfect but I think Hedra and Runway did a better job than Kling. Overall, I think Runway took this round for the most realistic lip-sync.

The winner: Hedra

I had originally planned five rounds but Kling took so long to generate each video it made it impossible to complete in enough time. The last two tests were going to be of the text-to-video capabilities, without the starting image, but the results were too sporadic to be viable.

Hedra's Character-2 came out on top and to some inevitable degree. It starts with an image and animation it where the other two have to map the mouth movement within a video and sync the lips to the sound. Of the video models, I think Kling was better overall, but this was a first past the post-test, so technically Runway came in second.

If I were to repeat this experiment, I'd use external sounds to. create more consistency, always use generated images and carry out a wider range of tests. I just wish Kling were quicker.

More from Tom's Guide

Expand All

Read in NewsBreak

Comments /

Add a Comment

YOU MAY ALSO LIKE

Local News

ChatGPT can remember some of what you tell it — here's how to find out what it knows

Tom's Guide11 hours ago

The Suicide of 'Happy Days' Actress & Former Child Star Kathy O'Dare: 14 Sad Years Later

Herbie J Pilato4 days ago

Earth Will Have a "Second Moon" for 57 Days: What does this mean?

M Henderson10 days ago

In Memory of Groundbreaking Actor Ron Glass ('Barney Miller'): 8 Years After His Tragic Death

Herbie J Pilato5 days ago

Elon Musk unveils Tesla self-driving Cybercab for under $30,000 — what the heck is it?

Tom's Guide3 days ago

I asked ChatGPT to roast my Instagram feed — here's how you can too

Tom's Guide2 days ago

Samsung Galaxy S25 FE tipped for 2025 release — and it could be thinner than ever

Tom's Guide12 hours ago

Sonja Morgan To Perform At The Cascade Lounge At Agua Caliente Casino Palm Springs

Palm Springs Tribune6 days ago

Higher Death Rates Prompt Pfizer to Withdraw Medication Over Serious Safety Concerns

Uncovering Florida17 days ago

Cowboys vs Lions live stream today: How to watch NFL online and on TV from anywhere, injuries and inactives

Tom's Guide2 days ago

What is a cooling gel mattress topper and should you buy one?

Tom's Guide2 days ago

I just tried Ray-Ban Meta’s latest AI updates — my favorite smart glasses just got a whole lot smarter

Tom's Guide1 day ago

5 ways to make your old mattress more comfortable without replacing it

Tom's Guide2 days ago

How to watch 'The Last of the Sea Women' online and from anywhere

Tom's Guide3 days ago

I put two tiny webcams to the test — and there was a clear winner

Tom's Guide2 days ago

Microdosing Products from Smoke & Vape Shops Linked to Hospitalizations, Deaths

Uncovering Florida11 days ago

Vision screenings for driver’s license renewals will be required in ’25 to enhance driver safety

Northern Kentucky Tribune24 days ago

NYT Strands today — hints, spangram and answers for game #224 (Sunday, October 13 2024)

Tom's Guide2 days ago

No Kindle? No problem: 5 places to buy DRM-free e-books

Tom's Guide2 days ago

Harry’s Bold New Life: Why He No Longer Needs Meghan as His 'Security Blanket'

André Emilio6 days ago

What is a fall simmer pot and how to make one

Tom's Guide1 day ago

Single vs double oven — which is best for you?

Tom's Guide2 days ago

iPhone 17 line just tipped for major design upgrade

Tom's Guide2 days ago

5 signs you need a cooling mattress all year round and not just in summer

Tom's Guide3 days ago

The Black Hills are thick with lions. But those that leave are unlikely to repopulate the East, study finds.

WyoFile14 days ago

I tried Apple Intelligence to improve my writing — here’s what happened

Tom's Guide2 days ago

TOPDON TC001/TC002 thermal lens review

Tom's Guide3 days ago

Week's Buzz: Diddy's Mom, Cardi B Divorce & Freebies

Bryce Gruber6 days ago

“Please, Sir, Can I Have Some Smores?”

Alameda Post16 days ago

I tried this 8-minute no-equipment barre workout to help strengthen my core — here's what happened

Tom's Guide21 hours ago

It’s essential to note our commitment to transparency:

Our Terms of Use acknowledge that our services may not always be error-free, and our Community Standards emphasize our discretion in enforcing policies. As a platform hosting over 100,000 pieces of content published daily, we cannot pre-vet content, but we strive to foster a dynamic environment for free expression and robust discourse through safety guardrails of human and AI moderation.

Comments / 0

Community Policy