TechRadar

Google's AI robots are learning from watching movies – just like the rest of us

By Eric Hal Schwartz

5 days ago

Google DeepMind's robotics team is teaching robots to learn the way a human intern would: by watching a video. The team has published a new paper demonstrating how Google's RT-2 robots, equipped with the Gemini 1.5 Pro generative AI model, can absorb information from videos to learn how to get around and even carry out requests at their destination.

Thanks to the Gemini 1.5 Pro model's long context window, training a robot like a new intern is possible. This window allows the AI to process extensive amounts of information simultaneously. The researchers would film a video tour of a designated area, such as a home or office. Then, the robot would watch the video and learn about the environment.

The details in the video tours let the robot complete tasks based on its learned knowledge, using both verbal and image outputs. It's an impressive way of showing how robots might interact with their environment in ways reminiscent of human behavior. You can see how it works in the video below, as well as examples of different tasks the robot might carry out.

Robot AI Expertise

Those demonstrations aren't rare flukes, either. In practical tests, Gemini-powered robots operated within a 9,000-square-foot area and successfully followed over 50 different user instructions with a 90 percent success rate. This high level of accuracy opens up many potential real-world uses for AI-powered robots, helping out at home with chores or at work with menial or even more complex tasks.

That's because one of the more notable aspects of the Gemini 1.5 Pro model is its ability to complete multi-step tasks. DeepMind's research has found that the robots can work out how to answer questions like whether there's a specific drink available by navigating to a refrigerator, visually processing what's within, and then returning and answering the question.

Planning and carrying out an entire sequence of actions demonstrates a level of understanding and execution that goes beyond the single-step orders most robots currently handle.

Don't expect to see this robot for sale any time soon, though. For one thing, it takes up to 30 seconds to process each instruction, which in most cases is far slower than just doing the task yourself. For another, the chaos of real-world homes and offices will be much harder for a robot to navigate than a controlled environment, no matter how advanced the AI model is.

Still, integrating AI models like Gemini 1.5 Pro into robotics is part of a larger leap forward in the field. Robots equipped with models like Gemini or its rivals could transform healthcare, shipping, and even janitorial duties.
