TechRadar

Meta admits it scraped all Australian Facebook posts since 2007 to train its AI

By Ellen Jennings-Trace,

5 hours ago

Meta has admitted it used Facebook and Instagram publicposts for Australian users to train its Artificial Intelligence models, and has scraped information from as far back as 2007.

An Australian Parliamentary committee has heard that whilst European users can opt out thanks to GDPR laws, Australian customers are not given that choice.

Meta has denied using the information of anyone under 18, but did confirm it had used over a decade’s worth of data. The firm could not answer whether it has scraped the photos of children who are now adults (i.e. those who created their accounts as a child, but have since turned 18).

A turning tide

The process of ‘scraping’ is essential for the development of AI and is basically data harvesting from websites, extracting the information and feeding it back to a Large Language Models (LLMs) which learns from the data. This means that GDPR regulations are becoming troublesome for more and more LLMs such as ChatGPT , which collects data from all over the internet without consent from the original source.

Meta’s global privacy director Melinda Claybaugh sat before the inquiry and admitted that the company was forced to pause the launch of AI products in Europe due to a lack of certainty, and it has had to give European users an opt-out due to more robust privacy laws. Senator Shoebridge grilled the Meta representative,

“The truth of the matter is that, unless you consciously had set those posts to private, since 2007, Meta has just decided you will scrape all of the photos and all of the text from every public post on Instagram or Facebook that Australians have shared since 2007, unless there was a conscious decision to set them on private. But that’s actually the reality, isn’t it?”

Claybaugh replied, “Correct”. She added that users can set their posts to private now to prevent future scraping, but this would have no effect on the data already taken.

The realization seems to be creeping in for the public and for tech companies that training AI models requires such vast amounts of data that it is ‘impossible’ to do so without using copyrighted materials . Considering millions of user's posts have been used without their consent, it looks like tech giants might face much stricter regulations in future.

Via The Guardian

More from TechRadar Pro

Take a look at our choice for best productivity tools
Doomed to fail? Most AI projects flop within 12 months, with billions of dollars being wasted
Check out our pick of the best small business software

Expand All

Read in NewsBreak

Comments /

Add a Comment

YOU MAY ALSO LIKE

Local News

Every iPhone 16 model seemingly has 8GB of RAM, which could be bad news for Apple Intelligence

TechRadar1 day ago

Mercedes announces breakthrough in solid-state EV battery tech, but Chinese rivals are still way ahead

TechRadar20 hours ago

‘For $6, who cares?’: Applebee’s customer shocked after ordering both adult and kids’ cheeseburgers

NewsNinja16 days ago

Denver marijuana arrests surge despite lower consumption, report reveals

David Heitz10 days ago

More DUI patrols possible if marijuana issue passes

Jacksonville Today25 days ago

Apple Intelligence features explained - everything you need to know about Apple AI and when you can use it

TechRadar16 hours ago

Every household can get four free COVID-19 tests by mail, starting late September

Northern Kentucky Tribune4 days ago

Thousands of parents die of overdoses; advocates say their kids need more help

Northern Kentucky Tribune23 days ago

Over 80 Cruise Passengers Seeking Compensation from Cruise Line After Getting Sick

J. Souza8 days ago

Fentanyl-meth combo ravages homeless in Denver, so why aren't there better treatments?

David Heitz4 days ago

23 Defendants Charged in Federal Indictments for Drone-Smuggling Drug Operation in Georgia

Daily Coffee Press21 days ago

Six Georgia Drug Traffickers Sentenced to Long Federal Terms in Major Meth Operation Bust

Daily Coffee Press6 days ago

Keep The Kitchen Sink Area Decluttered & Organized

Declutterbuzz6 days ago

iPhone 16 Plus review – Fulfills your big screen affordable dreams

TechRadar1 day ago

Chick-fil-A's customer gets charged for a bag. ‘"What’s next?" she asks

NewsNinja25 days ago

Opinion: Denver homeless hotel diary: Overdoses common here

David Heitz12 days ago

A huge AirPods Pro 2 update is rolling out now, ahead of iOS 18 – including head gestures

TechRadar21 hours ago

Big Lots files bankruptcy amid closing 74 stores in California

The HD Post2 days ago

More than 30 Students in Georgia Recently Charged for School Threats

Daily Coffee Press19 hours ago

This super-mini portable SSD lets you expand your phone and tablet storage up to 2TB — and I'm in love with the design

TechRadar2 days ago

Wonder Jelly

Alameda Post16 days ago

NJ Businessman Pleads Guilty to Multimillion-Dollar Jewel Trade Fraud

Morristown Minute13 hours ago

Six wonderful cats to adopt this Labor Day weekend

Cats of Kansas City13 days ago

15 missing children in Missouri since June 2024: some cases resolved and more added

CJ Coombs7 days ago

Rollin' 60s Crips Member Pleads Guilty to Racketeering & Cocaine Charges

Morristown Minute25 days ago

Meet The Tiny 6lb Dog Looking For Love

Dianna Carney14 days ago

How Legal Cannabis Could Help Your Property Value Grow

Morristown Minute7 days ago

UPDATED: Husband Arrested Missing Manassas Park Mother: Fate is Unknown

The Inside Scoop - PWC20 days ago

Religious-Minded Actress Ann B. Davis Defended 'The Brady Bunch' on TV's 'Sally Jesse Raphael' Show

Herbie J Pilato19 days ago

No longer ‘half slave, half free’

The Lens19 days ago

It’s essential to note our commitment to transparency:

Our Terms of Use acknowledge that our services may not always be error-free, and our Community Standards emphasize our discretion in enforcing policies. As a platform hosting over 100,000 pieces of content published daily, we cannot pre-vet content, but we strive to foster a dynamic environment for free expression and robust discourse through safety guardrails of human and AI moderation.

Comments / 0

Community Policy