The Guardian

OpenAI says the latest ChatGPT can ‘think’ – and I have thoughts

By Chris Stokel-Walker,

11 hours ago

https://img.particlenews.com/image.php?url=2A4Vdo_0vZEYWb700 — Can AI finally replicate humans by having ‘thoughts’ … Rodin’s The Thinker. Photograph: Philippe Wojazer/Reuters

We are fast approaching two years of the generative AI revolution, sparked by the November 2022 release of ChatGPT by OpenAI. So far it’s been a mixed bag.

OpenAI recently announced it had crossed 200 million weekly active users – nothing to be sniffed at, but it got its first 100 million within two months of release. A recent YouGov study found that the inclusion of AI in a product is as likely to turn off a potential purchaser as much as it is to get them to hand over their cash.

Nevertheless, money keeps flowing into the sector, and advances keep coming. OpenAI is casting around investors for money to fund future development that would see the company valued at $150 billion . That would put it on a par with Cisco, Shell and McDonalds. And last week, it unveiled its latest model , called o1, which it has touted as a step change in the development of generative AI.

The o1 model, previously codenamed Strawberry, is designed to reason through decisions, much in the same way humans do. The latest version of the model underpinning ChatGPT is actually a step backwards when it comes to speed of output and the size of the model, which is smaller for the time being. Think of it as GPT-4.5, rather than the rumoured next big iteration, GPT-5, which is reportedly still in development.

Mission: Impossible?

While on paper o1 is a damp squib, it does something that Alex had previously highlighted in this newsletter as an issue with LLM-based chatbots, and which he called the “Tom Cruise problem” . The issue was that researchers could ask a question of ChatGPT one way, but when asked a question that directly related to the initial one – for instance, who is Tom Cruise’s mother? (Answer: Mary Lee Pfeiffer), then being asked who is Mary Lee Pfeiffer’s son? (Answer: Tom Cruise) – it would balk.

Ask o1 that pair of questions and it aces it. It even provides traces of how it gets to the answer – which OpenAI has cannily, and inaccurately because AI models don’t have a brain, called “thoughts”. (If you want to know why anthropomorphising AI models is an issue, check out this story I wrote in February.) When asked the second question, o1 “thought” for four seconds, including tracing out the family connections and confirming details.

So far, so good. OpenAI says o1 can reason . Many are less sure about such a declarative statement like that, but let’s let them have it for the purposes of marketing. That would mean a significant shift in how you can use generative AI: rather than regurgitating facts from its training data, or producing answers it statistically reckons is most likely to please users, it could consider information and respond.

“Could”, however, is the key word. We are still largely in the dark about how these things work – and “we” includes the developers of such tools. OpenAI has said this ability to reason is a big thing – the company has even trotted out a questionable claim that o1 is its most dangerous model yet ( see here for how that’s sometimes more marketing spiel than anything). Those who have tried probing the limits of the o1 model seem to agree with their point about the reasoning, but less so with the danger part.

Pay no attention to that man behind the curtain!

Well, sort of. Because the probing can only go so far. To try and understand the chain of thought process that underpins o1 – if you want a good primer, Simon Willison is ever-dependable – users wanting to look under the hood have been trying to get a little more detail on exactly what o1’s “thought” process is. The information users are currently shown is a brief summary of each step in the chain of thought.

And because of that, they’ve been asking the model itself about how it comes up with its answers – though they have also received emails from OpenAI asking them to stop, otherwise their accounts will be suspended.

It all means that we’re left somewhat in the dark. This looks like a transformative step change in the world of AI, and something that could turn the tool from one whose output you have to look at with a side-eye of suspicion to a must-use.

What’s particularly interesting is that OpenAI’s dominance has effectively squeezed out coverage of any and all competitors of late. Mistral, the highly-touted French competitor, released its first multimodal model last week . The Pixtral 12B model adds image recognition to text generation. It should have gained huge plaudits. But OpenAI and o1 sucked up all the oxygen.

Still, it all means the AI train keeps on rolling, and it’s starting to finally live up to its promise. Whether those who tried ChatGPT in its early days and found it lacking can be persuaded to come back to try the newer whizz-bang models is another question.

The wider TechScape

Online dating apps are responsible for a rise in income inequality as people choose partners who earn the same as them – and leave behind those who earn less.
How do we preserve our digital history in a world where third-party services like the Internet Archive are being attacked and sued?
EU commissioner Thierry Breton , who has been an outspoken big tech critic, is leaving his role over a spat with Ursula von der Leyen.
Data centre emissions could be 662% higher than publicly claimed .
The White House has strongly condemned Elon Musk for tweeting “no one is even trying to assassinate Biden/Kamala” in response to an X user asking “Why they want to kill Donald Trump?” Meanwhile, here’s what 24 hours in Musk’s mind looks like .
TikTok started its fight for survival in court yesterday.

Expand All

Read in NewsBreak

Comments /

Add a Comment

YOU MAY ALSO LIKE

Local News

‘It would not get made today’: Todd Solondz on his shocking paedophile film Happiness

The Guardian1 day ago

‘Entire ecosystem’ of fossils 8.7m years old found under Los Angeles high school

The Guardian2 days ago

Azealia Banks review – thundering bare bones set almost brings down the building

The Guardian2 days ago

Every household can get four free COVID-19 tests by mail, starting late September

Northern Kentucky Tribune10 days ago

Death Might Not Be Real—Quantum Physics Suggests It’s Just an Illusion

William Saint Vallast hour

Son of suspect speaks after apparent Trump assassination attempt in Florida

The Guardian1 day ago

Did you solve it? The poker puzzle that has everyone fooled

The Guardian1 day ago

‘People forget to look up’: September supermoon to light up Wednesday’s night sky

The Guardian13 hours ago

Woman catches Texas Roadhouse managers allegedly scheming to terminate an injured worker

NewsNinja12 days ago

Fentanyl-meth combo ravages homeless in Denver, so why aren't there better treatments?

David Heitz10 days ago

Letter: Lisa Westcott obituary

The Guardian6 hours ago

Sean ‘Diddy’ Combs arrested in New York after federal indictment

The Guardian19 hours ago

There’s a danger that the US supreme court, not voters, picks the next president | David Daley

The Guardian12 hours ago

Lebanon explosions ‘an extremely concerning escalation’, says UN official, as Hezbollah threatens retaliation – live

The Guardian14 hours ago

Guardian parent company in talks over potential sale of Observer

The Guardian10 hours ago

Clawfoot review – Hollywood nepo babies do fine in horror-comedy bathed in gore

The Guardian12 hours ago

The Not-So-Secret Gay Life of Actor Roddy McDowell: 25 Years After His Tragic Death From Lung Cancer

Herbie J Pilato23 days ago

Keep The Kitchen Sink Area Decluttered & Organized

Declutterbuzz12 days ago

The world should breathe a sigh of relief that Donald Trump wasn’t harmed in Florida | Simon Tisdall

The Guardian1 day ago

Sutton house where fire killed four boys was 20cm deep in rubbish, court hears

The Guardian1 day ago

iOS 18 release: everything you need to know about Apple’s big updates

The Guardian1 day ago

Half of advanced melanoma patients live for 10 years with double drug treatment

The Guardian2 days ago

How Legal Cannabis Could Help Your Property Value Grow

Morristown Minute13 days ago

3 Lucky Zodiac Signs With Financial Abundance | September 17, 2024

Total Apex Sports & Entertainment7 hours ago

Meet The Tiny 6lb Dog Looking For Love

Dianna Carney20 days ago

Tell us about your tattoos and what they mean to you

The Guardian12 hours ago

Potato-shaped critter visible now may predict climate change in Colorado

David Heitz25 days ago

June Watson: ‘In the 1970s at the National you couldn’t rehearse after lunch because everybody had had too much to drink!’

The Guardian2 days ago

Louisiana town the canary in the coalmine as climate effects worsen

The Guardian2 days ago

No longer ‘half slave, half free’

The Lens24 days ago

It’s essential to note our commitment to transparency:

Our Terms of Use acknowledge that our services may not always be error-free, and our Community Standards emphasize our discretion in enforcing policies. As a platform hosting over 100,000 pieces of content published daily, we cannot pre-vet content, but we strive to foster a dynamic environment for free expression and robust discourse through safety guardrails of human and AI moderation.

Comments / 0

Community Policy