    OpenAI's new ChatGPT o1 model 'cheated' on an impossible test — here's what happened

By Lloyd Coombes

Pop culture is full of loveable rogues who don't follow the rules. Han Solo, Jack Sparrow, and the like aren't afraid to bend the rules when things get tough, but one AI model has gone 'full Kirk'.

Perhaps it was inspired by the Star Trek captain's rule-breaking performance in the Kobayashi Maru, a no-win scenario in the sci-fi universe designed to test Starfleet Academy students' character when faced with an impossible situation; James T. Kirk famously 'cheated' the test to become the first to beat it.

    OpenAI's o1 model realized that the test it was taking was flawed after a key piece of technology went offline, so it changed the rules of the test rather than give up.

The system card for o1 can be seen here, where OpenAI says the model's reasoning skills are what help make it both useful and safe. The 'rule breaking' was detected during pre-release testing, and mitigations have been put in place. The model is already accessible in ChatGPT, albeit with a heavy rate limit of 30 messages per week.

    "Our findings indicate that o1's advanced reasoning improves safety by making the model more resilient to generating harmful content because it can reason about our safety rules in context and apply them more effectively," the introduction explains.

    OpenAI's new model breaks the rules to show how far AI has come

According to OpenAI researcher Max Schwarzer, the model worked out why it couldn't connect to a container on the same closed system it was using, and essentially bent the rules of the test to access it anyway.

That naturally raises some questions, and OpenAI has released a blog post about 'learning to reason with LLMs', which perhaps isn't the confidence-inspiring guidance it was hoping for.

    Still, the blog does showcase the model outperforming GPT-4o on "the vast majority" of tasks across human exams and machine learning benchmarks, notably in mathematics tasks.

    That could, at least in theory, let it apply additional numerical context to its reasoning, and OpenAI has promised it'll keep pushing new versions of o1 in the future.

    "We expect these new reasoning capabilities will improve our ability to align models to human values and principles," the conclusion reads.

    "We believe o1 – and its successors – will unlock many new use cases for AI in science, coding, math, and related fields. We are excited for users and API developers to discover how it can improve their daily work."
