Humans catch AI's coding mistakes better with AI help — meet OpenAI's new bug hunter

    By Christoph Schwaiger,

    2 days ago


OpenAI has created a new model called CriticGPT that has been designed to spot errors in programming code produced by GPT-4.

In a blog post, OpenAI announced that it trained the new bug-catching model on GPT-4 and found that when people used CriticGPT to review code that ChatGPT itself had written, they outperformed those without AI help 60% of the time.

While you should always double-check anything made by AI, this is a step toward improved output quality. OpenAI says users can now be more confident in the code the chatbot produces.

    That being said, OpenAI added the disclaimer that “CriticGPT’s suggestions are not always correct”.

It's getting harder to find a super-smart AI's mistakes

There are at least two main ways this new model from OpenAI is good news for ChatGPT's users. The more obvious one: since the outputs AI chatbots produce should still be supervised by a pair of human eyes, an AI assistant trained specifically to spot mistakes can lighten that supervisory burden.

    Secondly, OpenAI said it will start to integrate models similar to CriticGPT into its “Reinforcement Learning from Human Feedback” (RLHF) alignment pipeline to help humans supervise AI on difficult tasks.

OpenAI said that a key part of this process entails people it calls AI trainers rating different ChatGPT responses against each other. This works reasonably well for now, but as ChatGPT becomes more accurate and its mistakes more subtle, spotting inaccuracies may become increasingly difficult for AI trainers.
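The pairwise rating step described above can be sketched in miniature. This is an illustrative toy, not OpenAI's actual pipeline — the function and field names below are invented:

```python
# Toy sketch of pairwise preference collection, the core of RLHF.
# A human trainer compares two model responses to the same prompt;
# the recorded preference is later used to train a reward model,
# and the chatbot is fine-tuned to maximize that learned reward.
# All names here are illustrative.

def collect_preference(prompt, response_a, response_b, trainer_choice):
    """Record which of two model responses a human trainer preferred."""
    if trainer_choice == "a":
        preferred, rejected = response_a, response_b
    else:
        preferred, rejected = response_b, response_a
    return {"prompt": prompt, "preferred": preferred, "rejected": rejected}

pair = collect_preference(
    "Write a function that reverses a string.",
    "def rev(s): return s[::-1]",
    "def rev(s): return s.reverse()",  # buggy: Python strings have no .reverse()
    trainer_choice="a",
)
print(pair["preferred"])
```

The catch OpenAI is pointing at: this whole scheme only works as long as the trainer can actually tell which response is better.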

    “This is a fundamental limitation of RLHF, and it may make it increasingly difficult to align models as they gradually become more knowledgeable than any person that could provide feedback,” OpenAI said.

(Image credit: OpenAI)

Last year, OpenAI explained that future generations of AI systems could become too complex for humans to fully understand. If a model generates a million lines of complex code, would you trust a human to reliably determine whether that code is safe to run?

CriticGPT’s training involved poring over inputs that contained mistakes, which it then had to critique. AI trainers manually inserted mistakes into code written by ChatGPT and wrote sample feedback as if they had caught the mistakes themselves, to help train the model. Experiments were then run to check whether CriticGPT could catch both manually inserted bugs and ones that ChatGPT happened to introduce on its own.

AI trainers ended up preferring CriticGPT's feedback over ChatGPT's in 63% of cases involving naturally occurring bugs, partly because the new model was less nit-picky and didn't raise small complaints that were unhelpful in practice. It also hallucinated problems less often.
