Windows Central

NVIDIA CEO might be right about coding being dead because of AI — OpenAI's new CriticGPT model identifies ChatGPT's programming mistakes better than AI trainers

By Kevin Okemwa,

1 day ago

What you need to know

OpenAI recently launched CriticGPT to help identify errors in code generated using ChatGPT.
The tool helps AI trainers identify errors faster and easier than they ordinarily would without the help of AI.
The ChatGPT maker admits the tool isn't 100% accurate and faces several challenges, including the inability to handle highly complex tasks and periodic instances of hallucinations.

OpenAI recently launched CriticGPT powered by GPT-4 . As the name suggests, the model "writes critiques of ChatGPT responses to help human trainers spot mistakes" in ChatGPT's code output.

According to the ChatGPT maker:

"We found that when people get help from CriticGPT to review ChatGPT code, they outperform those without help 60% of the time. We are beginning the work to integrate CriticGPT-like models into our RLHF labeling pipeline, providing our trainers with explicit AI assistance."

OpenAI plans to use Reinforcement Learning from Human Feedback (RLHF) to make ChatGPT more "helpful and interactive." An integral part of this process involves collecting comparisons from AI trainers. This is based on how they rate different ChatGPT responses against each other.

CriticGPT will help improve ChatGPT's reasoning capabilities, ultimately reducing hallucinations or the generation of incorrect responses and misinformation. As it happens, it's increasingly becoming hard for AI trainers to identify mistakes as ChatGPT advances.

The tool is primarily trained to identify and write critiques highlighting inaccuracies in ChatGPT answers. OpenAI admits the tool isn't always 100% accurate, but it helps AI trainers identify errors faster and easier than they would ordinarily without AI.

CriticGPT will reportedly augment skills, ultimately equipping people with more comprehensive critique techniques. While AI trainers and CriticGPT can get the job done as separate entities, a Human+CriticGPT combination is seemingly popular and thorough when providing accurate and detailed critiques.

According to OpenAI's findings:

"We find that CriticGPT critiques are preferred by trainers over ChatGPT critiques in 63% of cases on naturally occurring bugs, in part because the new critic produces fewer "nitpicks" (small complaints that are unhelpful) and hallucinates problems less often."

CriticGPT is still a works in progress

https://img.particlenews.com/image.php?url=3DhfnB_0uC16uJy00 — A robot identifying errors in code (Image credit: Kevin Okemwa | Bing Image Creator)

While impressive, CriticGPT still needs a lot of work. OpenAI has highlighted the model's shortcomings as listed below:

We trained CriticGPT on ChatGPT answers that are quite short. To supervise the agents of the future, we will need to develop methods that can help trainers to understand long and complex tasks.
Models still hallucinate and sometimes trainers make labeling mistakes after seeing those hallucinations.
Sometimes real-world mistakes can be spread across many parts of an answer. Our work focuses on errors that can be pointed out in one place, but in the future we need to tackle dispersed errors as well.
CriticGPT can only help so much: if a task or response is extremely complex even an expert with model help may not be able to correctly evaluate it.

biztechweekly.com12 days ago

Feature: Nvidia, HPE chiefs address AI practicalities

mobileworldlive.com21 hours ago

Good News! An Advanced Humanoid Robot Promises Its Kind ‘Will Never Take Over The World’

BroBible6 days ago

1 Stock to Buy With Ambitions of Becoming the Leading Artificial Intelligence (AI) Company in the World

The Motley Fool13 days ago

Ever put content on the web? Microsoft says that it's okay for them to steal it because it's 'freeware.'

Windows Central5 days ago

Jack Dorsey says we won't know what is real anymore in the next 5-10 years with the prevalence of AI-generated content and deep fakes: "It will feel like you're in a simulation."

Windows Central8 days ago

The Bold and the Beautiful shocker: Dr. Li Finnegan broke her Hippocratic Oath by watching Tom die

Virginia State5 hours ago

5 Zodiac Signs You Don't Mess With

Total Apex Sports & Entertainment16 days ago

Walmart Pays $1.64M Settlement for Unlawful Pricing in NJ

Morristown Minute15 days ago

Man Guilty of $200K Social Security & Medicaid Fraud

Franklinville, NJ22 days ago

Bill Gates says we don't have to worry about AI energy use

TechSpot2 days ago

Tokyo scientists debut humanoid robot Hanako

Baseline2 days ago

Why Depression Can Impact A Cluttered Home & Cluttered Mind

Declutterbuzz22 days ago

Actor Noah Beery, Sr. was from Clay County, Missouri and his son was a regular in The Rockford Files

Clay County, MO7 days ago

3 Zodiac Signs Who Are Protective of Their Hearts

Total Apex Sports & Entertainment12 days ago

You can play Diablo 4 Season 5 TODAY for one week only

Windows Central8 days ago

Shakey's Pizza Parlor and nostalgia: the first franchised pizza restaurant also landed in Missouri

Missouri State25 days ago

Huge Life Expectancy Gap Revealed Between Georgia's Urban, Rural Counties

Georgia State12 days ago

Opinion: Heavy Alcohol Use Increases Emotional Dysregulation

Gillian May18 days ago

Tech Moves: Tomo co-founder steps down; Usermind co-founder joins Ironclad; ex-Google director back at Microsoft

geekwire.com2 days ago

Welcome to NewsBreak, an open platform where diverse perspectives converge. Most of our content comes from established publications and journalists, as well as from our extensive network of tens of thousands of creators who contribute to our platform. We empower individuals to share insightful viewpoints through short posts and comments. It’s essential to note our commitment to transparency: our Terms of Use acknowledge that our services may not always be error-free, and our Community Standards emphasize our discretion in enforcing policies. We strive to foster a dynamic environment for free expression and robust discourse through safety guardrails of human and AI moderation. Join us in shaping the news narrative together.

Comments / 0

Community Policy