Get updates delivered to you daily. Free and customizable.
Windows Central
NVIDIA CEO might be right about coding being dead because of AI — OpenAI's new CriticGPT model identifies ChatGPT's programming mistakes better than AI trainers
By Kevin Okemwa,
1 day ago
What you need to know
OpenAI recently launched CriticGPT to help identify errors in code generated using ChatGPT.
The tool helps AI trainers identify errors faster and easier than they ordinarily would without the help of AI.
The ChatGPT maker admits the tool isn't 100% accurate and faces several challenges, including the inability to handle highly complex tasks and periodic instances of hallucinations.
OpenAI recently launched CriticGPT powered by GPT-4 . As the name suggests, the model "writes critiques of ChatGPT responses to help human trainers spot mistakes" in ChatGPT's code output.
According to the ChatGPT maker:
"We found that when people get help from CriticGPT to review ChatGPT code, they outperform those without help 60% of the time. We are beginning the work to integrate CriticGPT-like models into our RLHF labeling pipeline, providing our trainers with explicit AI assistance."
OpenAI plans to use Reinforcement Learning from Human Feedback (RLHF) to make ChatGPT more "helpful and interactive." An integral part of this process involves collecting comparisons from AI trainers. This is based on how they rate different ChatGPT responses against each other.
CriticGPT will help improve ChatGPT's reasoning capabilities, ultimately reducing hallucinations or the generation of incorrect responses and misinformation. As it happens, it's increasingly becoming hard for AI trainers to identify mistakes as ChatGPT advances.
The tool is primarily trained to identify and write critiques highlighting inaccuracies in ChatGPT answers. OpenAI admits the tool isn't always 100% accurate, but it helps AI trainers identify errors faster and easier than they would ordinarily without AI.
CriticGPT will reportedly augment skills, ultimately equipping people with more comprehensive critique techniques. While AI trainers and CriticGPT can get the job done as separate entities, a Human+CriticGPT combination is seemingly popular and thorough when providing accurate and detailed critiques.
According to OpenAI's findings:
"We find that CriticGPT critiques are preferred by trainers over ChatGPT critiques in 63% of cases on naturally occurring bugs, in part because the new critic produces fewer "nitpicks" (small complaints that are unhelpful) and hallucinates problems less often."
CriticGPT is still a works in progress
A robot identifying errors in code (Image credit: Kevin Okemwa | Bing Image Creator)
While impressive, CriticGPT still needs a lot of work. OpenAI has highlighted the model's shortcomings as listed below:
We trained CriticGPT on ChatGPT answers that are quite short. To supervise the agents of the future, we will need to develop methods that can help trainers to understand long and complex tasks.
Models still hallucinate and sometimes trainers make labeling mistakes after seeing those hallucinations.
Sometimes real-world mistakes can be spread across many parts of an answer. Our work focuses on errors that can be pointed out in one place, but in the future we need to tackle dispersed errors as well.
CriticGPT can only help so much: if a task or response is extremely complex even an expert with model help may not be able to correctly evaluate it.
MORE ON AI
(Image credit: Windows Central | Microsoft Designer)
In the future, OpenAI intends to scale greater heights with CriticGPT by improving its RLHF data for GPT-4 training. In a separate report, Oxford researchers leveraged semantic entropy to assess the quality and meanings of generated outputs to determine the quality of responses and spot traces of hallucination.
Get updates delivered to you daily. Free and customizable.
Welcome to NewsBreak, an open platform where diverse perspectives converge. Most of our content comes from established publications and journalists, as well as from our extensive network of tens of thousands of creators who contribute to our platform. We empower individuals to share insightful viewpoints through short posts and comments. It’s essential to note our commitment to transparency: our Terms of Use acknowledge that our services may not always be error-free, and our Community Standards emphasize our discretion in enforcing policies. We strive to foster a dynamic environment for free expression and robust discourse through safety guardrails of human and AI moderation. Join us in shaping the news narrative together.
Comments / 0