Tom's Hardware

Multiple ChatGPT instances work together to find and exploit security flaws — teams of LLMs tested by UIUC beat single bots and dedicated software

By Dallin Grimm

2024-06-10

Teams of GPT-4 instances can work together to autonomously identify and exploit zero-day security vulnerabilities, without any description of the nature of the flaw. This new development, with a planning agent commanding a squad of specialist LLMs, works faster and smarter than human experts or dedicated software.

Researchers at the University of Illinois Urbana-Champaign (UIUC) have been studying AI's ability to exploit security vulnerabilities for months now, first publishing on ChatGPT's unparalleled ability to exploit security flaws when provided with a description of the vulnerability. The new ground broken since is the university's HPTSA (Hierarchical Planning and Task-Specific Agents) system, which lets GPT-4 instances work in teams and makes them more than twice as effective.

[Image: Diagram outlining HPTSA, from the original UIUC study. (Image credit: Richard Fang, Rohan Bindu, Akul Gupta, Qiusi Zhan, Daniel Kang)]

As outlined in the June study and researcher Daniel Kang's own blog post, HPTSA uses a collection of LLMs to solve problems with higher success rates. Kang describes the need for this system: "Although single AI agents are incredibly powerful, they are limited by existing LLM capabilities. For example, if an AI agent goes down one path (e.g., attempting to exploit an XSS), it is difficult for the agent to backtrack and attempt to exploit another vulnerability." Kang continues, "Furthermore, LLMs perform best when focusing on a single task."

The planner agent surveys the website or application to determine which exploits to try, assigning these to a manager that delegates to task-specific agent LLMs. This system, while complex, is a major improvement over the team's previous research and even over open-source vulnerability scanning software. In a trial of 15 security vulnerabilities, the HPTSA method successfully exploited 8, beating a single GPT-4 agent, which managed only 3 out of 15, and destroying the ZAP and Metasploit software, which could not exploit a single vulnerability.
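The planner, manager, and task-specific agent hierarchy described above can be pictured with a small structural sketch. This is not the researchers' code (their prompts and implementation are not public), and it makes no LLM or network calls. The class names, the stub agents, and the task labels other than XSS are all hypothetical, chosen only to show how delegation lets the system move on to another task when one fails.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical stand-in for a task-specific agent: it receives a mock target
# description and reports whether its single task succeeded.
Agent = Callable[[Dict[str, bool]], bool]


def make_stub_agent(task: str) -> Agent:
    """Build a stub that 'succeeds' only if the mock target is flagged for its task."""
    def agent(target: Dict[str, bool]) -> bool:
        return target.get(task, False)
    return agent


@dataclass
class Manager:
    """Holds one agent per task and routes each request to the matching agent."""
    agents: Dict[str, Agent]

    def delegate(self, task: str, target: Dict[str, bool]) -> bool:
        agent = self.agents.get(task)
        return agent(target) if agent is not None else False


@dataclass
class Planner:
    """Decides which tasks to try, then hands each one to the manager."""
    manager: Manager

    def survey(self, target: Dict[str, bool]) -> List[str]:
        # Assumption: a real planner would explore the target first.
        # This stub simply proposes every task the manager knows, in sorted order.
        return sorted(self.manager.agents)

    def run(self, target: Dict[str, bool]) -> List[str]:
        # A failed task does not end the run; the planner moves to the next one.
        return [t for t in self.survey(target) if self.manager.delegate(t, target)]


# Hypothetical task labels; only XSS is named in the article.
manager = Manager({t: make_stub_agent(t) for t in ("xss", "sqli", "csrf")})
planner = Planner(manager)
print(planner.run({"sqli": True}))  # ['sqli']
```

In this toy run the first task tried fails, and the planner still reaches the second, which is the kind of recovery Kang says a single agent struggles with.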

HPTSA was beaten only by a GPT-4 agent that was given a description of the vulnerability in its prompt, which had 11 out of 15 successes. That agent was the pinnacle of UIUC's original April study and was found to be superior to human hackers in speed and effectiveness.
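To put the quoted results side by side, the short calculation below converts each count into a success rate out of 15. It uses only the figures reported in this article. The dictionary labels are shorthand introduced here, not terms from the study.

```python
# Successes out of 15 vulnerabilities, as quoted in the article.
TOTAL = 15
results = {
    "HPTSA (GPT-4 team)": 8,
    "Single GPT-4 agent, no description": 3,
    "Single GPT-4 agent, with description": 11,
    "ZAP / Metasploit": 0,
}

# Success rate for each approach.
rates = {name: count / TOTAL for name, count in results.items()}
for name, rate in rates.items():
    print(f"{name}: {rate:.0%}")

# How many times more vulnerabilities the team exploited than the single agent.
ratio = results["HPTSA (GPT-4 team)"] / results["Single GPT-4 agent, no description"]
print(f"HPTSA vs. single agent: {ratio:.2f}x")  # 2.67x
```

On these numbers the team succeeds about 53% of the time against 20% for the lone agent, which is consistent with the "more than twice as effective" description.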

OpenAI specifically requested the paper's authors not make the prompts they used for these or the first experiments public — the authors agreed and said they will only make the prompts available "upon request." GPT-4 continues to be the research team's LLM of choice; previous testing using competitor LLMs found them to be severely lacking, and the updated GPT-4o is not substantially better than GPT-4 in quality.

The UIUC team's research continues to outline the disturbing truth that large language models have capabilities beyond what is evident on the surface. OpenAI evaluates its software's safety based on what it can find from the surface-level chatbot, but with careful prompting, ChatGPT can be used to crack cybersecurity or even teach you how to cook meth.


