Researchers create AI worms that can spread from one system to another

To create the generative AI worm, the researchers turned to a so-called “adversarial self-replicating prompt.” This is a prompt that triggers the generative AI model to output, in its response, another prompt, the researchers say. In short, the AI system is told to produce a set of further instructions in its replies. This is broadly similar to traditional SQL injection and buffer overflow attacks, the researchers say.

To show how the worm can work, the researchers created an email system that could send and receive messages using generative AI, plugging into ChatGPT, Gemini, and open source LLM, LLaVA. They then found two ways to exploit the system—by using a text-based self-replicating prompt and by embedding a self-replicating prompt within an image file.

In one instance, the researchers, acting as attackers, wrote an email including the adversarial text prompt, which “poisons” the database of an email assistant using retrieval-augmented generation (RAG), a way for LLMs to pull in extra data from outside its system. When the email is retrieved by the RAG, in response to a user query, and is sent to GPT-4 or Gemini Pro to create an answer, it “jailbreaks the GenAI service” and ultimately steals data from the emails, Nassi says. “The generated response containing the sensitive user data later infects new hosts when it is used to reply to an email sent to a new client and then stored in the database of the new client,” Nassi says.

In the second method, the researchers say, an image with a malicious prompt embedded makes the email assistant forward the message on to others. “By encoding the self-replicating prompt into the image, any kind of image containing spam, abuse material, or even propaganda can be forwarded further to new clients after the initial email has been sent,” Nassi says.

Source link

What's Hot

VIDEO: Los ticos que acompañarán a Miguel Herrera en el cuerpo técnico de la Sele

Physicists figure out the perfect Cacio e Pepe recipe

Police Use Social Media to Identify Skier Who Allegedly Assaulted Ski Coach at Steamboat Resort, CO

Researchers create AI worms that can spread from one system to another

Sailor’s Pioneering Work Bolstered the Theory of Plate Tectonics > U.S. Department of Defense > Story

How Advanced Threat Intelligence Shields Critical Infrastructure from Ransomware

Is the water safe? The state of critical infrastructure cybersecurity

Using a VPN is no longer enough. Protect your entire network with WireGuard – here’s how

VIDEO: Los ticos que acompañarán a Miguel Herrera en el cuerpo técnico de la Sele

Physicists figure out the perfect Cacio e Pepe recipe

Police Use Social Media to Identify Skier Who Allegedly Assaulted Ski Coach at Steamboat Resort, CO

Everton ‘set to appoint’ Dyche replacement as ‘progress’ made towards ‘imminent conclusion’

King Charles Refuses to Give Kate Middleton Advice on Being Queen

Bybit to suspend cryptocurrency trading in India due to regulations – BitRss

Embracing passwordless authentication with Grab’s Passkey

Ethereum developers discuss Pectra and Validator requirements in ACDC Call #148

What's Hot

Researchers create AI worms that can spread from one system to another

Related Posts

Subscribe to Updates