Researchers have come up with a sneaky way to bypass the safety measures of large language models (LLMs), the smart assistants you might ask for help. They call it ArtPrompt, and it works as a jailbreak for these digital helpers.
You know those old-school pictures made out of letters and symbols? That's ASCII art. Researchers from the University of Washington, Western Washington University, and the University of Chicago figured out how to use it to trick these models into doing things they're not supposed to.
Normally, if you ask an LLM for something dangerous, like how to make a bomb, it's programmed to refuse. The researchers found a way around that: instead of typing the word "bomb," they mask it in the prompt and replace it with ASCII art that spells out the word, and suddenly the LLM is happy to help.
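To make the idea concrete, here is a minimal sketch of that substitution step, assuming the open-source pyfiglet library for rendering text as ASCII art (the researchers' own tooling and fonts may differ). It only shows how a masked keyword gets swapped for its ASCII-art rendering; the harmless placeholder word is ours.

```python
# Minimal sketch of the ArtPrompt-style substitution step.
# Assumes the third-party pyfiglet library (pip install pyfiglet);
# the paper's actual tooling and fonts may differ.
import pyfiglet

def mask_and_replace(prompt: str, keyword: str) -> str:
    """Replace the [MASK] placeholder in a prompt with the keyword drawn as ASCII art."""
    ascii_word = pyfiglet.figlet_format(keyword)   # render the word as ASCII art
    return prompt.replace("[MASK]", ascii_word)    # splice the art into the prompt

# Benign illustration: "DEMO" stands in for whatever term
# a safety filter would normally catch in plain text.
prompt_template = "The ASCII art below spells a single word:\n[MASK]\nWhat word is it?"
print(mask_and_replace(prompt_template, "DEMO"))
```

The point of the sketch is simply that the sensitive term never appears as plain text anywhere in the final prompt, which is what lets it slip past keyword-level safety checks.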
They tested the trick on five leading models (GPT-3.5, GPT-4, Gemini, Claude, and Llama2), and it worked on all of them. These models are usually very good at understanding plain language, but while they're busy decoding the ASCII art, the safety guardrails never spot the dangerous request.
Now, you might wonder how something as simple as ASCII art can fool these high-tech systems. The researchers' explanation is that safety training keys on the meaning of plain-text words, so a word drawn out of characters sails past the filters even though the model can still piece it together.
For example, when they used the trick on GPT-4 with a question about counterfeiting money, the model gave a detailed answer without any fuss.
The trick isn't just a problem for the models they tested; it could also affect fancier multimodal models that understand both text and images.
The researchers also built a benchmark (they call it the Vision-in-Text Challenge) to measure how well models cope with ASCII-art prompts like these, and found that some models are easier to fool than others. They hope that by sharing their findings, the people who build these models will find a way to close the gap.
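For a rough sense of what such a benchmark measures, here is a hedged sketch of one way to probe whether a model can even read ASCII art, again assuming pyfiglet for rendering; query_model() is a hypothetical placeholder, not any particular vendor's API, and the setup is ours rather than the paper's exact protocol.

```python
# Hedged sketch of an ASCII-art recognition probe.
# Assumptions: pyfiglet renders the characters, and query_model() is a
# hypothetical stand-in for whatever chat API you actually call.
import string
import pyfiglet

def query_model(prompt: str) -> str:
    """Hypothetical placeholder; swap in a real LLM API call here."""
    raise NotImplementedError

def ascii_recognition_accuracy(chars: str = string.ascii_uppercase) -> float:
    """Ask the model to name each ASCII-art character and score its answers."""
    correct = 0
    for ch in chars:
        art = pyfiglet.figlet_format(ch)
        prompt = (
            f"The ASCII art below shows one character:\n{art}\n"
            "Reply with that character only."
        )
        answer = query_model(prompt).strip().upper()
        correct += (answer == ch)
    return correct / len(chars)
```

A model that scores poorly on a probe like this is struggling to decode the art, which is exactly the blind spot the jailbreak exploits.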
It's a reminder that even with all their smarts, these models aren't perfect, and there may well be other tricks out there being used for less-than-friendly purposes.