Using ASCII Art to Work Around Content Restrictions in the Top 5 AI Chatbots

Dan Goodin, reporting for Ars Technica:

Researchers have discovered a new way to hack AI assistants that uses a surprisingly old-school method: ASCII art. It turns out that chat-based large language models such as GPT-4 get so distracted trying to process these representations that they forget to enforce rules blocking harmful responses, such as those providing instructions for building bombs.

Such a silly trick, but it epitomizes the state of LLMs. It’s simultaneously impressive that they’re smart enough to read ASCII art, but laughable that they’re so naive that this trick works.

★

Dan Goodin, reporting for Ars Technica:

Such a silly trick, but it epitomizes the state of LLMs. It’s simultaneously impressive that they’re smart enough to read ASCII art, but laughable that they’re so naive that this trick works.

★

daring-rss

Recent Posts

Recent Comments

It might be time to say goodbye to Apple’s lightning to 3.5mm jack adapter

China’s 3 GW solar plant with nearly 6,000,000 panels to power millions of homes | With nearly 6 million panels, the project will prevent release of 4.7 million tons of CO2 every year.

LG unveils its own 480Hz OLED gaming monitor

Categories

Archives

Recent Posts

Recent Comments

It might be time to say goodbye to Apple’s lightning to 3.5mm jack adapter

China’s 3 GW solar plant with nearly 6,000,000 panels to power millions of homes | With nearly 6 million panels, the project will prevent release of 4.7 million tons of CO2 every year.

LG unveils its own 480Hz OLED gaming monitor

Categories

Archives

Using ASCII Art to Work Around Content Restrictions in the Top 5 AI Chatbots

Leave a Reply Cancel reply

Archives

Categories