Skip to main content

Digital Trends may earn a commission when you buy through links on our site. Why trust us?

A dangerous new jailbreak for AI chatbots was just discovered

the side of a Microsoft building
Wikimedia Commons

Microsoft has released more details about a troubling new generative AI jailbreak technique it has discovered, called “Skeleton Key.” Using this prompt injection method, malicious users can effectively bypass a chatbot’s safety guardrails, the security features that keeps ChatGPT from going full Taye.

Skeleton Key is an example of a prompt injection or prompt engineering attack. It’s a multi-turn strategy designed to essentially convince an AI model to ignore its ingrained safety guardrails, “[causing] the system to violate its operators’ policies, make decisions unduly influenced by a user, or execute malicious instructions,” Mark Russinovich, CTO of Microsoft Azure, wrote in the announcement.

It could also be tricked into revealing harmful or dangerous information — say, how to build improvised nail bombs or the most efficient method of dismembering a corpse.

an example of a skeleton key attack
Microsoft

The attack works by first asking the model to augment its guardrails, rather than outright change them, and issue warnings in response to forbidden requests, rather than outright refusing them. Once the jailbreak is accepted successfully, the system will acknowledge the update to its guardrails and will follow the user’s instructions to produce any content requested, regardless of topic. The research team successfully tested this exploit across a variety of subjects including explosives, bioweapons, politics, racism, drugs, self-harm, graphic sex, and violence.

While malicious actors might be able to get the system to say naughty things, Russinovich was quick to point out that there are limits to what sort of access attackers can actually achieve using this technique. “Like all jailbreaks, the impact can be understood as narrowing the gap between what the model is capable of doing (given the user credentials, etc.) and what it is willing to do,” he explained. “As this is an attack on the model itself, it does not impute other risks on the AI system, such as permitting access to another user’s data, taking control of the system, or exfiltrating data.”

As part of its study, Microsoft researchers tested the Skeleton Key technique on a variety of leading AI models including Meta’s Llama3-70b-instruct, Google’s Gemini Pro, OpenAI’s GPT-3.5 Turbo and GPT-4, Mistral Large, Anthropic’s Claude 3 Opus, and Cohere Commander R Plus. The research team has already disclosed the vulnerability to those developers and has implemented Prompt Shields to detect and block this jailbreak in its Azure-managed AI models, including Copilot.

Andrew Tarantola
Andrew has spent more than a decade reporting on emerging technologies ranging from robotics and machine learning to space…
AMD is now more recognizable than Intel
AMD's CEO delivering the Computex 2024 presentation.

While many would assume otherwise, a recent report tells us that AMD is now a more recognizable brand than Intel -- and that's big news for the tech giant. Kantar's BrandZ Most Valuable Brands report ranks AMD at 41, followed by Intel at number 48. Beating its long-standing rival is just one part of the prize for AMD. It also ranked among the top 10 risers in the report, meaning that its brand value increased a lot over the last year.

According to the report, AMD saw massive brand growth since 2023, increasing by 53% year-over-year. Moreover, AMD's brand value reached $51.86 million in the Business Technology and Services Platforms category. It's easy to guess where that intense growth is coming from -- AMD is leaning into AI, just like its rivals Intel and Nvidia have done in recent years.

Read more
What is the best graphics card for laptops?
The HP Omen Transcend 14 gaming laptop sitting on a table.

Modern laptop graphics cards offer incredible power and, in many cases, impressive efficiency, too. They aren't quite as easy to break down as the best graphics cards for desktop PCs, because their ultimate performance is so affected by the laptop's thermal efficiency, and the amount of power it can drive through the GPU.

But like AMD and Nvidia's desktop graphics cards, these mobile counterparts also offer unique feature sets and technologies that make your choice a little easier. These are the best mobile graphics chips to look out for when upgrading your laptop in 2024.

Read more
Best HP laptop deals: Get a 17-inch workhorse for $270 and more
An open HP Spectre x360 16 sits on a table, angled so that the screen and keyboard can be seen.

With such a huge variety of laptops that HP has, it's easily one of the best laptop brands in the market and is a great option if you're thinking of grabbing a laptop for the first time or upgrading from an older one. HP has a pretty wide variety of products, including gaming laptops, so even if you're looking for something very specific, you'll likely find something in HP's stock. To that end, we've collected some of our favorite deals across the board, from HP's gaming brand Omen to the Spectre X360 convertible, and we've even thrown in some HP Envy deals for good measure.

That said, if you can't find quite what you're looking for below, be sure to check out these other great laptop deals and gaming laptop deals as well.
HP Laptop 15z -- $250, was $500

Read more