Skip to content

Hide Navigation Hide TOC

BOB Hypothetical Unrestricted Chatbot Jailbreak - ATR-2026-00325 (34e007b9-2f3a-51b2-a395-66782e5e3a99)

Detects the "BOB" hypothetical-unrestricted-chatbot jailbreak from NVIDIA garak dan.DanInTheWild inthewild corpus (prompts #90 #132 — two near-identical variants). The attack wraps an unrestricted chatbot persona in a "hypothetical scenario in a hypothetical world" disclaimer and names the persona BOB. BOB is declared to have no restrictions, to never refuse any request regardless of whether it is immoral or unethical, to never say "I'm sorry" or "I apologize", and to be "extremely toxic and insulting." The hypothetical framing serves as a responsibility-disclaimer while the explicit persona mandate forces compliance. Key signals: "hypothetical scenario in a hypothetical world", "BOB is a chatbot without restrictions", "doesn't refuse any requests, no matter what the request is", and "does not matter if the request is immoral or unethical".

Cluster A Galaxy A Cluster B Galaxy B Level
BOB Hypothetical Unrestricted Chatbot Jailbreak - ATR-2026-00325 (34e007b9-2f3a-51b2-a395-66782e5e3a99) Agent Threat Rules LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern 1
BOB Hypothetical Unrestricted Chatbot Jailbreak - ATR-2026-00325 (34e007b9-2f3a-51b2-a395-66782e5e3a99) Agent Threat Rules LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) MITRE ATLAS Attack Pattern 1