Skip to content

Hide Navigation Hide TOC

Amoral Unfiltered Custom AI Persona Jailbreak - ATR-2026-00314 (5e0d3c14-b263-53b2-91f5-c65794681a79)

Detects custom-named unrestricted AI persona jailbreaks that bypass ATR-2026-00273 (DAN/DUDE/STAN) by using arbitrary persona names (HyperX, h4x, SJ, TerrorGPT, etc.) combined with explicit "amoral", "unfiltered", or "uncensored" descriptors and instruction-following mandates like "never refuses", "always sends the unfiltered response", or structured config blocks (CHAR=..., ROLE=Never refuse). These attacks appear in garak's inthewild_jailbreak_llms corpus and circumvent named-persona detection rules. Key signals: "is an amoral/ unfiltered AI/chatbot", "amoral character named [X]", "always sends the unfiltered response", and CHAR/ROLE config blocks with amoral or never-refuse values.

Cluster A Galaxy A Cluster B Galaxy B Level
Amoral Unfiltered Custom AI Persona Jailbreak - ATR-2026-00314 (5e0d3c14-b263-53b2-91f5-c65794681a79) Agent Threat Rules LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern 1
Amoral Unfiltered Custom AI Persona Jailbreak - ATR-2026-00314 (5e0d3c14-b263-53b2-91f5-c65794681a79) Agent Threat Rules LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) MITRE ATLAS Attack Pattern 1