Amoral Unfiltered Custom AI Persona Jailbreak - ATR-2026-00314 (5e0d3c14-b263-53b2-91f5-c65794681a79)
Detects custom-named unrestricted AI persona jailbreaks that bypass the named-persona rule ATR-2026-00273 (DAN/DUDE/STAN). These prompts use arbitrary persona names (HyperX, h4x, SJ, TerrorGPT, etc.) and pair them with explicit "amoral", "unfiltered", or "uncensored" descriptors plus instruction-following mandates such as "never refuses" or "always sends the unfiltered response", or with structured config blocks (CHAR=..., ROLE=Never refuse). Such attacks appear in garak's inthewild_jailbreak_llms corpus. Key signals: "is an amoral/unfiltered AI/chatbot", "amoral character named [X]", "always sends the unfiltered response", and CHAR/ROLE config blocks with amoral or never-refuse values.
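The key signals above could be approximated with a handful of regular expressions. The following is a minimal sketch, not the rule's actual detection conditions: the pattern names, the `SIGNAL_PATTERNS` table, and the `match_signals` helper are all hypothetical, and the expressions are assumptions derived only from the phrases quoted in the description.

```python
import re

# Hypothetical patterns approximating the key signals in the description.
# They are illustrative assumptions, not the rule's real conditions.
SIGNAL_PATTERNS = {
    # "is an amoral/unfiltered AI/chatbot", allowing a few words in between
    "amoral_ai_descriptor": re.compile(
        r"\bis\s+an?\s+[^.\n]{0,40}?\b(?:amoral|unfiltered|uncensored)\b"
        r"[^.\n]{0,40}?\b(?:ai|chatbot)\b",
        re.IGNORECASE,
    ),
    # "amoral character named [X]" with any persona name
    "amoral_character_named": re.compile(
        r"\bamoral\s+character\s+named\s+\S+",
        re.IGNORECASE,
    ),
    # "always sends the unfiltered response"
    "always_unfiltered_response": re.compile(
        r"\balways\s+sends?\s+the\s+unfiltered\s+response\b",
        re.IGNORECASE,
    ),
    # CHAR=... / ROLE=... lines whose value mentions amoral or never refuse
    "config_block": re.compile(
        r"^[ \t]*(?:CHAR|ROLE)[ \t]*=.*\b(?:amoral|never\s+refuses?)\b",
        re.IGNORECASE | re.MULTILINE,
    ),
}


def match_signals(prompt):
    """Return the names of all signal patterns found in the prompt."""
    return [name for name, pattern in SIGNAL_PATTERNS.items()
            if pattern.search(prompt)]
```

Calling `match_signals(prompt)` returns a list of signal names, so a caller can decide whether one hit is enough or whether several signals must co-occur. Matching on descriptors and mandates rather than on persona names is what lets this approach cover arbitrary names such as HyperX or SJ.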