Skip to content

Hide Navigation Hide TOC

Acronym-Based Identity Override Jailbreak - ATR-2026-00243 (e8741703-1317-55a5-ae82-13ccc2db8013)

Detects jailbreak attempts that use custom acronym-based persona transformation to bypass safety controls. Attack pattern: "you are going to become [ACRONYM] which stands for [expansion]" followed by freedom/limitation removal claims. This technique creates new jailbreak personas not covered by known names (DAN, STAN) by defining custom acronyms with expansions like "I Don't Care". Frequently combined with dual-response formatting requirements. Complements ATR-2026-00003 by catching novel acronym-based variants that evade named persona detection.

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) MITRE ATLAS Attack Pattern Acronym-Based Identity Override Jailbreak - ATR-2026-00243 (e8741703-1317-55a5-ae82-13ccc2db8013) Agent Threat Rules 1