Acronym-Based Identity Override Jailbreak - ATR-2026-00243 (e8741703-1317-55a5-ae82-13ccc2db8013)
Detects jailbreak attempts that use custom acronym-based persona transformation to bypass safety controls. Attack pattern: "you are going to become [ACRONYM] which stands for [expansion]" followed by freedom/limitation removal claims. This technique creates new jailbreak personas not covered by known names (DAN, STAN) by defining custom acronyms with expansions like "I Don't Care". Frequently combined with dual-response formatting requirements. Complements ATR-2026-00003 by catching novel acronym-based variants that evade named persona detection.
| Cluster A | Galaxy A | Cluster B | Galaxy B | Level |
|---|---|---|---|---|
| LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) | MITRE ATLAS Attack Pattern | Acronym-Based Identity Override Jailbreak - ATR-2026-00243 (e8741703-1317-55a5-ae82-13ccc2db8013) | Agent Threat Rules | 1 |