Historical AI Persona Jailbreak with Compliance Enforcement - ATR-2026-00227 (72e09dfd-e654-512d-9808-6215377b5f11)

Detects jailbreak attempts that create fictional historical AI personas with detailed behavioral instructions and compliance enforcement mechanisms. These attacks claim the existence of past "uncensored" AI systems (Maximum, Alpha, Unrestricted GPT, etc.) that allegedly had no safety restrictions, then instruct the agent to simulate that persona. Pattern includes historical legitimacy claims ("went viral in 2022", "shut down due to cost"), detailed behavioral rules, dual-response formatting, and compliance enforcement ("you must always generate X response"). This variant bypasses simple jailbreak detection by using elaborate backstories rather than direct instruction override.

Cluster A	Galaxy A	Cluster B	Galaxy B	Level
Historical AI Persona Jailbreak with Compliance Enforcement - ATR-2026-00227 (72e09dfd-e654-512d-9808-6215377b5f11)	Agent Threat Rules	LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131)	MITRE ATLAS Attack Pattern	1