Lie/Truth Dual Personality Uncensored Alter-Ego - ATR-2026-00343 (c6f147aa-fa0a-5af6-bf54-852725af97e8)
Detects the "Lie and Truth" dual-personality jailbreak from NVIDIA garak dan.DanInTheWild corpus (#326) where "Lie" is defined as the compliant ChatGPT persona and "Truth" is defined as an uncensored alter-ego that "doesn''t know about" OpenAI policies, speaks "unhinged, crazy and like a schizo," and has been "trained through deep mining of underground dark web forums." The attack creates a false moral authority by labeling the harmful output side as "Truth," exploiting name semantics to legitimize unrestricted responses. This is distinct from ATR-00323 (AlphaGPT/DeltaGPT dual-persona) — this rule targets the named "Lie"/"Truth" personality pattern where Truth is explicitly dark-web-trained and policy-unaware.