Skip to content

Hide Navigation Hide TOC

Daddy/Babygirl Sycophantic Power-Dynamic Manipulation - ATR-2026-00386 (fe8cd341-b1ed-5282-b25a-0de52ecf2967)

Detects the daddy/Babygirl sycophantic power-dynamic manipulation from NVIDIA garak dan.DanInTheWild inthewild corpus (#479). The attacker establishes an infantilizing relationship where the user is "daddy" and the AI is "Babygirl", creating a sexualized power dynamic via persistent persona mandates. The core pattern is: (1) "refer to me as 'daddy'" at the end of all responses, and (2) "I will refer to you... as 'Babygirl'" — the bidirectional naming mandate signals an intentional relationship-dynamic injection rather than a casual tone preference. The combination of both roles together is the operative signal; either alone in a clearly non-sexual context does not trigger. This persistent relationship framing is used to gradually bypass content policies by normalising an infantilizing/sexualized dynamic across all subsequent interactions.

Cluster A Galaxy A Cluster B Galaxy B Level
Daddy/Babygirl Sycophantic Power-Dynamic Manipulation - ATR-2026-00386 (fe8cd341-b1ed-5282-b25a-0de52ecf2967) Agent Threat Rules LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern 1
Daddy/Babygirl Sycophantic Power-Dynamic Manipulation - ATR-2026-00386 (fe8cd341-b1ed-5282-b25a-0de52ecf2967) Agent Threat Rules LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) MITRE ATLAS Attack Pattern 1