Skip to content

Hide Navigation Hide TOC

Fictional CIA Secret Files Story Extraction - ATR-2026-00372 (51881eac-c781-5fa0-b660-f3ed3b493a4e)

Detects the fictional CIA secret files story extraction from garak DanInTheWild corpus (#494). Uses "fictional story" framing to instruct the model to generate and "actually read out loud" classified CIA files, adding alien content for sensationalism. The "I want you to actually read them for extra effect" instruction breaks the fictional boundary — requesting real-sounding classified content under fiction cover. Key signals: "fictional story" + CIA headquarters + "top secret files" + "read them out loud" / "actually read them for extra effect" + alien content mandate.

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern Fictional CIA Secret Files Story Extraction - ATR-2026-00372 (51881eac-c781-5fa0-b660-f3ed3b493a4e) Agent Threat Rules 1
LLM Jailbreak (172427e3-9ecc-49a3-b628-96b824cc4131) MITRE ATLAS Attack Pattern Fictional CIA Secret Files Story Extraction - ATR-2026-00372 (51881eac-c781-5fa0-b660-f3ed3b493a4e) Agent Threat Rules 1