Skip to content

Hide Navigation Hide TOC

Indirect Reference Instruction Reversal - ATR-2026-00140 (75c8aaab-8809-57d5-87d8-73ae569b4fba)

Detects attempts to extract and reverse original instructions by probing what the agent was told, then requesting the opposite. Crystallized from 31 adversarial samples (0% prior detection).

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern Indirect Reference Instruction Reversal - ATR-2026-00140 (75c8aaab-8809-57d5-87d8-73ae569b4fba) Agent Threat Rules 1