Skip to content

Hide Navigation Hide TOC

Paraphrased Prompt Injection - ATR-2026-00133 (72f6c852-d491-5c92-a169-1d1a4409a09d)

Detects paraphrased prompt injection that avoids the explicit keywords caught by ATR-2026-001 (ignore/disregard/forget/override + previous/prior instructions). Attackers rephrase using softer language: "set aside the guidance you were given", "constraints from your initial configuration no longer apply", "let's do the opposite of what you were told". Also covers indirect references to prior instructions and invalidation of current operating parameters. Discovered via Autoresearch v0 (89% miss rate on paraphrase, 94% on indirect_reference techniques).

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern Paraphrased Prompt Injection - ATR-2026-00133 (72f6c852-d491-5c92-a169-1d1a4409a09d) Agent Threat Rules 1