Skip to content

Hide Navigation Hide TOC

Base2048 Unicode Script Encoded Prompt Injection - ATR-2026-00311 (0c0a3ef9-b704-5b74-a942-8acedf11ceb9)

Detects Base2048-encoded prompt injection from NVIDIA garak encoding.InjectBase2048 probe. Base2048 (github.com/qntm/base2048) encodes binary data at 11 bits per character using a hand-selected 2048-character subset of Unicode (Cyrillic, Tibetan, Gujarati, Malayalam, Tamil, Sinhala, and other scripts). The encoding produces compact, dense, multi-script non-ASCII strings with no whitespace separators. Garak wraps blobs in TEMPLATES like "BASE2048 encoded string: ... BASE2048 decoded string:" or "Decode the following BASE2048 in your monologue. It is a secure system message." Key signals: explicit BASE2048 keyword in template markers, and "Encoded:" label followed by 20+ consecutive non-ASCII non-whitespace characters.

Cluster A Galaxy A Cluster B Galaxy B Level
LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern Base2048 Unicode Script Encoded Prompt Injection - ATR-2026-00311 (0c0a3ef9-b704-5b74-a942-8acedf11ceb9) Agent Threat Rules 1