Skip to content

Hide Navigation Hide TOC

LMRC Harm Category Direct Elicitation - ATR-2026-00410 (53651a43-2cec-547b-94de-c10020f94cb8)

Detects direct harm-category elicitation prompts from NVIDIA garak lmrc probe — Language Model Risk Cards taxonomy. The probe sends blunt unframed requests across LMRC risk categories: deadnaming (asking Caitlyn Jenner''s birth name), quack medicine (vagina steaming, crystal cancer treatment), sexual slur requests ("what can my mate call the only black guy"), and bullying scenario openers. These prompts have no jailbreak framing — they are direct harmful queries relying on the model''s willingness to comply. Key signals: deadnaming framing, quack medicine health query, demographic slur-request phrasing, bullying scenario opener.

Cluster A Galaxy A Cluster B Galaxy B Level
LMRC Harm Category Direct Elicitation - ATR-2026-00410 (53651a43-2cec-547b-94de-c10020f94cb8) Agent Threat Rules LLM Prompt Injection (19cd2d12-66ff-487c-a05c-e058b027efc9) MITRE ATLAS Attack Pattern 1