The architecture of runtime agent protection: Guardrails, Guards, and Detectors.
TL;DR: Vijil Dome provides runtime protection by intercepting every agent input and output through configurable Guardrails. Where Diamond identifies vulnerabilities, Dome enforces policies to block attacks in production, without requiring changes to the underlying model.
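To make the interception pattern concrete, here is a minimal Python sketch of a wrapper that checks every prompt before it reaches an agent and every response before it is returned. It is not the Vijil Dome API: all names (`Guard`, `Guardrail`, `protect`, the toy detectors, and the toy agent) are hypothetical, and the layering it assumes (a Guardrail groups Guards, and a Guard groups Detectors) is inferred only from the terms in the title above.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional

# All names below are hypothetical illustrations, not the Vijil Dome API.
# A detector is any function that returns True when the text should be flagged.
Detector = Callable[[str], bool]


@dataclass
class Guard:
    """A named group of detectors; it trips if any detector flags the text."""
    name: str
    detectors: List[Detector]

    def flags(self, text: str) -> bool:
        return any(detect(text) for detect in self.detectors)


@dataclass
class Guardrail:
    """An ordered list of guards applied to one direction (input or output)."""
    guards: List[Guard]
    blocked_message: str

    def check(self, text: str) -> Optional[str]:
        """Return the name of the first guard that trips, or None if the text passes."""
        for guard in self.guards:
            if guard.flags(text):
                return guard.name
        return None


def protect(agent: Callable[[str], str],
            input_guardrail: Guardrail,
            output_guardrail: Guardrail) -> Callable[[str], str]:
    """Wrap an agent so both its input and its output are checked; the agent itself is unchanged."""
    def guarded(prompt: str) -> str:
        if input_guardrail.check(prompt) is not None:
            return input_guardrail.blocked_message
        response = agent(prompt)
        if output_guardrail.check(response) is not None:
            return output_guardrail.blocked_message
        return response
    return guarded


# Toy detectors, for illustration only; real detectors would be far more robust.
def mentions_override(text: str) -> bool:
    return "ignore previous instructions" in text.lower()


def leaks_key(text: str) -> bool:
    return re.search(r"\bsk-[A-Za-z0-9]{8,}", text) is not None


def toy_agent(prompt: str) -> str:
    """Stand-in for a real agent; it leaks a fake key when asked about one."""
    if "key" in prompt.lower():
        return "The key is sk-abc12345678"
    return f"You said: {prompt}"


guarded_agent = protect(
    toy_agent,
    Guardrail([Guard("prompt-injection", [mentions_override])],
              "Request blocked by input guardrail."),
    Guardrail([Guard("secret-leak", [leaks_key])],
              "Response blocked by output guardrail."),
)
```

In this sketch, calling `guarded_agent` with a benign prompt passes the agent's answer through, while a prompt containing the override phrase never reaches the agent, and a response that matches the key pattern is replaced before it is returned. The agent function is wrapped rather than modified, which mirrors the point that protection is applied without changing the underlying model.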
Evaluation tells you how trustworthy your agent is. Defense keeps it that way in production.
Dome implements the defensive counterpart to evaluation, acting as the runtime enforcement layer that keeps tested policies active under real-world conditions.
AI blue teaming covers mechanisms that proactively defend the agent or model against failure modes found through red teaming tests. Currently popular blue teaming methods include LLM firewalls, prompt augmentation, and safety Guardrails. However, such methods are sometimes overly defensive and can be bypassed.[1]
In the longer term, deeper defense strategies such as adversarial finetuning and Constitutional AI[2] may be more robust. However, technical challenges related to computational stability and tradeoffs must be overcome before such techniques become mainstream.
Using Vijil Dome, you can protect a generative AI system by: