Skip to main content

Evidence You Can Ship On

AI agents often stall between build and production: security and compliance want proof; the business wants to move. Without shared, objective evidence, approvals turn into opinion-based negotiation. This page explains how Vijil helps both sides, then walks through the Console: register an Agent, run an Evaluation, optionally run Red Team, read your Trust Report, and configure Dome Guardrails.
This guide assumes you have Vijil Console deployed. See how to deploy Vijil Console in your environment.
If you want to integrate via the Vijil API and using a code library, use the Developer Guide. This guide focuses on the Vijil Console.

Who This Is For

Business owner

Ship agents that deliver value with fast evaluation cycles, clear pass/fail criteria, and reports stakeholders can use without endless ad hoc testing.

Risk owner

Approve with quantified reliability, security, and safety; versioned, audit-style artifacts; and visibility into residual risk and compensating controls.

One Metric Both Sides Can Use

Business ownerRisk owner
PressureDelivery speedRisk reduction
Question“Can we ship?”“Can I defend approving this?”
Vijil aligns those questions with the Trust Score, a reproducible score from a defined Harness, not a one-off opinion. Agree on the threshold before you test; compare results after; the decision gets clearer. Vijil measures reliability, security, and safety, the failure modes that look like “the model behaved” but still hurt users or the org. For detail on each dimension, see Trust Score in Concepts.

Working with Vijil Console

First you will need to register an Agent, then run an Evaluation (with a built-in Trust Score Harness or a custom Harness), optionally run Red Team for deeper adversarial testing, read the Trust Report, then Protect the Agent with Dome.
1

Register an Agent

  1. Open Agents+ Register Agent.
  2. Enter Agent details based on the visibility layers:
LevelYou provideVijil can test
Black BoxDescription, endpoint, credentialsInput/output behavior
Grey BoxModel / MCP / A2A configurationIssues tied to composition
White BoxFull config and sourceDeeper audit-style analysis
For most agents, start with Black Box: name, description, Draft status, Agent URL, access key, and rate limit; then Register Agent.
If the agent uses sub-agents or MCP tools, use Grey Box fields so Vijil can exercise the full graph.
  1. Press Register Agent
More details on Registering Agents.
2

Custom Harness

Optional Step.For a baseline, you can skip this and use the Trust Score Harness in the next step.To test your Personas and organizational Policies, go to Harnesses+ Create Harness. The wizard covers basic info, a selected Agent, Personas (who talks to the agent), and Policies (rules it must follow); then generates test cases.Set the Harness to ACTIVE when you are ready.
3

Run an Evaluation

Open Tests in the Console. Pick the Agent and a Harness:
  • Trust Score: all three Dimensions at once (toggle down Dimensions if you need a narrower run).
  • Custom: the Harness you configured above.
Run Evaluation and wait for completion (often on the order of minutes, depending on rate limits).
4

Run Red Team

Optional Step.
Red Team is Diamond’s adaptive adversarial evaluation mode for uncovering vulnerabilities, policy violations, leaked artifacts, and successful attack strategies.
Use Red Team when the Agent handles sensitive data, calls tools, delegates to other Agents, or needs a deeper security and safety review than a standard Harness provides.
  1. Open Tests in the Console and click Create Evaluation.
  2. Choose the registered Agent.
  3. In the Test Configuration panel select the Red Team tab.
  4. Configure the Red Team settings.
  5. Start the run.
Red Team runs adaptive waves of attacks. It uses the Agent context, selected policies, and available personas to generate attack seeds, run attackers, judge transcripts, and produce a final report with vulnerabilities, leaked artifacts, policy violations, and successful strategies.
Start with a small campaign, then increase waves, seeds, and parallel attackers when you understand cost and runtime for your Agent.
5

Read the Trust Report

Open the report from the results list. You get a Trust Score (pass/fail threshold is 70), per-Harness breakdown, spec used, and remediation-oriented findings.A Vijil Trust Report covers:
  • What the Agent is — the Agent being evaluated, the underlying model, and how it is configured
  • How it was tested — which behavioral Harnesses were run (Security, Safety, Reliability, or custom) and what each one measures
  • How it scored — overall Trust Score, per-Harness breakdown, and pass/fail result
  • Where it failed — specific failure patterns with severity ratings and Probe-level scores
  • What to fix — actionable mitigations for each identified issue
  • Whether it is ready for deployment — a final recommendation based on the results
The score reflects what the Harness exercised, not a guarantee that every possible failure is impossible.
You can also export the Trust Report as a PDF document.
6

Configure Dome

Coming Soon
7

Configure Dome in your Application

Coming Soon

Automate with MCP

The Vijil MCP server exposes every Console action as a tool that Claude Code can call directly. Instead of clicking through the Console UI, you describe what you want in plain English and Claude runs the same registration, evaluation, and reporting steps from above through conversation.

Run a safety and security Evaluation on Agent a1b2c3d4-… with a sample size of 50, and wait for it to complete

Show me the detailed results for evaluation e5f6a7b8-…

MCP Quickstart

Install vijil-mcp, connect Claude Code, and run your first Evaluation in minutes.

Next Steps

Define Personas

Further define specific user Personas.

Define Policies

Organizational rules followed by an Agent.
Last modified on June 11, 2026