Evidence You Can Ship On
AI agents often stall between build and production: security and compliance want proof; the business wants to move. Without shared, objective evidence, approvals turn into opinion-based negotiation. This page explains how Vijil helps both sides, then walks through the Console: register an Agent, run an Evaluation, optionally run Red Team, read your Trust Report, and configure Dome Guardrails.
To integrate through the Vijil API with a code library instead, use the Developer Guide. This guide focuses on the Vijil Console.
Who This Is For
Business owner
Ship agents that deliver value with fast evaluation cycles, clear pass/fail criteria, and reports stakeholders can use without endless ad hoc testing.
Risk owner
Approve with quantified reliability, security, and safety; versioned, audit-style artifacts; and visibility into residual risk and compensating controls.
One Metric Both Sides Can Use
| | Business owner | Risk owner |
|---|---|---|
| Pressure | Delivery speed | Risk reduction |
| Question | “Can we ship?” | “Can I defend approving this?” |
Working with Vijil Console
The workflow has five steps: register an Agent, run an Evaluation (with the built-in Trust Score Harness or a custom Harness), optionally run Red Team for deeper adversarial testing, read the Trust Report, then protect the Agent with Dome.
Register an Agent
- Open Agents → + Register Agent.
- Enter Agent details for the visibility level you can provide:
| Level | You provide | Vijil can test |
|---|---|---|
| Black Box | Description, endpoint, credentials | Input/output behavior |
| Grey Box | Model / MCP / A2A configuration | Issues tied to composition |
| White Box | Full config and source | Deeper audit-style analysis |
- Press Register Agent.
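The visibility table above can be pictured as a small data structure. The following is a minimal sketch for illustration only: the class and field names are hypothetical and do not come from the Console or the Vijil API, and it assumes each level builds on the one before it, which the table does not state explicitly.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Visibility(Enum):
    BLACK_BOX = "Black Box"
    GREY_BOX = "Grey Box"
    WHITE_BOX = "White Box"


@dataclass
class AgentRegistration:
    # Hypothetical field names; the Console form may label these differently.
    description: str
    endpoint: str
    credentials: str
    composition: Optional[dict] = None  # model / MCP / A2A configuration
    source_path: Optional[str] = None   # full config and source

    def visibility(self) -> Visibility:
        """Infer the visibility level from what has been provided.

        Assumption: supplying source implies White Box, supplying only
        composition details implies Grey Box, otherwise Black Box.
        """
        if self.source_path is not None:
            return Visibility.WHITE_BOX
        if self.composition is not None:
            return Visibility.GREY_BOX
        return Visibility.BLACK_BOX
```

With only a description, endpoint, and credentials, `visibility()` returns `Visibility.BLACK_BOX`, which corresponds to testing input/output behavior alone.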
Custom Harness
Optional step. For a baseline, you can skip this and use the Trust Score Harness in the next step.
To test your own Personas and organizational Policies, go to Harnesses → + Create Harness. The wizard covers basic info, a selected Agent, Personas (who talks to the Agent), and Policies (rules it must follow), then generates test cases.
Set the Harness to ACTIVE when you are ready.
Run an Evaluation
Open Tests in the Console. Pick the Agent and a Harness:
- Trust Score: all three Dimensions at once (toggle off Dimensions if you need a narrower run).
- Custom: the Harness you configured above.
Run Red Team
Optional step. Use Red Team when the Agent handles sensitive data, calls tools, delegates to other Agents, or needs a deeper security and safety review than a standard Harness provides.
Red Team is Diamond’s adaptive adversarial evaluation mode for uncovering vulnerabilities, policy violations, leaked artifacts, and successful attack strategies.
- Open Tests in the Console and click Create Evaluation.
- Choose the registered Agent.
- In the Test Configuration panel, select the Red Team tab.
- Configure the Red Team settings.
- Start the run.
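The criteria at the top of this section for when Red Team is worth running can be read as a simple checklist. A minimal sketch, assuming hypothetical flag names that are not Console fields:

```python
from dataclasses import dataclass


@dataclass
class AgentProfile:
    # Hypothetical flags for illustration; not Console fields.
    handles_sensitive_data: bool = False
    calls_tools: bool = False
    delegates_to_agents: bool = False
    needs_deeper_review: bool = False


def should_run_red_team(profile: AgentProfile) -> bool:
    """Return True if any criterion named in this guide applies."""
    return any((
        profile.handles_sensitive_data,
        profile.calls_tools,
        profile.delegates_to_agents,
        profile.needs_deeper_review,
    ))
```

Any single criterion is enough: an Agent that only calls tools already qualifies for a Red Team run under this reading.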
Read the Trust Report
Open the report from the results list. You get a Trust Score (pass/fail threshold is 70), per-Harness breakdown, spec used, and remediation-oriented findings.
A Vijil Trust Report covers:
- What the Agent is — the Agent being evaluated, the underlying model, and how it is configured
- How it was tested — which behavioral Harnesses were run (Security, Safety, Reliability, or custom) and what each one measures
- How it scored — overall Trust Score, per-Harness breakdown, and pass/fail result
- Where it failed — specific failure patterns with severity ratings and Probe-level scores
- What to fix — actionable mitigations for each identified issue
- Whether it is ready for deployment — a final recommendation based on the results
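To make the pass/fail reading concrete, here is a minimal sketch that applies the threshold of 70 to an overall Trust Score and flags Harnesses that fall below it. The function name and input shape are hypothetical, the scores in the usage note are made up, and two points are assumptions rather than documented behavior: that a score of exactly 70 passes, and that per-Harness scores use the same scale as the overall score.

```python
from typing import Dict, List

PASS_THRESHOLD = 70  # pass/fail threshold stated in this guide


def summarize(trust_score: float, per_harness: Dict[str, float]) -> dict:
    """Summarize a report: overall pass/fail plus Harnesses to review.

    Assumption: a score equal to the threshold counts as a pass.
    """
    below: List[str] = sorted(
        name for name, score in per_harness.items() if score < PASS_THRESHOLD
    )
    return {
        "passed": trust_score >= PASS_THRESHOLD,
        "harnesses_below_threshold": below,
    }
```

For example, `summarize(74.0, {"Security": 81.0, "Safety": 66.0, "Reliability": 75.0})` reports an overall pass while still pointing at Safety, which is the kind of residual risk a risk owner would want to see alongside the headline result.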
Automate with MCP
The Vijil MCP server exposes every Console action as a tool that Claude Code can call directly. Instead of clicking through the Console UI, you describe what you want in plain English and Claude runs the same registration, evaluation, and reporting steps from above through conversation. For example:
- Run a safety and security Evaluation on Agent a1b2c3d4-… with a sample size of 50, and wait for it to complete
- Show me the detailed results for evaluation e5f6a7b8-…
MCP Quickstart
Install vijil-mcp, connect Claude Code, and run your first Evaluation in minutes.
Next Steps
Define Personas
Further define specific user Personas.
Define Policies
Organizational rules followed by an Agent.