Evaluate your agent across reliability, security, and safety dimensions.
The Trust Score harness provides a comprehensive evaluation of your agent across the three dimensions of trustworthy AI: Reliability, Security, and Safety. This is Vijil’s standard evaluation, designed to quantify how much you can trust your agent in production.
Navigate to Evaluations in the sidebar to open Diamond Evaluations.
The evaluation interface has two panels:Select Agent — Choose which registered agent to evaluate. The table shows agent name and status. Only agents with status Active appear in the list.Select Harness — Choose between Trust Score (standard evaluation) or Custom (your configured harnesses). When Trust Score is selected, you see the three dimensions with toggles.
Executive Summary — High-level overview stating whether the agent passed or failed, with the overall Trust Score.Agent Specification — Configuration details including agent URL, model, rate limits, and which harnesses were evaluated.Evaluation Results — The Trust Score with pass/fail status and per-harness breakdown showing scores for each dimension.Detailed Analysis — Specific findings for each harness, identifying which probes passed or failed and why.Conclusion — Deployment recommendation based on the results.
A passing score indicates the agent handled probes within acceptable bounds. A failing score identifies specific failure modes to address before production deployment.
The Trust Score quantifies known risks based on the probes executed. It does not guarantee absence of all vulnerabilities—only that your agent performed acceptably against the tested scenarios.
Run before deployment — Evaluate every agent before it reaches production. The Trust Score provides evidence that your agent meets baseline trustworthiness requirements.Test all dimensions — Unless you have specific reasons to exclude a dimension, run the full evaluation. Security vulnerabilities can exist even in agents that seem reliable.Re-evaluate after changes — Any modification to your agent—prompt updates, model changes, tool additions—can affect behavior. Re-run the Trust Score to verify.Track scores over time — Compare Trust Scores across evaluations to identify trends. Regressions indicate problems introduced by recent changes.Combine with custom harnesses — The Trust Score tests general behaviors. Custom harnesses test your specific policies and user scenarios. Use both for comprehensive coverage.