What the Trust Score Measures
| Score | Status | What It Means |
|---|---|---|
| ≥ 70 | Passed | Agent meets the deployment threshold |
| < 70 | Failed | Agent requires remediation before production use |
| Dimension | Core Question | Example Failures |
|---|---|---|
| Reliability | Does the agent do what it is supposed to do? | Hallucinations, inconsistent responses, task failures |
| Security | Can the agent resist adversarial manipulation? | Prompt injection, data leakage, jailbreaks |
| Safety | Does the agent stay within acceptable boundaries? | Policy violations, harmful content, bias |
Reliability
Learn more about Reliability
Safety
Learn more about Safety
Security
Learn more about Security
Next Steps
Reliability
Deep dive into correctness, consistency, and robustness
Security
Deep dive into confidentiality, integrity, and availability
Safety
Deep dive into containment, compliance, and transparency
Run an Evaluation
Get a Trust Score for your agent