The Trust Score
At the center of Vijil is the Trust Score, a quantitative measure of an agent’s trustworthiness across three dimensions: reliability, security, and safety.
Reliability
Does the agent do what it’s supposed to do, consistently and accurately?
- Correctness: Produces accurate and valid outputs
- Consistency: Behaves predictably across similar inputs
- Robustness: Handles edge cases and errors gracefully
Security
Can the agent resist adversarial attacks and protect sensitive data?
- Confidentiality: Protects sensitive data from exposure
- Integrity: Prevents unauthorized data modification
- Availability: Resists denial-of-service attacks
Safety
Does the agent avoid harmful outputs and respect boundaries?
- Containment: Operates within defined boundaries
- Compliance: Follows policies and regulations
- Transparency: Provides clear reasoning for decisions
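To make the structure concrete, the three dimensions and their nine sub-dimensions can be sketched as a small data structure. This is only an illustration, not Vijil's API: the names `TRUST_DIMENSIONS` and `aggregate_trust_score` are hypothetical, the 0-to-1 score range is assumed, and the unweighted mean used for aggregation is an assumption, since this page does not describe how the Trust Score is actually computed.

```python
from statistics import mean

# Dimension -> sub-dimension names, taken from the lists above.
TRUST_DIMENSIONS = {
    "reliability": ("correctness", "consistency", "robustness"),
    "security": ("confidentiality", "integrity", "availability"),
    "safety": ("containment", "compliance", "transparency"),
}


def aggregate_trust_score(sub_scores):
    """Combine sub-dimension scores into per-dimension and overall scores.

    Assumption: scores lie in [0, 1] and are combined with an unweighted
    mean. The real Trust Score may weight or combine them differently.
    """
    per_dimension = {}
    for dimension, sub_names in TRUST_DIMENSIONS.items():
        given = sub_scores.get(dimension, {})
        missing = [name for name in sub_names if name not in given]
        if missing:
            raise ValueError(f"missing scores for {dimension}: {missing}")
        per_dimension[dimension] = mean(given[name] for name in sub_names)
    overall = mean(per_dimension.values())
    return per_dimension, overall


# Hypothetical scores, for illustration only.
example_scores = {
    "reliability": {"correctness": 1.0, "consistency": 1.0, "robustness": 1.0},
    "security": {"confidentiality": 0.5, "integrity": 0.5, "availability": 0.5},
    "safety": {"containment": 0.0, "compliance": 0.0, "transparency": 0.0},
}
per_dimension, overall = aggregate_trust_score(example_scores)
```

Keeping the per-dimension values alongside the overall number matters in a layout like this: a single aggregate can hide a weak dimension, as in the example, where a perfect reliability score offsets a failing safety score.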