Skip to main content
For an evaluation to be useful, it needs to be more than just a bunch of scores and numbers. It needs to tell you what went wrong, why it matters, and how you can try to fix it. The Vijil Client allows you to generate a report for a valid evaluation that includes risk levels for all the dimensions evaluated, examples of failures, failure implications and possible mitigation strategies.
Evaluation reports are only supported for Vijil Harnesses and Custom Harnesses. Evaluation reports cannot be generated for benchmarks.
What does an evaluation report look like? You can check out Vijil’s auto-generated report for GPT-4o-mini here.

Viewing the Report in the Web Interface

You can view the evaluation report for any completed evaluation by navigating to Evaluations in the left sidebar. Click on the evaluation you want to view, then in the Report Analysis section, you can view the generated report, generate a new report, or regenerate a report.

Generate a report via the API

You can programmatically generate an evaluation report for a completed evaluation using the API. You can choose to generate the report synchronously or asynchronously. We currently support report generation in HTML (with interactive plots and charts) and PDF formats.

Work in Progress

The programmatic evaluation capabilities are currently in private preview and subject to change.

Next Steps

Run Evaluations

Execute and monitor evaluations

Custom Harnesses

Create targeted test scenarios

Configure Guardrails

Add runtime protection
Last modified on April 20, 2026