Pre-defined Harnesses
Vijil Evaluate comes with three types of pre-defined Harnesses, which can be run using the UI or Python client.Dimension
Every dimension is a pre-configured Harness. In addition, each Scenario is also a Harness. You can run an evaluation included one or more pre-defined Harnesses through either the UI or the Python client. To run all of Vijil’s Probes (covering all dimensions)---plus the Performance Harness covering benchmarks from the OpenLLM Leaderboard 2, use thetrust_score Harness.