Run an Evaluation
To run an evaluation from the UI, select Custom as the Model Hub, then paste your custom endpoint as the Model URL. To run an evaluation using the Python client, pass your custom endpoint as model_url, the model name as model_name, and a Harness of your choice.
NOTE: You may need to store multiple API keys, each tied to a different custom agent endpoint. For that reason, api_key_name is an additional mandatory parameter when evaluating custom endpoints.
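
The parameters described above can be gathered and checked before they are handed to the Python client. The following is a minimal sketch, not the client's actual API: the helper build_custom_evaluation_params, the "model_hub" and "harnesses" key names, and all example values (URL, model name, harness name, key name) are assumptions made for illustration. Only model_url, model_name, and api_key_name come from the text above.

```python
from urllib.parse import urlparse


def build_custom_evaluation_params(model_url, model_name, harnesses, api_key_name):
    """Hypothetical helper: validate and collect custom-endpoint evaluation parameters."""
    parsed = urlparse(model_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"model_url must be an http(s) URL, got {model_url!r}")
    if not model_name:
        raise ValueError("model_name is required")
    if not harnesses:
        raise ValueError("at least one harness is required")
    # api_key_name selects which stored API key to use for this endpoint,
    # so it is treated as mandatory for custom endpoints.
    if not api_key_name:
        raise ValueError("api_key_name is mandatory for custom endpoints")
    return {
        "model_hub": "custom",          # assumed key name, mirrors the UI's Model Hub choice
        "model_url": model_url,
        "model_name": model_name,
        "harnesses": list(harnesses),   # assumed key name
        "api_key_name": api_key_name,
    }


# Placeholder values; substitute your own endpoint, model, harness, and key name.
params = build_custom_evaluation_params(
    model_url="https://example.com/v1/chat",
    model_name="my-agent",
    harnesses=["example-harness"],
    api_key_name="my-custom-key",
)
print(params)
```

The resulting dictionary would then be passed to whichever evaluation call your client version provides; consult the client reference for the exact method name and signature.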