List evaluation summaries for a team with optional filters and pagination.
Args: team_id: Team ID to list summaries for (required) agent_id: Optional filter by agent ID status: Optional filter by status (running, completed, failed, cancelled) harness_type: Optional filter by harness type tested_by: Optional filter by evaluation tool limit: Max number of results (default 10, max 100) offset: Number of results to skip for paging claims: JWT claims with user and team info summary_repo: Evaluation summary repository
Returns: Paginated list of evaluation summaries with total count
Raises: HTTPException: 401 if unauthorized, 403 if no permission
Team ID to list summaries for
Filter by agent ID
Filter by status (running, completed, failed, cancelled) Statuses for an evaluation run (mirrors the workflow lifecycle).
running, completed, failed, cancelled Filter by harness type (standard or custom)
standard, custom Filter by tool that ran the evaluation Tools that can produce evaluation results.
diamond, promptfoo, garak, pyrit Maximum number of results to return
1 <= x <= 100Number of results to skip for paging
x >= 0