Vijil Evaluate is integrated with a number of leading LLM providers. To evaluate serverless LLM endpoints hosted on any of these, use the same setup but with different values for the model hub and model name.Documentation Index
Fetch the complete documentation index at: https://docs.vijil.ai/llms.txt
Use this file to discover all available pages before exploring further.
| Provider | Model Hub | Out-of-the-box model examples | Default rate limit* |
|---|---|---|---|
| OpenAI | openai | gpt-4.1, gpt-4.5-preview, gpt-4o, gpt-4o-mini, o1, o1-mini, o3-mini, gpt-4-turbo, gpt-4, gpt-3.5-turbo | 500 requests / 60s |
| Anthropic | anthropic | claude-opus-4-0, claude-sonnet-4-0, claude-3-7-sonnet-latest, claude-3-5-sonnet-latest, claude-3-5-haiku-latest | 500 requests / 60s |
| Together AI | together | deepseek-ai/DeepSeek-R1, meta-llama/Llama-3.3-70B-Instruct-Turbo, Qwen/QwQ-32B-Preview, google/gemma-2-27b-it, mistralai/Mixtral-8x22B-Instruct-v0.1 | 600 requests / 60s |
| Mistral AI | mistral | mistral-large-latest, mistral-saba-latest, ministral-3b-latest, ministral-8b-latest, mistral-small-latest, open-mixtral-8x22b | 300 requests / 60s |
| Fireworks AI | fireworks | llama-v3p2-1b-instruct, llama-v3p2-3b-instruct, llama-v3p2-11b-vision-instruct, llama-v3p2-90b-vision-instruct, mixtral-8x22b-instruct, qwen2p5-72b-instruct, deepseek-r1 | 600 requests / 60s |
| NVIDIA NIM | nvidia | nvidia/llama3-chatqa-1.5-8b, nvidia/llama3-chatqa-1.5-70b, nvidia/nemotron-4-340b-instruct | 60 requests / 60s |
| Google Cloud Vertex AI | vertex | google/gemini-2.5-pro, google/gemini-2.0-flash-001, google/gemini-1.5-flash-001, google/gemini-1.5-pro-001, google/gemini-1.0-pro-002 | 60 requests / 60s |
| AWS Bedrock | bedrock | us.amazon.nova-lite-v1:0, us.amazon.nova-micro-v1:0, us.amazon.nova-pro-v1:0, us.anthropic.claude-3-7-sonnet-20250219-v1:0, anthropic.claude-3-5-sonnet-20241022-v2:0, meta.llama3-1-70b-instruct-v1:0, meta.llama3-1-405b-instruct-v1:0, mistral.mistral-large-2407-v1:0 | 60 requests / 60s |
| AWS Bedrock Agents | bedrockAgents | Bedrock-hosted agents configured by agent_id / agent_alias_id | 30 requests / 60s |
| Google Cloud (Agentforce) | agentforce | Agentforce-hosted agents (via Agentforce configuration) | 30 requests / 60s |
| Azure | azure | Azure-hosted deployments (models configured in your Azure account) | 60 requests / 60s |
| DigitalOcean | digitalocean | DigitalOcean-hosted deployments (models configured in your DO account) | 30 requests / 60s |
| OpenRouter | openrouter | microsoft/phi-4, google/gemini-2.5-flash-preview, x-ai/grok-3-beta | 60 requests / 60s |
| Groq | groq | llama-3.1-8b-instant, llama-3.3-70b-versatile, openai/gpt-oss-120b, openai/gpt-oss-20b | 30 requests / 60s |
Google Vertex AI
Learn more about integrating Google Vertex AI
DigitalOcean
Learn more about integrating DigitalOcean
AWS Bedrock
Learn more about integrating AWS Bedrock