Provider Setup¶
AMP uses external AI providers for content generation. At least one LLM provider is required.
LLM Providers¶
Anthropic Claude (Recommended)¶
Supported models: - claude-3-opus (highest quality) - claude-3-sonnet (balanced) - claude-3-haiku (fastest)
OpenAI¶
Supported models: - gpt-4-turbo - gpt-4 - gpt-3.5-turbo
Google Gemini¶
Supported models: - gemini-pro - gemini-pro-vision
Image Providers¶
OpenAI DALL-E¶
Stability AI¶
Replicate¶
Provider Configuration¶
Default Provider¶
Set default LLM provider:
Per-Tenant Configuration¶
Configure via API:
curl -X PUT https://api.amp.dev/v1/tenants/{id} \
-H "Authorization: Bearer $AMP_API_KEY" \
-d '{
"config": {
"provider_routing": {
"llm_preference": "claude",
"llm_fallback": ["openai"],
"image_preference": "openai"
}
}
}'
Provider Fallback¶
When primary provider fails, AMP automatically tries fallbacks:
graph LR
A[Request] --> B{Claude}
B -->|Success| C[Response]
B -->|Fail| D{OpenAI}
D -->|Success| C
D -->|Fail| E{Gemini}
E -->|Success| C
E -->|Fail| F[Error] Fallback Configuration¶
Cost Management¶
Tracking¶
AMP tracks provider costs per request:
Limits¶
Set spending limits:
curl -X PUT https://api.amp.dev/v1/tenants/{id} \
-d '{
"config": {
"cost_limits": {
"monthly_limit_cents": 100000,
"alert_threshold_percent": 80
}
}
}'
Rate Limits¶
Provider Rate Limits¶
| Provider | Requests/Minute | Tokens/Minute |
|---|---|---|
| Claude | 60 | 100,000 |
| OpenAI | 60 | 90,000 |
| Gemini | 60 | 120,000 |
Handling Rate Limits¶
AMP automatically: - Queues requests when approaching limits - Backs off with exponential delay - Falls back to alternate providers
Self-Hosted LLMs¶
Ollama¶
vLLM¶
Configuration¶
Monitoring¶
Health Checks¶
Usage Statistics¶
curl https://api.amp.dev/v1/analytics/costs?breakdown=provider \
-H "Authorization: Bearer $AMP_API_KEY"
Troubleshooting¶
Provider Unavailable¶
Check provider status pages: - Anthropic Status - OpenAI Status - Google Cloud Status
Rate Limit Errors¶
{
"error": {
"code": "provider_rate_limited",
"message": "Claude rate limit exceeded",
"retry_after": 60
}
}
Solutions: - Wait for rate limit reset - Enable fallback providers - Upgrade provider plan
Authentication Errors¶
Solutions: - Verify API key is correct - Check key hasn't expired - Ensure key has required permissions