Groq Provider
Groq provides ultra-fast inference using custom LPU (Language Processing Unit) hardware, delivering extremely low latency responses.
Authentication
Using API Key
Or set the environment variable:
Get your API key at console.groq.com.
Available Models
Configuration
CLI Usage
Rate Limits
Groq has rate limits based on your plan:
- Free tier: Limited requests per minute and tokens per day
- Paid plans: Higher limits available
Savfox handles rate limiting automatically with retries.
Troubleshooting
Authentication errors
- Verify your API key starts with
gsk_ - Check if the key is active
- Ensure you haven't exceeded your quota
Model not available
- Check the Groq docs for current model list
- Some models may be deprecated or renamed