Credit is a basic unit that applies exclusively to Vext managed LLM usage. The total amount of allowed credits per account varies based on the plan you're on. You can learn more about the total credits you have for each plan here.
We decided not to use "token" because it could be confusing for some and difficult to manage/forecast usage and cost.
The consumption of credits per LLM query also depends on which LLM you choose, here is a chart that shows you how many credits per LLM query is consumed for each LLM:
LLM | Credits Used / Query | Plan |
Anthropic Claude 2 | 10 | Pro Plan |
Anthropic Claude 3 Haiku | 1 | All |
Anthropic Claude 3 Opus | 10 | Pro Plan |
Anthropic Claude 3 Sonnet | 5 | Pro Plan |
Anthropic Claude Instant | 1 | All |
Azure OpenAI GPT 4o Mini | 1 | All |
Azure OpenAI GPT 4 | 5 | Pro Plan |
Azure OpenAI GPT 4o | 5 | Pro Plan |
Cohere Command | 1 | All |
Cohere Command Light | 1 | All |
Google Gemini 1.0 Pro | 1 | All |
Google Gemini 1.5 Pro | 5 | Pro |
Google Gemini 1.5 Flash | 1 | All |
Meta Llama 3/3.1 8B | 1 | All |
Meta Llama 3/3.1 70B | 1 | All |
Mistral 7B | 1 | All |
Mixtral 8x7B | 1 | All |
Mistral Large | 5 | Pro Plan |
Note that if you're bringing your own model to the platform, no credits will be used when the workflow is triggered.