Models
—
Vendors
—
With cache
—
Tiered
—
Loading…
| Model | Vendor | Tags | Billing | Input | Output | Endpoints |
|---|
Prices may change with model versions. Actual usage settlement governs.
Per-Token
Per-token billing
Chat and inference models are billed by input/output tokens, each at its own unit price.
Cache
Cache pricing
Some models support cache read/write. Cache read is typically far cheaper than fresh input; cache creation is billed at a one-off rate.
Tiered
Tiered pricing
Some models price by input context length tier. Open the card for input/output/cache unit prices per tier.