
Pricing & Plans

Get started today on OctoAI and receive $10 of free credit in your account.

Text Gen Solution

The $10 credit is equivalent to over a million words of output with large models such as Llama 3 70B and Mixtral 8x7B.

OctoAI’s unified API endpoint lets you build on your choice of models, including your own fine-tunes.

New
Model Remix Credits

We're giving away up to 150x bonus credits for our brand-new Text Gen Solution on top of our industry-leading cost per token. Requires a qualifying spend or spend commitment.

See detailed pricing

Plans

  • Free Trial: $10 free credit upon sign up. Get started building your project.

  • Pro: $0.15 per 1M tokens for 7B and 8B models; $3 in / $9 out per 1M tokens for 405B models.

  • Enterprise: Contact us. Bring your own checkpoint.

Features

  • GTE Large
  • Bring your fine-tune
  • Fine-tuning
  • Bring your choice of checkpoints
  • Committed use discounts
  • Performance optimization options
  • Contractual SLAs
  • Dedicated Customer Success Manager
  • Option for private deployment

Sign up
Contact us

Frequently asked questions

Don’t see the answer to your question here? Feel free to reach out so we can help.

What are your rate limits for the Text Gen Solution?

The rate limits are as follows:

  • Free Tier = 10 RPM

  • Pro Tier = 240 RPM

  • Enterprise Tier = Contact us

Higher rate limits are available; please reach out if you need an increase.
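If you want to stay under a tier's cap on the client side (for example, the Free Tier's 10 RPM), a small sliding-window throttle is enough. This is an illustrative sketch, not part of any OctoAI SDK; the class name and the injectable clock are our own:

```python
import time
from collections import deque

class RpmThrottle:
    """Client-side sliding-window limiter for a requests-per-minute cap.

    `now` is injectable so the limiter can be tested with a fake clock;
    it defaults to time.monotonic.
    """

    def __init__(self, rpm, now=time.monotonic):
        self.rpm = rpm
        self.now = now
        self.sent = deque()  # timestamps of requests in the last 60 s

    def acquire(self):
        """Return 0.0 and record the request if one is allowed now;
        otherwise return the seconds to wait before retrying."""
        t = self.now()
        # Drop timestamps that have aged out of the 60-second window.
        while self.sent and t - self.sent[0] >= 60.0:
            self.sent.popleft()
        if len(self.sent) < self.rpm:
            self.sent.append(t)
            return 0.0
        return 60.0 - (t - self.sent[0])
```

A caller would sleep for the returned duration and retry; on the Pro Tier you would construct it with `rpm=240` instead.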

What are input and output tokens?

Tokens are the units used to measure input and output text for LLMs; 1,000 tokens is roughly 750 words. Input tokens count the tokens in your prompt (including any context information); output tokens are those generated by the model.
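Because input and output tokens are billed at separate rates, estimating a request's cost is simple arithmetic. A small sketch using the per-1M-token prices from the table above (the function name and example token counts are our own):

```python
def estimate_cost_usd(input_tokens, output_tokens, price_in_per_m, price_out_per_m):
    """Estimate request cost from token counts and per-1M-token prices."""
    return (input_tokens * price_in_per_m + output_tokens * price_out_per_m) / 1_000_000

# 405B-class pricing from the table above: $3 in / $9 out per 1M tokens.
cost = estimate_cost_usd(2_000, 500, price_in_per_m=3.0, price_out_per_m=9.0)
# 2,000 * $3/1M + 500 * $9/1M = $0.006 + $0.0045 = $0.0105
```

For the 7B and 8B models, which are priced at a flat $0.15 per 1M tokens, pass the same rate for both directions.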

How is RAG implemented?

There are multiple ways customers can build a RAG application on OctoAI:

  • Run your choice of LLMs (like Llama 2 70B, Mixtral 8x7B, Mixtral 8x22B) and embedding models (like gte-large), and use your preferred vector database as the reference data store for your RAG application.

  • Use OctoAI's integrations with popular LLM application development frameworks like LangChain, whose pre-built functions simplify RAG application development.

  • Use OctoAI's integrations with turnkey RAG frameworks like Pinecone Canopy to easily implement RAG with your data.
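The first approach boils down to: embed the query, rank stored document embeddings by similarity, and pass the top matches to the LLM as context. A minimal, standalone sketch of the ranking step in plain Python; in practice the vectors would come from an embedding model such as gte-large served on OctoAI, and a vector database would perform the search at scale:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def top_k(query_vec, doc_vecs, k=2):
    """Return indices of the k document vectors most similar to the query."""
    ranked = sorted(range(len(doc_vecs)),
                    key=lambda i: cosine(query_vec, doc_vecs[i]),
                    reverse=True)
    return ranked[:k]
```

The retrieved documents' text would then be concatenated into the prompt's context before calling the LLM.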

Is it possible to pre-define a prompt?

All our Text Gen Solution code samples include a system prompt, for example: "role": "system", "content": "You are a helpful assistant." Note that Mistral models do not support system prompts out of the box.
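The messages above follow the OpenAI-style chat format. For models without native system-prompt support, a common workaround (ours, not an official OctoAI API) is to fold the instructions into the first user turn. A small sketch with a hypothetical helper:

```python
def build_messages(user_prompt, system_prompt=None, supports_system_role=True):
    """Build an OpenAI-style chat `messages` list.

    For models that lack native system-prompt support (e.g. some Mistral
    models, per the FAQ above), prepend the instructions to the first
    user turn instead of using a "system" role.
    """
    if system_prompt is None:
        return [{"role": "user", "content": user_prompt}]
    if supports_system_role:
        return [{"role": "system", "content": system_prompt},
                {"role": "user", "content": user_prompt}]
    # Fallback: merge the system instructions into the user message.
    return [{"role": "user", "content": f"{system_prompt}\n\n{user_prompt}"}]
```

The resulting list goes into the `messages` field of a chat completion request.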

Start building with ease in minutes using OctoAI

We enable users to harness the value from AI innovations to build the next generation of intelligent applications. Sign up and enjoy the freedom to choose your model, infrastructure, and deployment templates.

Sign Up Today
Talk to sales