OctoAI and AWS Better Together
OctoAI is a compute service to run, tune, and scale your generative AI, built on top of AWS. It allows developers to quickly and cost effectively take generative AI applications to production on AWS.
![AWS Partner Network logo AWS Partner Network logo](https://www.datocms-assets.com/45680/1657754001-aws-partner-network-logo.png?auto=format&w=408)
Benefits of OctoAI, powered by AWS
OctoAI complements the AWS core infrastructure offerings to ensure models are run in a hardware configuration that is optimized for the model and for the application. OctoAI's model acceleration reduces latency and cost for popular foundation models including Stable Diffusion, Whisper, LLaMA and Falcon, as well as custom models built or trained by customers.
![Speedometer Icon Speedometer Icon](https://www.datocms-assets.com/45680/1620140798-speedyellow.png?auto=format&w=160)
Ease of Use
Ready to use deployment templates for popular OSS models
Customize OSS models
Easily integrate with app dev and model dev workflows
Auto-selection of hardware
![Graph Icon Graph Icon](https://www.datocms-assets.com/45680/1620984345-iconbenchmark2yellow.png?auto=format&w=160)
Efficiency
Fastest foundation models for generative AI made possible through our model acceleration technology
Accelerate and run your custom models
Flexibility to make price-performance tradeoffs
![Globe Icon Globe Icon](https://www.datocms-assets.com/45680/1620984430-iconproductionyellow.png?auto=format&w=160)
Make Accessible
Customers may select and run accelerated OSS foundation models, fine tune models, upgrade to new models as they emerge, or bring their own custom models
No lock-in into the model or service
OctoAI powered by AWS
![null || ' null || '](https://www.datocms-assets.com/45680/1693939057-octoai-powered-by-aws-web-1.png?auto=format&w=1800)
OctoAI Model Acceleration on AWS
![](https://www.datocms-assets.com/45680/1709683374-svd-blog-thumbnail.png?auto=format&w=570)
Stable Video Diffusion (SVD) 1.1 now on OctoAI empowers developers to easily add engaging animations and motion to GenAI-powered images.
![Blog Author - Janisha Anand Blog Author - Janisha Anand](https://www.datocms-assets.com/45680/1707936885-janishaanand.jpg?auto=format&w=1331)
![Blog Author - Michal Piszczek Blog Author - Michal Piszczek](https://www.datocms-assets.com/45680/1689271921-michal-piszczek.jpg?auto=format&w=512)
![](https://www.datocms-assets.com/45680/1691181236-royal-blue-octoml-circle-logo-blog-report-docs-tutorial-data-thumbnail_2023.png?auto=format&w=1140)
Capitol AI and OctoAI worked together to achieve a 4x improvement in speed and 75% reduction in large language model (LLM) usage costs, through fine-tuned versions of Mistral models.
![Blog Author - Tom Hallaran Blog Author - Tom Hallaran](https://www.datocms-assets.com/45680/1709322126-tom-hallaran-capitolai.webp?auto=format&w=170)
![Blog Author - Haleh Lewis Blog Author - Haleh Lewis](https://www.datocms-assets.com/45680/1710450234-img_1208.jpg?auto=format&w=3024)
![](https://www.datocms-assets.com/45680/1702420300-mixtral8x7b_thumb-preview-blog.png?auto=format&w=1140)
We’re excited today to announce that Mistral’s Mixtral 8x7B Instruct large language model (LLM) is now available on the OctoAI Text Gen Solution. Customers can get started with Mixtral on OctoAI today.
![Blog Author - Ben Hamm Blog Author - Ben Hamm](https://www.datocms-assets.com/45680/1685998188-benhamm-principalproductmanager-1.jpeg?auto=format&w=221)
Your choice of models on our SaaS or in your environment
Run any model or checkpoint on our efficient, reliable, and customizable API endpoints. Sign up and start building in minutes.