OSS

Open source AI innovation powering OctoAI

The foundation of OctoAI is systems and compilation technologies we’ve pioneered: XGBoost, Apache TVM, and MLC/MLC-LLM, giving you an enterprise system that runs in our SaaS or your private environment. The OctoAI Platform abstracts the complicated problems of running, managing and scaling GenAI, so developers can build the AI apps of tomorrow.

OSS AI innovation by OctoAI co-founders help with ML compilation, optimization, and ease deployment

Deployed at leading companies worldwide

APACHE TVM

The standard for deep learning compilation

Started as a research project at the SAMPL group of Paul G. Allen School of Computer Science & Engineering, University of Washington, TVM is now in incubation at The Apache Software Foundation (ASF). It enables two kinds of optimizations:

Computational graph optimization performs high0level operator fusion, layout transformation and memory management tasks
Tensor operator optimization and code generation layer that efficiently manages tensor operators

Learn more

Apache TVM illustration with a cube in the center of a cirlcular path for optimization, compilation, and deployment of ML models

MLC-LLM

Universal deployment solution for LLMs

The Machine Learning Compilation (MLC) for LLMs allows native deployment of any LLM with a native APIs including compiler acceleration. This allows models to run natively on any device hardware including: iOS, Android, laptops, and other consumer devices, all at near-native speeds. MLC-LLM was developed in collaboration between the Catalyst research group at Carnegie Mellon University (CMU), OctoAI, and the whole Apache TVM community.

Learn more

MLC-LLM chat using Llama 3 on device with no internet, the LLM wrote a poem about Machine Learning

XGBOOST

XGBoost for efficiency, flexibility, & portability

XGBoost is an optimized distributed gradient boosting library that provides parallel tree boosting that solves data science problems beyond billions of examples with speed and accuracy. It is the leading machine learning library for regression, classification, and ranking problems. This portable library runs on major platforms today like: OS X, Windows, and Linux, and is used in production at Microsoft, NVIDIA, and others.

Learn more

Your choice of models on our SaaS or in your environment

Run any model or checkpoint on our efficient, reliable, and customizable API endpoints. Sign up and start building in minutes.