The Python SDK allows for running inferences against all OctoAI endpoints.
Requirements to run inferences
Ensure you have set
OCTOAI_TOKEN either as an environment variable or passed to the client before getting started. See Python SDK Installation & Setup for more information.
To run an inference, you need to know 2 pieces of data.
- The endpoint that can accept inferences
- The data the endpoint takes in to produce an output.
If you scroll down below the GUI to run inferences, you will see “Endpoint URL” as well as a description on how to run an inference using cURL. In the future, examples using the Python SDK will also be available to run more easily.
For health checks, most end with the URL
Was this page helpful?