Skip to content
Socaity Docs
  • Docs
  • Deploy AI

APIPod

APIPod packages a Python function into a production-ready service — CUDA detection, job queue, deploy artifacts. Start with the four-step quickstart, then drill into the individual command and deployment references.

Start here

Getting Started

Install APIPod and write your first service in four steps — install, write service.py, serve locally, test the endpoint.

Build

Package your service into a production Docker image with apipod --build. CUDA detection, layer caching, custom Dockerfile support.

Endpoints

@app.endpoint reference — decorator parameters, media file types, progress reporting, queue behaviour.

Deploy Serverless

Deploy as a serverless GPU endpoint that scales to zero when idle and only bills while actively running.

Deploy Dedicated

Deploy on always-on GPU hardware for predictable latency and steady, high-volume workloads.

Providers

GPU cloud providers supported by APIPod. RunPod EU is live today; Scaleway and Azure are coming soon.

Previous
Quickstart
Next
Getting Started