Skip to main content

Welcome to PolarGrid

PolarGrid is edge AI infrastructure that brings GPU-powered inference closer to your users. Run LLMs, voice AI, and embeddings with ultra-low latency across our global edge network.

Quickstart

Get your first API call working in 5 minutes

API Reference

OpenAI-compatible endpoints for easy migration

JavaScript SDK

npm install @polargrid/polargrid-sdk

Python SDK

pip install polargrid-sdk

Why PolarGrid?

Edge-First Architecture

Your inference requests are routed to the nearest GPU-equipped edge node, minimizing round-trip latency. Critical for real-time voice AI and interactive applications.

OpenAI-Compatible API

Drop-in replacement for OpenAI’s API. Migrate existing applications with minimal code changes.

Real-Time Voice

Sub-30ms latency for text-to-speech and speech-to-text, enabling natural conversational AI experiences.

Dynamic Model Loading

Load and unload models on-demand. Only pay for what you use.

Available Regions

RegionLocationID
TorontoCanada Centralyto-01
VancouverCanada Westyvr-02
MontrealCanada Eastyul-01

Getting Help