Lunar Platform
Lunar offers a complete set of tools for developers who want to distill, route, and deploy AI models. Our platform provides:

Lunar SDK
Python & TypeScript SDK for LLM inference with intelligent routing, fallbacks, cost tracking, and built-in evaluations.
GPU Instances
Deploy and run open-source models on dedicated NVIDIA GPUs, from L4 to H200.
Platform Features
Lunar SDK: OpenAI-Compatible LLM Access
Access 12+ LLM providers through a single API. Features smart routing with AI task classification, automatic fallbacks, per-request cost tracking, and a comprehensive evaluation framework with 15+ built-in scorers.
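The automatic-fallback behavior can be sketched in a few lines: try providers in priority order and return the first successful response. This is a minimal illustration of the pattern, not the actual Lunar SDK API; the function names and provider callables below are hypothetical stand-ins.

```python
def complete_with_fallback(prompt, providers):
    """Try each (name, callable) provider in order; return the first success.

    A production router would also classify the task, filter for
    retryable errors, and record per-request cost.
    """
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:
            errors.append((name, exc))
    raise RuntimeError(f"all providers failed: {errors}")


# Stand-in providers for illustration: the first one always fails,
# so the router falls back to the second.
def flaky_provider(prompt):
    raise TimeoutError("upstream timeout")


def backup_provider(prompt):
    return f"echo: {prompt}"


name, reply = complete_with_fallback(
    "hello", [("primary", flaky_provider), ("backup", backup_provider)]
)
# name is "backup", reply is "echo: hello"
```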
GPU Instances: Deploy Any Model
Run open-source models like LLaMA, Qwen, DeepSeek, and more on dedicated GPU instances. Choose from 6 tiers ranging from NVIDIA L4 (24GB) to H200 clusters (1128GB).
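A quick back-of-the-envelope calculation shows why tier sizing tracks model precision: weight memory is roughly parameter count times bits per weight. The sketch below is illustrative arithmetic (it ignores KV cache and activation overhead, which is why real deployments want the headroom the tiers provide).

```python
def weight_memory_gb(params_billions, bits_per_weight):
    """Approximate GPU memory for model weights alone, in GB.

    Excludes KV cache and activation memory, which add real overhead
    on top of this figure.
    """
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9


fp16 = weight_memory_gb(70, 16)  # 140 GB: needs a 192 GB tier for 70B FP16
int4 = weight_memory_gb(70, 4)   # 35 GB: a 96 GB tier fits 70B INT4
```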
Get Started
- Python
- TypeScript
- REST API
- GPU Instances
- CLI
Pricing
| Tier | GPU | VRAM | Price | Best For |
|---|---|---|---|---|
| XS | 1x L4 | 24GB | ~$0.20/h | 7B-13B models |
| S | 1x L40S | 48GB | ~$0.60/h | 13B-34B models |
| M | 4x A10G | 96GB | ~$1.80/h | 70B INT4 |
| L | 4x L40S | 192GB | ~$3.50/h | 70B FP16 |
| XL | 8x A100 | 320-640GB | ~$12/h | 180B models |
| XXL | 8x H100/H200 | 640-1128GB | ~$20-30/h | 405B models |
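For budgeting, the hourly rates above translate directly into a monthly estimate. A minimal sketch, using the approximate prices from the table (the dictionary and function here are illustrative, not part of the SDK):

```python
# Approximate hourly rates from the pricing table above (USD).
TIER_HOURLY = {"XS": 0.20, "S": 0.60, "M": 1.80, "L": 3.50, "XL": 12.00}


def monthly_cost(tier, hours_per_day=24, days=30):
    """Estimated monthly cost in USD for an instance of the given tier."""
    return TIER_HOURLY[tier] * hours_per_day * days


# An always-on L tier instance: 3.50 * 24 * 30 = 2520 USD/month.
always_on_l = monthly_cost("L")
```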
Full Pricing Details
View complete pricing information