v0.1.0 — Now open source

One API for every AI provider

Self-hosted gateway that unifies Claude, OpenAI, Gemini & Ollama behind a single OpenAI-compatible API. Use your own subscriptions. Zero external dependencies.

Terminal
$ curl -fsSL https://raw.githubusercontent.com/
  openhinge/openhinge/main/install.sh | bash
$ cd ~/openhinge && npm start

  Admin token: a1b2c3d4e5f6...
  Dashboard: http://localhost:3700

Every major provider. One endpoint.

Use OAuth with your existing subscriptions or bring your own API keys.

C
Claude
Anthropic
Claude Code keychain or API key
O
OpenAI
ChatGPT / Codex
Codex CLI auth or API key
G
Gemini
Google
Google OAuth or API key
O
Ollama
Local
Auto-detected on localhost

Everything you need to run AI in production

Not just a proxy. A complete gateway with auth, routing, logging, and a dashboard.

OpenAI-Compatible API
Drop-in replacement. Works with any OpenAI SDK — Python, Node.js, curl. Zero code changes.
Soul-Based Routing
AI personas with dedicated system prompts, models, and endpoints. Route by URL, header, or key binding.
OAuth Login
Use your Claude Pro, ChatGPT Plus, or Gemini subscription. No per-token cost. Tokens auto-refresh.
API Key Management
Generate ohk_ keys with rate limits, soul binding, budget controls, and expiry. Revoke instantly.
Full Streaming
Server-Sent Events in OpenAI format. Stream responses from any provider with consistent output.
Request Logging
Every request logged with tokens, latency, and cost. Search, filter, sort, and paginate.
Fallback Routing
Providers have priority. If one fails or goes down, the next healthy provider handles the request.
Web Dashboard
Manage providers, souls, keys, logs, and settings from a modern dark-mode UI. No CLI needed.
Zero Dependencies
Embedded SQLite. No Redis, Postgres, or Docker required. One directory, one command to run.

Three commands. That's it.

Install, start, and you're live. No configuration files to write.

1
Install
curl -fsSL https://openhinge.com/install.sh | bash
2
Start
cd ~/openhinge && npm start
3
Use
curl http://localhost:3700/v1/chat/completions \
  -H "Authorization: Bearer ohk_YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"claude-sonnet-4-6","messages":[{"role":"user","content":"Hello"}]}'

On first start, OpenHinge auto-generates your config and prints your admin token.
Open http://localhost:3700 and paste it to access the dashboard.

How requests flow

Every request passes through auth, rate limiting, soul routing, and provider selection.

architecture
  Client Request
      │
      ├── Auth (validate ohk_ key)
      ├── Rate limiter (sliding window, per key)
      ├── Budget check (daily/monthly limits)
      │
      ├── Soul resolver (system prompt + model)
      ├── Provider selector (priority + health + fallback)
      │
      ├── Provider adapter
      │   ├── Claude   (OAuth or API key)
      │   ├── OpenAI   (OAuth or API key)
      │   ├── Gemini   (OAuth or API key)
      │   └── Ollama   (local)
      │
      ├── Stream / collect response
      └── Log usage (tokens, latency, cost)

Stop managing multiple AI SDKs

One gateway. One API. Every provider. Self-hosted on your machine in under a minute.

View on GitHub
npm install openhinge