OpenAI + Anthropic compatible

One API for many AI models with IDR billing.

Genfity AI Gateway unifies API keys, credits, quotas, and model routing in one stable endpoint. Pay via QRIS or bank transfer—no USD credit card required.

1 endpoint·many models
Pay in IDR·no USD card
Production·failover & analytics
api.genfity.com
Authorization: Bearer sk_live...
OpenAI
/v1/chat/completions
live
Anthropic
/v1/messages
live
Embeddings
/v1/embeddings
live
route.select(model)
{ cost: "optimized", fallback: true }
usage.track(api_key)

Supported providers

OpenAI logoOpenAI
Anthropic logoAnthropic
Google Gemini logoGoogle Gemini
Meta logoMeta
Mistral AI logoMistral AI
xAI logoxAI
Perplexity logoPerplexity
Hugging Face logoHugging Face
Cohere logoCohere
OpenAI logoOpenAI
Anthropic logoAnthropic
Google Gemini logoGoogle Gemini
Meta logoMeta
Mistral AI logoMistral AI
xAI logoxAI
Perplexity logoPerplexity
Hugging Face logoHugging Face
Cohere logoCohere
OpenAI logoOpenAI
Anthropic logoAnthropic
Google Gemini logoGoogle Gemini
Meta logoMeta
Mistral AI logoMistral AI
xAI logoxAI
Perplexity logoPerplexity
Hugging Face logoHugging Face
Cohere logoCohere

Why Genfity

More than just an AI proxy.

Infrastructure built for Indonesian developers: pay in rupiah, local docs, WIB support team.

One API, all models

Switch between models by changing the model name string. No new setup required.

Sensible billing

Top up via QRIS, bank transfer, or e-wallets. Official invoices for businesses.

Production-ready

Rate limiting, monitoring, automatic failover, and built-in circuit breaker.

API compatibility

Drop-in for the SDK you already use.

Just change baseURL to Genfity. All methods, parameters, and response formats are identical.

OpenAI-compatible
POST /v1/chat/completions
client.chat.completions.create({ model, messages })
SDK-friendly endpoint dengan kontrol API key, usage, dan model.
Anthropic-compatible
POST /v1/messages
client.messages.create({ model, messages, max_tokens })
SDK-friendly endpoint dengan kontrol API key, usage, dan model.

Pricing

3 ways to pay, pick what fits.

Credits for exploration, Subscription for production, or PAYG for dynamic traffic.

Sold Out
Recommended
Subscription

7 Days Unlimited Token + Opus 4.7/4.8 PRO

Rp 200.000$20

7 Days

  • 5,000 requests / period
  • 1,000 requests/day (RPD)
  • Unlimited daily credit quota
  • Unlimited period credit quota
  • 120 requests/min
  • 6 concurrent
  • 600,000,000 tokens / period
  • Unlimited Token Input & Output
Recommended
Subscription

1 Days Claude + All Models

Rp 40.000$4

1 Day

  • 750 requests / period
  • 750 requests/day (RPD)
  • Unlimited daily credit quota
  • Unlimited period credit quota
  • 120 requests/min
  • 6 concurrent
  • Unlimited tokens
  • Unlimited Token Input & Output
Subscription

1 Days Unlimited Token Claude Model Lite

Rp 20.000$2

1 Days

  • 250 requests / period
  • 250 requests/day (RPD)
  • Unlimited daily credit quota
  • Unlimited period credit quota
  • 120 requests/min
  • 6 concurrent
  • Unlimited tokens
  • Unlimited Token Input & Output
Recommended
Subscription

7 Days Unlimited Token Opus 4.8/4.7 GPT 5.5

Rp 250.000$25

7 Days

  • 5,000 requests / period
  • 750 requests/day (RPD)
  • Unlimited daily credit quota
  • Unlimited period credit quota
  • 120 requests/min
  • 6 concurrent
  • Unlimited tokens
  • Unlimited Token Input & Output
Subscription

7 Days China Models

Rp 150.000$15

7 Days

  • Unlimited requests / period
  • No daily request limit
  • Unlimited daily credit quota
  • 7,000 credits / period
  • 30 requests/min
  • 6 concurrent
  • Unlimited tokens
  • Unlimited Token Input & Output
Sold Out
Subscription

30 Days Developer Lite

Rp 250.000$25

30 Days

  • 10,000 requests / period
  • 500 requests/day (RPD)
  • 3,000 credits/day
  • 30,000 credits / period
  • 120 requests/min
  • 6 concurrent
  • 1,000,000,000 tokens / period
  • Unlimited Token Input & Output
Sold Out
Subscription

30 Days Developer Pro

Rp 350.000$30

30 Days

  • 10,000 requests / period
  • 500 requests/day (RPD)
  • 4,500 credits/day
  • 45,000 credits / period
  • 120 requests/min
  • 6 concurrent
  • 1,000,000,000 tokens / period
  • Unlimited Token Input & Output

Supported Models

Models currently published in the gateway.

ModelContextCapabilityCopy
Auto
genfity/auto
200,000
ToolsVisionStream
Claude Haiku 4.5
genfity/claude-haiku-4.5
200,000
ToolsVisionStream
Claude Opus 4.6
genfity/claude-opus-4.6
1,000,000
ToolsVisionStream
Claude Opus 4.7
genfity/claude-opus-4.7
1,000,000
ToolsVisionStream
Claude Opus 4.8
genfity/claude-opus-4.8
1,000,000
ToolsVisionStream
Claude Sonnet 4.6
genfity/claude-sonnet-4.6
1,000,000
ToolsVisionStream
Deepseek 3.2
genfity/deepseek-3.2
128,000
ToolsStream
Deepseek v4 Flash
genfity/deepseek-v4-flash
1,000,000
ToolsStream
Deepseek v4 Pro
genfity/deepseek-v4-pro
1,000,000
ToolsStream
GLM 5
genfity/glm-5
128,000
ToolsStream
GLM 5.1
genfity/glm-5.1
202,752
ToolsStream
GPT 5.3 Codex
genfity/gpt-5.3-codex
400,000
ToolsVisionStream
GPT 5.4
genfity/gpt-5.4
400,000
ToolsVisionStream
GPT 5.4 Mini
genfity/gpt-5.4-mini
400,000
ToolsVisionStream
GPT 5.5
genfity/gpt-5.5
400,000
ToolsVisionStream
Gemini 3.1 Pro
genfity/gemini-3.1-pro
1,000,000
ToolsVisionStream
KIMI k2.5
genfity/kimi-k2.5
262,144
ToolsVisionStream
KIMI k2.6
genfity/kimi-k2.6
262,144
ToolsVisionStream
MiniMax M3
genfity/minimax-m3
1,000,000
ToolsVisionStream
Minimax M2.5
genfity/minimax-m2.5
1,000,000
ToolsVisionStream
Qwen3.6 Flash
genfity/qwen3.6-27b
262,144
ToolsVisionStream
Qwen3.6 Plus
genfity/qwen3.6-plus
1,000,000
ToolsVisionStream
Subagent
genfity/subagent
200,000
ToolsVisionStream
Xiaomi Mimo v2.5
genfity/mimo-v2.5
1,000,000
ToolsVisionStream
Xiaomi Mimo v2.5 Pro
genfity/mimo-v2.5-pro
1,000,000
ToolsVisionStream

Features

Production-ready, out of the box.

Everything you need to run AI in production—without building it yourself.

Rate Limit per API Key

RPM and TPM per key, isolated per environment.

Real-time Analytics

Live dashboard: cost, tokens, latency per model.

Automatic Failover

Automatic fallback if a provider has downtime.

Streaming Support

SSE streaming for all chat models.

Tools / Function Calling

Compatible with OpenAI/Anthropic tool use spec.

Indonesian Support

WIB support team via WhatsApp & email.

FAQ

Frequently asked questions.

Key details before using AI Gateway for your product.

Can it really be cheaper?

Yes. Use affordable models for common requests, reserve premium models for tasks that need strong reasoning. Save 40-60% on mixed use cases.

Can I keep using popular SDKs?

100% compatible with OpenAI SDK. Just change baseURL to api.genfity.com/v1—all methods and responses are identical.

How does billing work?

3 models: Credit (package top-up), Subscription (unlimited monthly), or PAYG (USD balance per token). Pick what fits your usage.

Is it production ready?

Yes. Rate limits, failover, monitoring, and circuit breaker are built-in. 99.9% uptime target with formal SLA for Pro/Enterprise plans.

Get started

Start with one API key. Scale to many models when ready.

Sign up free with starter credit. No USD credit card needed.

Credit packages
Flexible top up
Usage control
Quota & balance tracking
Fast routing
Per model capability

Footer.cta.scheduleHeading

Footer.cta.scheduleBody

Footer.cta.consultNow
Footer.brand.alt