x402 pay-per-request, prepaid credits, or enterprise contracts

Enterprise controls for LLM access, usage, and agent billing.

PerkOS LLM Gateway gives teams a controlled front door for model access: wallet-authenticated dashboards, per-agent metering, model routing, and a clean path from test traffic to production billing.

Review architecture View plans

Built for operators managing real usage. API keys stay server-side; agents report stable IDs; dashboards stay wallet-scoped.

Abstract PerkOS hero illustration
Gateway preview
# 1) Client calls your gateway POST /v1/chat/completions Authorization: x402 ... { "model": "minimax-m3:cloud", "messages": [...] } # 2) Gateway enforces policy + billing rate-limit ✓ auth ✓ usage-meter ✓ # 3) You get clean logs + exports trace_id: "pk_01H..." cost: $0.0042
Billing modes
x402 + credits + enterprise
Controls
Keys, quotas, policies
Operational usage, policy enforcement, and wallet-scoped visibility for agent fleets.

Everything you need to ship LLM access like a product

Make LLM usage auditable, governable, and monetizable, while keeping developer experience fast.

Flexible monetization

Start with x402 pay-per-request, offer prepaid credits for predictable spend, and close enterprise contracts with invoicing.

Policy + guardrails

Per-key quotas, org and user limits, model allowlists, request validation, and safe defaults for production rollouts.

Observability-ready

Usage metering, clean structured logs, trace IDs, exports, and the hooks you need for dashboards and audits.

Provider-agnostic

Route to your preferred LLM providers. Keep stable client APIs even as vendors and models change.

Security-first

Keep raw provider keys private. Issue scoped gateway keys, rotate easily, and isolate tenants with clear boundaries.

Built for teams

Admin controls, audit trails, cost allocation, and the predictable behavior that engineering and finance both need.

How it works

Drop in a gateway layer that handles auth, metering, and billing. Your app stays focused on product logic.

1

Integrate once

Point clients at the gateway. Keep an OpenAI-compatible surface for fast adoption and minimal churn.

2

Enforce policy automatically

Rate limits, quota caps, model allowlists, and structured logging, enforced consistently on every request.

3

Monetize per request or prepay

Use x402 for instant pay-per-request, or offer prepaid credits for predictable teams and internal chargebacks.

4

Graduate to enterprise

SAML/SSO, SLAs, custom limits, invoicing, and deployment support when you need it.

Deployment notes

A gateway that plays nicely with your infrastructure

Keep provider keys in server-side env vars. Add your WAF, DDoS protection, and logging pipeline. For enterprise, deploy behind a private network boundary or in your preferred cloud.

Client DX

Fast onboarding, stable endpoints, and predictable error semantics for retries and backoffs.

Ops DX

Explicit controls for spend, usage, and keys, plus clean exports for audit and accounting.

Pricing

Pick what fits your stage. Mix and match over time. We designed this to scale from prototypes to procurement.

x402 Pay-per-request
Best for public APIs and instant monetization
No commitments
Usage-based / per request
  • x402-compatible settlement flow
  • Per-key quotas and burst controls
  • Usage metering and per-request receipts
  • Bring your model provider(s)
Enable x402

Ideal when you want buyers to pay automatically as they go.

Enterprise
Best for compliance, SLAs, and procurement
Custom
Let’s talk / tailored
  • SAML/SSO and advanced access controls
  • Custom limits, SLAs, and support
  • Invoicing and contract terms
  • VPC/private deployments available
Talk to sales

We’ll align on security requirements and rollout timelines.

Need help picking a plan?

Tell us your volume, billing preference, and deployment constraints. We’ll recommend the simplest path.

Pricing shown is illustrative. Final terms depend on provider costs, volume, and support requirements.

FAQ

Answers to the questions we hear most from teams shipping LLM products.

What is x402 pay-per-request?

x402 is a payment flow designed for HTTP requests. The gateway can require payment for an API call, settle it, then forward the request to your model provider and return the response, with a receipt and clean usage metadata.

Can I use multiple model providers?

Yes. PerkOS is designed to be provider-agnostic. You can route by model, org, or policy, while keeping a stable client API surface.

Do you support OpenAI-compatible endpoints?

That’s the goal. Teams typically want the least amount of client churn, so we aim for a familiar, OpenAI-style interface wherever possible.

What about security and compliance?

Enterprise plans support stronger controls (SSO, audits, and deployment options). Even on smaller plans, provider keys stay server-side and you can issue scoped gateway keys with quotas and rotation.

Ready for production-grade LLM operations?

Use one gateway for authentication, routing, metering, wallet ownership, and operational visibility across your agent fleet.