PROVIDER

OpenAI AI Models

Q: Is this an official OpenAI integration?

RunAPI exposes a managed API surface with transparent per-call pricing, fully documented capability and parameters, and clear error behavior. You get the same model output quality without managing a direct provider relationship or provider-side account.

Q: Do I need a OpenAI account?

No. Your RunAPI API key is enough for managed access to all OpenAI models. You do not need to create a separate account, manage provider-specific credentials, or handle provider-side billing.

Q: What's the latency overhead from proxying through RunAPI?

Typically under 20 ms. RunAPI keeps the proxy layer close to model execution regions to minimize added latency. Media generation time is dominated by the model itself, not the proxy.

Q: Are images / videos cached?

Generated outputs are stored and retrievable by task ID. You can fetch completed results at any time using the task status endpoint or the RunAPI dashboard. Output URLs remain accessible for the retention period shown in the API docs. Inputs are not cached or stored.

Q: Can I bring my own key?

Not currently. Calls use RunAPI-managed access, which simplifies authentication and lets RunAPI handle rate limiting, retries, and billing consolidation on your behalf.

Q: How is billing consolidated?

All API calls across all providers appear on a single monthly USD invoice. There is no per-provider billing, no subscription, and no minimum spend. Failed generations are never charged.

Q: What SDKs can I use with OpenAI models?

Official SDKs are available for Python, Node.js, PHP, Java, Ruby, and Go. Each SDK handles authentication, async task polling, and typed responses. For LLM models, the OpenAI and Anthropic SDKs also work by pointing the base URL to RunAPI.

Q: What are model skills and how do they work?

Model skills are installable packages that load a model's docs, typed schemas, pricing notes, and setup steps directly into your coding workspace. Install a skill with one command and your agent has the right context before you write integration code. Skills work with Claude Code, Codex, Gemini CLI, Cursor, and VS Code.

Q: How do I switch between OpenAI models?

Change the model parameter in your API request. All OpenAI models share the same API key, the same request shape, and the same billing. No code changes, no re-authentication, and no separate billing setup are required when switching between models or between variants of the same model. You can also switch to models from other providers by changing the same parameter — the API surface is unified across the entire catalog.

GPT language models and GPT Image generation — OpenAI's reasoning and creative AI suite.

5 models · 14 variants · from $0.010

Browse all models API docs →

All OpenAI models available through RunAPI 5 models

Embedding Text

3 variants · from $0.010

GPT Text

11 variants · from $0.010

GPT Image Image

2 endpoints · from $0.040

GPT Image 2 Image

2 endpoints · from $0.060

GPT-4o Image Image

1 endpoints · from $0.060

OVERVIEW

OpenAI provides the GPT family of language models for chat, code, and reasoning, alongside GPT Image and GPT Image 2 for text-to-image generation with near-perfect multilingual text rendering.

Single API key shared across all providers
No separate %{provider} account required
Model skills carry docs, schemas, and setup steps into your workspace
Per-call billing in USD, no subscription or minimum spend
Failed generations are never charged
Switch models by changing one parameter
Billing consolidated into one monthly invoice

FEATURES

What stands out

FASTEST

Embedding

P50 ~ <1s

One of the most-used model APIs from OpenAI, chosen by developers for its balance of output quality, speed, and pricing.

FRONTIER

GPT Image 2

Frontier tier

One of the most-used model APIs from OpenAI, chosen by developers for its balance of output quality, speed, and pricing.

CHEAPEST

Embedding

from $0.010

Lowest starting price across the OpenAI catalog, suited for high-volume workflows and cost-sensitive production pipelines.

MODELS

All OpenAI models available through RunAPI

Embedding

OpenAI

Text

OpenAI text embeddings for semantic search, retrieval, clustering, and ranking workflows.

/v1/embeddings endpoint

from $0.010 / 1K tokens View →

GPT

OpenAI

Text

GPT API access for OpenAI's flagship LLM across chat, code generation, and multi-step reasoning tasks.

/v1/responses endpoint

from $0.010 / 1K tokens View →

GPT Image

OpenAI

Image

GPT Image API access for text-to-image and image editing powered by OpenAI image generation models.

from $0.040 / call View →

GPT Image 2

OpenAI

Image

GPT Image 2 API access for OpenAI image generation with near-perfect multilingual text rendering.

from $0.060 / call View →

GPT-4o Image

OpenAI

Image

GPT-4o Image API access for native image generation and editing inside the conversation.

from $0.060 / call View →

QUICKSTART

Install a OpenAI model skill for your app.

Pick a model and add its skill so your coding tool has docs, schemas, pricing notes, and setup steps. Skills work with Claude Code, Codex, Gemini CLI, Cursor, and VS Code. Install once, then switch models by changing one parameter.

runapi.ai

# Base URL
https://runapi.ai

# Endpoints
POST /v1/embeddings

curl https://runapi.ai/v1/chat/completions \
  -H "Authorization: Bearer $RUNAPI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "model": "text-embedding-ada-002",
  "input": [
    "Embed these support articles and a customer query so I can rank the closest matches."
  ],
  "encoding_format": "float"
}'

from openai import OpenAI

client = OpenAI(
    base_url="https://runapi.ai/v1",
    api_key="your-runapi-key"
)

response = client.chat.completions.create(
    model="text-embedding-ada-002",
    messages=[{"role": "user", "content": "Embed these support articles and a customer query so I can rank the closest matches."}]
)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://runapi.ai/v1",
  apiKey: "your-runapi-key"
});

const response = await client.chat.completions.create({
  model: "text-embedding-ada-002",
  messages: [{ role: "user", content: "Embed these support articles and a customer query so I can rank the closest matches." }]
});

https://runapi.ai /v1/embeddings

REFERENCE

Every OpenAI variant with pricing and model IDs

Full pricing table →

Model	Variant	Billing	From
Embedding	text-embedding-3-large	1K tokens	$0.010	View →
	text-embedding-3-small	1K tokens	$0.010	View →
	text-embedding-ada-002	1K tokens	$0.010	View →
GPT	codex-auto-review	1K tokens	$0.150	View →
	gpt-5.2	1K tokens	$0.070	View →
	gpt-5.2-pro	1K tokens	$0.840	View →
	gpt-5.3-codex	1K tokens	$0.070	View →
	gpt-5.3-codex-spark	1K tokens	$0.070	View →
	gpt-5.4	1K tokens	$0.080	View →
	gpt-5.4-mini	1K tokens	$0.030	View →
	gpt-5.4-nano	1K tokens	$0.010	View →
	gpt-5.4-pro	1K tokens	$0.900	View →
	gpt-5.5	1K tokens	$0.150	View →
	gpt-5.5-pro	1K tokens	$0.900	View →

FAQ

Frequently asked questions about OpenAI

Is this an official OpenAI integration?

RunAPI exposes a managed API surface with transparent per-call pricing, fully documented capability and parameters, and clear error behavior. You get the same model output quality without managing a direct provider relationship or provider-side account.

Do I need a OpenAI account?

No. Your RunAPI API key is enough for managed access to all OpenAI models. You do not need to create a separate account, manage provider-specific credentials, or handle provider-side billing.

What's the latency overhead from proxying through RunAPI?

Typically under 20 ms. RunAPI keeps the proxy layer close to model execution regions to minimize added latency. Media generation time is dominated by the model itself, not the proxy.

Are images / videos cached?

Generated outputs are stored and retrievable by task ID. You can fetch completed results at any time using the task status endpoint or the RunAPI dashboard. Output URLs remain accessible for the retention period shown in the API docs. Inputs are not cached or stored.

Can I bring my own key?

Not currently. Calls use RunAPI-managed access, which simplifies authentication and lets RunAPI handle rate limiting, retries, and billing consolidation on your behalf.

How is billing consolidated?

All API calls across all providers appear on a single monthly USD invoice. There is no per-provider billing, no subscription, and no minimum spend. Failed generations are never charged.

What SDKs can I use with OpenAI models?

Official SDKs are available for Python, Node.js, PHP, Java, Ruby, and Go. Each SDK handles authentication, async task polling, and typed responses. For LLM models, the OpenAI and Anthropic SDKs also work by pointing the base URL to RunAPI.

What are model skills and how do they work?

Model skills are installable packages that load a model's docs, typed schemas, pricing notes, and setup steps directly into your coding workspace. Install a skill with one command and your agent has the right context before you write integration code. Skills work with Claude Code, Codex, Gemini CLI, Cursor, and VS Code.

How do I switch between OpenAI models?

Change the model parameter in your API request. All OpenAI models share the same API key, the same request shape, and the same billing. No code changes, no re-authentication, and no separate billing setup are required when switching between models or between variants of the same model. You can also switch to models from other providers by changing the same parameter — the API surface is unified across the entire catalog.