SKILLS · 31 modelos · 50+ agentes

Instala skills de modelos de IA generativa en cualquier agente.

Un skill incluye instalación, documentación y llamadas de herramienta para imagen, video, música con IA y LLM. Funciona en Claude Code, Codex CLI, Gemini CLI o directamente desde npm/pip.

All systems normal
# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/<model> -g
The -g flag installs globally so every project picks it up.
Or paste this prompt to your AI agent:
Install the <model> skill for me:

1. Clone https://github.com/runapi-ai/<model>
2. Copy the skills/<model>/ directory into your
   user-level skills directory (e.g. ~/.claude/skills/
   for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
One skill gives your agent everything it needs to call the API — setup, docs, and recipes.
RESUMEN

RunAPI agrega 31 modelos multimodales de 18 proveedores y entrega cada uno como skill instalable por CLI. Una clave desbloquea todo el catálogo de API de IA unificada; un esquema funciona en todos los runtimes.

  • Una sola API key para todos los proveedores
  • Llamadas schema-first validadas antes de enviar
  • Precios medidos por llamada, sin compromiso
  • Skills compatibles con 50+ agentes: Claude Code, Codex, Cursor, Gemini CLI y más
CATEGORÍAS

Explora todos los skills

Filtra por modalidad o ve directo a un proveedor.

Claude
Anthropic
Text

Anthropic's LLM for complex reasoning, code, analysis, and extended-context tasks.

desde $0.050 / 1K tokens Ver →
DeepSeek
DeepSeek
Text

Reasoning-first LLMs via RunAPI — flash for fast, low-cost work; pro for complex agentic tasks.

desde $0.060 / 1K tokens Ver →
ElevenLabs
ElevenLabs
Audio & Music

Voice synthesis, text-to-speech, sound effects, speech-to-text, and audio isolation.

desde $0.040 / minute Ver →
Flux 2
Black Forest Labs
Image

Text-to-image and remix-image with strong prompt adherence from Black Forest Labs.

desde $0.050 / call Ver →
Flux Kontext
Black Forest Labs
Image

In-context image editing — local edits, style transfer, and character-consistent generation.

desde $0.100 / call Ver →
Gemini
Google
Text

Google's multimodal LLM for chat, code generation, reasoning, and long-context tasks.

desde $0.030 / 1K tokens Ver →
Gemini Omni
Google
Image

Voice, character, and multimodal video generation resources for narration, dialogue, and agent media workflows.

desde $0.0000 / call Ver →
GPT
OpenAI
Text

OpenAI's flagship LLM for chat, code generation, and multi-step reasoning tasks.

desde $0.030 / 1K tokens Ver →
GPT Image
OpenAI
Image

Text-to-image and image editing powered by OpenAI's image generation models.

desde $0.040 / call Ver →
GPT Image 2
OpenAI
Image

Latest OpenAI image generation with near-perfect multilingual text rendering inside images.

desde $0.060 / call Ver →
GPT-4o Image
OpenAI
Image

Native image generation inside GPT-4o — generate and edit images within the conversation.

desde $0.060 / call Ver →
Grok Imagine
xAI
Image

Image and video generation from text — text-to-image, image-to-video, and editing with audio.

desde $0.020 / call Ver →
Hailuo
MiniMax
Video

Text and image-to-video at native 1080p with accurate physics simulation and motion.

desde $0.300 / call Ver →
HappyHorse
Alibaba
Video

Text, image, and edit-video generation with 720p and 1080p output, duration control, first-frame image support, ordered reference images for character-guided clips, and source-video editing.

desde $0.480 / second Ver →
Ideogram V3
Ideogram
Text

Text-to-image with industry-leading in-image text accuracy — posters, logos, and typography.

desde $0.070 / call Ver →
Imagen 4
Google
Image

Photorealistic text-to-image with precise typography, broad style range, and up to 2K resolution.

desde $0.040 / call Ver →
InfiniteTalk
MeiGen-AI
Video

Audio-driven talking-head animation — lip-sync and animate a portrait from any audio input.

desde $0.120 / second Ver →
Kling
Kuaishou
Text

Text and image-to-video at up to 4K 60fps with multimodal audio and AI avatar generation.

desde $0.050 / second Ver →
Luma
Luma
Video

Video modification and transformation powered by Luma's Dream Machine model.

desde $0.500 / call Ver →
Nano Banana
Google
Image

Fast text-to-image with accurate in-image text rendering and multi-character consistency.

desde $0.040 / call Ver →
Qwen 2
Alibaba
Image

Text-to-image, image remix, and image editing from Alibaba's Qwen visual model family.

desde $0.040 / call Ver →
Recraft
Recraft
Image

AI image upscaling and background removal for design and production workflows.

desde $0.010 / call Ver →
Runway
Runway
Video

Video generation and editing — create and transform footage with text prompts.

desde $0.120 / call Ver →
Runway Aleph
Runway
Video

Prompt-guided video editing that transforms existing footage with frame-level continuity.

desde $1.10 / call Ver →
Seedance
Bytedance
Video

Text and image-to-video with native audio-video joint synthesis, up to 15-second multi-shot clips.

desde $0.020 / second Ver →
Seedream
Bytedance
Image

Text-to-image and image editing with strong typography rendering, up to 4K resolution.

desde $0.060 / call Ver →
Suno
Suno
Audio & Music

AI music generation — create full songs with vocals, instruments, and lyrics from a text prompt.

desde $0.0000 / call Ver →
Topaz
Topaz
Image

AI-powered image and video upscaling — enhance resolution and detail without artifacts.

desde $0.120 / second Ver →
Veo 3.1
Google
Video

High-fidelity video generation up to 4K with natively synthesized dialogue, sound effects, and ambience.

desde $0.300 / call Ver →
Wan
Alibaba
Video

Comprehensive video and image suite — text-to-video, image-to-video, video editing, and storyboards.

desde $0.050 / call Ver →
Z Image
Alibaba
Image

Ultra-fast text-to-image — photorealistic results in ~1 second with 8 inference steps.

desde $0.010 / call Ver →
SKILLS

Un skill es más que un binding: es documentación, schema e instalación en uno.

Una instalación, todos los agentes

Elige el runtime, pega el comando y el agente obtiene documentación, schema y adaptador.

Llamadas schema-first

Cada skill incluye un esquema JSON tipado. Las llamadas se validan antes de llegar a producción.

Coste y latencia predecibles

Precios por llamada o unidad: sabes cuánto cuesta cada ejecución antes de lanzarla.

INSTALAR PATTERNS

Un comando, todos los agentes.

Funciona con Claude Code, Codex, Gemini CLI, Cursor y 50+ agentes.

# Works with Claude Code, Codex, Gemini CLI, Cursor, and 50+ agents
npx skills add runapi-ai/<model> -g
The -g flag installs globally so every project picks it up.
Or paste this prompt to your AI agent:
Install the <model> skill for me:

1. Clone https://github.com/runapi-ai/<model>
2. Copy the skills/<model>/ directory into your
   user-level skills directory (e.g. ~/.claude/skills/
   for Claude Code, ~/.codex/skills/ for Codex).
3. Verify that SKILL.md is present.
4. Confirm the install path when done.
PRECIOS

Paga solo por lo que llamas.

Tabla completa de precios →
VIDEO
$0.300 / call

Video generation — text-to-video, image-to-video, extend, upscale.

IMAGE
$0.060 / call

Image generation and editing — text-to-image, remix, upscale.

AUDIO & MUSIC
$0.040 / minute

Music, speech synthesis, sound effects, and audio processing.

LLM
$0.050 / 1K tokens

Large language models for chat, code, and reasoning.

FAQ

Respuestas rápidas desde la documentación.

¿Qué es exactamente un model skill?

Un skill empaqueta instalación, schema, prompt y adaptador de runtime para CLIs compatibles.

¿Qué CLIs soportan los skills hoy?

Los skills funcionan con 50+ agentes, incluidos Claude Code, Codex, Gemini CLI y Cursor.

¿Necesito una cuenta separada por proveedor?

No. Una clave RunAPI desbloquea todo el catálogo y unifica la facturación.

¿Cómo se factura?

Por llamada o por unidad medida, con facturación mensual en USD. Las generaciones fallidas no se cobran.

¿Puedo autohospedar el SDK?

Los paquetes SDK en npm, pip y RubyGems tienen licencia MIT.

¿Dónde reporto skills rotos?

Abre un issue en el repositorio público de GitHub.

EMPEZAR

Instala tu primer skill en menos de sesenta segundos.

Un comando, sin pasos de autenticación complicados, primeras 1.000 llamadas gratis.