Model Directory

Search the modelsyour product can ship.

Browse live routes for Seedance 2.5, Seedance 2.0 mini, Seedance 2.0, Happy Horse 1.1, Seedream 5.0, Claude Fable 5, Claude 4.8, Claude 4.7, Claude 4.6, Claude 4.5, ElevenLabs, Suno, and the rest of your image, video, audio, TTS, and text stack.

Seedance 2.5Seedance 2.0 miniSeedance 2.0Happy Horse 1.1Seedream 5.0Claude Fable 5Claude 4.8Claude 4.7Claude 4.6Claude 4.5ElevenLabsSuno

Live catalogOpen the current model pricing dashboardJump to live model prices, availability, API keys, and route setup.

Search by provider, compare context and pricing signals, then route the right model through one API.

Open live pricing Request a model

Search and filter

Filter by modality, provider, reasoning, or sort order.

Modality

Provider

Sort

Reasoning only

All (683)Audio (23)Embeddings (26)File (71)Image (226)Rerank (3)Text (683)TTS (2)Video (47)

Results

ByteDance

Seedance 2.5

VideoLive

bytedance/seedance-2.5

Vendor-priority routed video model for text-to-video, image-to-video, real-person scenes, and commercial motion workflows.

VideoTextImage

Context

N/A

Group

Featured

Pricing preview

Video tokens: $7 /M tokens

Need volume pricing or launch support?

Contact sales

Open model page

ByteDance

Seedance 2.0 mini

VideoLive

bytedance/seedance-2.0-mini

Lower-cost Seedance route for fast video generation tests, prototypes, and scalable creative workloads.

VideoTextImage

Context

N/A

Group

Featured

Pricing preview

See live pricing dashboard

Need volume pricing or launch support?

Contact sales

Open model page

ByteDance

Seedance 2.0

VideoLive

bytedance/seedance-2.0

Seedance 2.0 video generation route for text-to-video and image-to-video workflows through one API.

VideoTextImage

Context

N/A

Group

Featured

Pricing preview

See live pricing dashboard

Need volume pricing or launch support?

Contact sales

Open model page

Alibaba

Happy Horse 1.1

VideoLive

alibaba/happyhorse-1.0

Character-led video generation for prompt, image, reference, and edit workflows tuned for product and creative teams.

VideoImage

Context

N/A

Group

Featured

Pricing preview

From $0.7 / video

Need volume pricing or launch support?

Contact sales

Open model page

ByteDance

Seedream 5.0

ImageLive

bytedance/seedream-5.0

Image generation, editing, and multi-image composition route for teams building commercial creative tools.

ImageText

Context

N/A

Group

Featured

Pricing preview

See live pricing dashboard

Need volume pricing or launch support?

Contact sales

Open model page

Anthropic

Claude Fable 5 / 4.x

TextLive

anthropic/claude-fable-5

Claude search and routing intent hub for Fable 5, Claude 4.8, 4.7, 4.6, and 4.5 fallback planning.

Text

Context

N/A

Group

Featured

Pricing preview

Compare Claude routes in dashboard

Need volume pricing or launch support?

Contact sales

Open model page

ElevenLabs

AudioLive

elevenlabs/voice

Voice, text-to-speech, and speech workflows that can share billing and routing with image and video models.

AudioTTS

Context

N/A

Group

Featured

Pricing preview

See live pricing dashboard

Need volume pricing or launch support?

Contact sales

Open model page

Suno

AudioLive

suno/music

Music generation search path for teams evaluating Suno-style audio workflows alongside multimodal APIs.

Audio

Context

N/A

Group

Featured

Pricing preview

See live pricing dashboard

Need volume pricing or launch support?

Contact sales

Open model page

OpenAI

GPT Image 2

ImageLive

openai/gpt-image-2

High-quality image generation and editing route with strong instruction following and production-ready visual output.

ImageText

Context

N/A

Group

Featured

Pricing preview

Input $8/M · cache read $2/M · output $30/M

Need volume pricing or launch support?

Contact sales

Open model page

Google

Nano Banana Pro

ImageLive

google/nano-banana-pro

Fast image route for stylized and photoreal prompts, useful for lightweight visual generation and iteration.

ImageText

Context

N/A

Group

Featured

Pricing preview

From $0.03 / img

Need volume pricing or launch support?

Contact sales

Open model page

Kuaishou

Kling AI

VideoLive

kuaishou/kling-v3.0

Unified Kling route for current-generation prompt-led and image-guided video tasks with aspect-ratio controls.

VideoImage

Context

N/A

Group

Featured

Pricing preview

From $0.4 / video

Need volume pricing or launch support?

Contact sales

Open model page

Alibaba

Wan 2.7

VideoLive

alibaba/wan-2.7

Current Wan video route for text-to-video, image-guided generation, and reference-driven motion workflows.

VideoTextImage

Context

N/A

Group

Featured

Pricing preview

Video output: $0.1 / second

Need volume pricing or launch support?

Contact sales

Open model page

Google Vertex

Google: Veo 3.1

VideoLive

google/veo-3.1

Google's state-of-the-art video generation model, built for maximum visual fidelity in final production cuts. Veo 3.1 generates high-quality 1080p video from text or image prompts with native synchronized audio — including dialogue, ambient effects, and background sound. Supports scene extension (up to 20 chained clips for 140+ second narratives), frames-to-video transitions between two images, vertical video for Shorts, and 4K upscaling.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video (with audio): $0.4 per second

Video (no audio): $0.2 per second

Need volume pricing or launch support?

Contact sales

Anthropic

Anthropic: Claude Opus 4.7

TextReasoningLive

anthropic/claude-opus-4.7

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on complex, multi-step tasks and more reliable agentic execution across extended workflows. It is especially effective for asynchronous agent pipelines where tasks unfold over time - large codebases, multi-stage debugging, and end-to-end project orchestration. Beyond coding, Opus 4.7 brings improved knowledge work capabilities - from drafting documents and building presentations to analyzing data. It maintains coherence across very long outputs and extended sessions, making it a strong default for tasks that require persistence, judgment, and follow-through. For users upgrading from earlier Opus versions, see our [official migration guide here](https://openrouter.ai/docs/guides/evaluate-and-optimize/model-migrations/claude-4-7)

TextImage

Context

Group

Claude

Pricing preview

Input Price: $5 /M tokens

Output Price: $25 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.4

TextReasoningLive

openai/gpt-5.4

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following. It is designed as a strong default for both general-purpose tasks and software engineering, capable of generating production-quality code, synthesizing information across multiple sources, and executing complex multi-step workflows with fewer iterations and greater token efficiency.

TextImageFile

Context

1.1M

Group

GPT

Pricing preview

Input Price: $2.5 /M tokens

Output Price: $15 /M tokens

Need volume pricing or launch support?

Contact sales

AtlasCloud

Kling: Video O1

VideoLive

kwaivgi/kling-video-o1

Kling Video O1 is a video generation model from Kuaishou. It supports text and image inputs with video output, enabling text-to-video and image-to-video workflows. It is suited for cinematic content production, with first-frame and last-frame control for precise scene composition. It generates 5 or 10 second clips in 16:9, 9:16, or 1:1 aspect ratios.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Output: $0.0896 per second

Need volume pricing or launch support?

Contact sales

Mistral

Mistral: Voxtral Mini TTS

TTSLive

mistralai/voxtral-mini-tts-2603

Voxtral Mini TTS is Mistral's text-to-speech model featuring zero-shot voice cloning and multilingual support. It converts text input into natural-sounding audio output.

TTSText

Context

4.1K

Group

Mistral

Pricing preview

Characters: $16 per 1M characters

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT-4o Mini TTS

TTSLive

openai/gpt-4o-mini-tts-2025-12-15

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model. It converts text input into natural-sounding audio output, supporting a variety of voices and tones.

TTSText

Context

4.1K

Group

GPT

Pricing preview

Characters: $0.6 per 1M characters

Need volume pricing or launch support?

Contact sales

Parasail

MoonshotAI: Kimi K2.6

TextReasoningLive

moonshotai/kimi-k2.6

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and can convert prompts and visual inputs into production-ready interfaces. Its agent swarm architecture scales to hundreds of parallel sub-agents for autonomous task decomposition - delivering documents, websites, and spreadsheets in a single run without human oversight.

TextImage

Context

262.1K

Group

Other

Pricing preview

Input Price: $0.6 /M tokens

Output Price: $2.8 /M tokens

Need volume pricing or launch support?

Contact sales

Chutes

MoonshotAI: Kimi K2.5

TextReasoningLive

moonshotai/kimi-k2.5

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed visual and text tokens, it delivers strong performance in general reasoning, visual coding, and agentic tool-calling.

TextImage

Context

262.1K

Group

Other

Pricing preview

Input Price: $0.44 /M tokens

Output Price: $2 /M tokens

Need volume pricing or launch support?

Contact sales

Google AI Studio

Google: Gemini Embedding 2 Preview

EmbeddingsLive

google/gemini-embedding-2-preview

Gemini Embedding 2 Preview is Google's first multimodal embedding model, mapping text, images, video, audio, and PDFs into a unified vector space for semantic search and retrieval-augmented generation (RAG). It supports input context up to 8,192 tokens and flexible output dimensions from 128 to 3,072 (recommended: 768, 1536, or 3,072). Designed for cross-modal similarity — you can embed a text query and retrieve the most relevant images, or vice versa — making it well-suited for multimodal search, recommendation, and document understanding pipelines.

EmbeddingsTextImage

Context

8.2K

Group

Gemini

Pricing preview

Text Input: $0.2 /M tokens

Image Input: $0.45 /M tokens

Need volume pricing or launch support?

Contact sales

Black Forest Labs

Black Forest Labs: FLUX.2 Klein 4B

ImageLive

black-forest-labs/flux.2-klein-4b

FLUX.2 [klein] 4B is the fastest and most cost-effective model in the FLUX.2 family, optimized for high-throughput use cases while maintaining excellent image quality. Pricing is based on the output image. The first generated megapixel is charged $0.014. Each subsequent megapixel is charged $0.001.

ImageText

Context

41K

Group

Other

Pricing preview

Output Image: $0.014 per megapixel

Need volume pricing or launch support?

Contact sales

Anthropic

Anthropic: Claude Opus 4.6 (Fast)

TextReasoningLive

anthropic/claude-opus-4.6-fast

Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

TextImage

Context

Group

Claude

Pricing preview

Input Price: $30 /M tokens

Output Price: $15 /M tokens

Need volume pricing or launch support?

Contact sales

Amazon Bedrock

Anthropic: Claude Opus 4.6

TextReasoningLive

anthropic/claude-opus-4.6

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective for large codebases, complex refactors, and multi-step debugging that unfolds over time. The model shows deeper contextual understanding, stronger problem decomposition, and greater reliability on hard engineering tasks than prior generations. Beyond coding, Opus 4.6 excels at sustained knowledge work. It produces near-production-ready documents, plans, and analyses in a single pass, and maintains coherence across very long outputs and extended sessions. This makes it a strong default for tasks that require persistence, judgment, and follow-through, such as technical design, migration planning, and end-to-end project execution. For users upgrading from earlier Opus versions, see our [official migration guide here](https://openrouter.ai/docs/guides/guides/model-migrations/claude-4-6-opus)

TextImage

Context

Group

Claude

Pricing preview

Input Price: $5 /M tokens

Output Price: $25 /M tokens

Need volume pricing or launch support?

Contact sales

Seed

ByteDance: Seedance 1.5 Pro

VideoLive

bytedance/seedance-1-5-pro

ByteDance's next-generation audio-visual generation model with a 4.5B parameter Dual-Branch Diffusion Transformer architecture. Seedance 1.5 Pro generates video and audio simultaneously in a single unified pass — eliminating the timing issues of sequential audio dubbing. Supports multi-language lip-sync (English, Mandarin, Japanese, Korean, Spanish, and more), cinematic camera control (pan, tilt, zoom, orbit), multi-character dialogue, and character consistency across shots. Produces clips from 4–12 seconds at up to 1080p. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Tokens (with audio): $2.4 /M tokens

Video Tokens (no audio): $1.2 /M tokens

Need volume pricing or launch support?

Contact sales

Seed

ByteDance: Seedance 2.0 Fast

VideoLive

bytedance/seedance-2.0-fast

Seedance 2.0 Fast is a video generation model from ByteDance. It supports text-to-video, image-to-video with first and last frame control, and multimodal reference-to-video. It prioritizes generation speed and lower cost over maximum output quality. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Tokens (with audio): $5.6 /M tokens

Video Tokens (no audio): $5.6 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: Sora 2 Pro

VideoLive

openai/sora-2-pro

OpenAI's flagship video generation model, delivering production-quality video with physics-accurate motion, synchronized audio, and world-state persistence across shots. Sora 2 Pro follows intricate multi-shot instructions while maintaining consistent spatial relationships — objects don't disappear or change shape between cuts. Supports text-to-video and image-to-video, with synchronized background soundscapes, speech, and sound effects. Includes advanced content safety with C2PA metadata provenance and SynthID-style watermarking.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Output: $0.3 per second

Need volume pricing or launch support?

Contact sales

Stealth

Elephant

TextLive

openrouter/elephant-alpha

Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens, function calling, structured output, and prompt caching. It is particularly well-suited for code completion and debugging, rapid document processing, and lightweight agent interactions. Note: Prompts and completions may be logged by the provider and used to improve the model.

Text

Context

262.1K

Group

Other

Pricing preview

Input Price: $0 /M tokens

Output Price: $0 /M tokens

Need volume pricing or launch support?

Contact sales

Anthropic

Anthropic: Claude Sonnet 4.6

TextReasoningLive

anthropic/claude-sonnet-4.6

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with memory, polished document creation, and confident computer use for web QA and workflow automation.

TextImage

Context

Group

Claude

Pricing preview

Input Price: $3 /M tokens

Output Price: $15 /M tokens

Need volume pricing or launch support?

Contact sales

Anthropic

Anthropic: Claude Opus 4.5

TextReasoningLive

anthropic/claude-opus-4.5

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and reasoning benchmarks, and improved robustness to prompt injection. The model is designed to operate efficiently across varied effort levels, enabling developers to trade off speed, depth, and token usage depending on task requirements. It comes with a new parameter to control token efficiency, which can be accessed using the OpenRouter Verbosity parameter with low, medium, or high. Opus 4.5 supports advanced tool use, extended context management, and coordinated multi-agent setups, making it well-suited for autonomous research, debugging, multi-step planning, and spreadsheet/browser manipulation. It delivers substantial gains in structured reasoning, execution reliability, and alignment compared to prior Opus generations, while reducing token overhead and improving performance on long-running tasks.

TextFileImage

Context

200K

Group

Claude

Pricing preview

Input Price: $5 /M tokens

Output Price: $25 /M tokens

Need volume pricing or launch support?

Contact sales

Google Vertex (Europe)

Anthropic: Claude Opus 4

TextReasoningLive

anthropic/claude-opus-4

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in software engineering, achieving leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). Opus 4 supports extended, agentic workflows, handling thousands of task steps continuously for hours without degradation. Read more at the [blog post here](https://www.anthropic.com/news/claude-4)

TextImageFile

Context

200K

Group

Claude

Pricing preview

Input Price: $15 /M tokens

Output Price: $75 /M tokens

Need volume pricing or launch support?

Contact sales

Google Vertex (Global)

Google: Gemini 2.5 Pro

TextReasoningLive

google/gemini-2.5-pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

TextImageFileAudioVideo

Context

Group

Gemini

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Need volume pricing or launch support?

Contact sales

Unknown provider

Anthropic: Claude 3.5 Sonnet

TextLive

anthropic/claude-3.5-sonnet

New Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at: - Coding: Scores ~49% on SWE-Bench Verified, higher than the last best score, and without any fancy prompt scaffolding - Data science: Augments human data science expertise; navigates unstructured data while using multiple tools for insights - Visual processing: excelling at interpreting charts, graphs, and images, accurately transcribing text to derive insights beyond just the text alone - Agentic tasks: exceptional tool use, making it great at agentic tasks (i.e. complex, multi-step problem solving tasks that require engaging with other systems) #multimodal

TextImageFile

Context

200K

Group

Claude

Pricing preview

No pricing information published for this model.

Need volume pricing or launch support?

Contact sales

AtlasCloud

Alibaba: Wan 2.6

VideoLive

alibaba/wan-2.6

Alibaba's most advanced video generation model, supporting over 10 visual creation capabilities in a unified system. Wan 2.6 generates 1080p video at 24fps from text, images, reference videos, or audio, with native audio-visual synchronization and precise lip-sync. Key features include reference-to-video (insert a character's appearance and voice into new scenes), multi-shot storytelling from simple prompts, synchronized sound effects and music, and support for 16:9, 9:16, and 1:1 aspect ratios with clips up to 15 seconds.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Text to Video: $0.04 per second

Image to Video: $0.1 per second

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.4 Nano

TextReasoningLive

openai/gpt-5.4-nano

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and efficiency over deep reasoning, making it ideal for pipelines that require fast, reliable outputs at scale. GPT-5.4 nano is well suited for background tasks, real-time systems, and distributed agent architectures where minimizing cost and latency is essential.

TextFileImage

Context

400K

Group

GPT

Pricing preview

Input Price: $0.2 /M tokens

Output Price: $1.25 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.4 Mini

TextReasoningLive

openai/gpt-5.4-mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.

TextFileImage

Context

400K

Group

GPT

Pricing preview

Input Price: $0.75 /M tokens

Output Price: $4.5 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.4 Pro

TextReasoningLive

openai/gpt-5.4-pro

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs. Optimized for step-by-step reasoning, instruction following, and accuracy, GPT-5.4 Pro excels at agentic coding, long-context workflows, and multi-step problem solving.

TextImageFile

Context

1.1M

Group

GPT

Pricing preview

Input Price: $30 /M tokens

Output Price: $18 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT-5.3 Chat

TextLive

openai/gpt-5.3-chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualization and significantly reduces unnecessary refusals, caveats, and overly cautious phrasing that can interrupt conversational flow.

TextImageFile

Context

128K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT-5.3-Codex

TextReasoningLive

openai/gpt-5.3-codex

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results on SWE-Bench Pro and strong performance on Terminal-Bench 2.0 and OSWorld-Verified, reflecting improved multi-language coding, terminal proficiency, and real-world computer-use skills. The model is optimized for long-running, tool-using workflows and supports interactive steering during execution, making it suitable for complex development tasks, debugging, deployment, and iterative product work. Beyond coding, GPT-5.3-Codex performs strongly on structured knowledge-work benchmarks such as GDPval, supporting tasks like document drafting, spreadsheet analysis, slide creation, and operational research across domains. It is trained with enhanced cybersecurity awareness, including vulnerability identification capabilities, and deployed with additional safeguards for high-risk use cases. Compared to prior Codex models, it is more token-efficient and approximately 25% faster, targeting professional end-to-end workflows that span reasoning, execution, and computer interaction.

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT Audio

TextLive

openai/gpt-audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

TextAudio

Context

128K

Group

GPT

Pricing preview

Input Price: $2.5 /M tokens

Output Price: $10 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT Audio Mini

TextLive

openai/gpt-audio-mini

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million tokens and output is priced at $2.40 per million tokens.

TextAudio

Context

128K

Group

GPT

Pricing preview

Input Price: $0.6 /M tokens

Output Price: $2.4 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.2-Codex

TextReasoningLive

openai/gpt-5.2-codex

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5.1-Codex, 5.2-Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter. Read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level) Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically—providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.

TextImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.2 Chat

TextLive

openai/gpt-5.2-chat

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.2 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

TextFileImage

Context

128K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT-5.2 Pro

TextReasoningLive

openai/gpt-5.2-pro

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy in high-stakes use cases. It supports test-time routing features and advanced prompt understanding, including user-specified intent like "think hard about this." Improvements include reductions in hallucination, sycophancy, and better performance in coding, writing, and health-related tasks.

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $21 /M tokens

Output Price: $168 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.2

TextReasoningLive

openai/gpt-5.2

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. Built for broad task coverage, GPT-5.2 delivers consistent gains across math, coding, sciende, and tool calling workloads, with more coherent long-form answers and improved tool-use reliability.

TextFileImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT-5.1-Codex-Max

TextReasoningLive

openai/gpt-5.1-codex-max

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic workflows spanning software engineering, mathematics, and research. GPT-5.1-Codex-Max delivers faster performance, improved reasoning, and higher token efficiency across the development lifecycle.

TextImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Need volume pricing or launch support?

Contact sales

Azure

OpenAI: GPT-5.1

TextReasoningLive

openai/gpt-5.1

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. The model produces clearer, more grounded explanations with reduced jargon, making it easier to follow even on technical or multi-step problems. Built for broad task coverage, GPT-5.1 delivers consistent gains across math, coding, and structured analysis workloads, with more coherent long-form answers and improved tool-use reliability. It also features refined conversational alignment, enabling warmer, more intuitive responses without compromising precision. GPT-5.1 serves as the primary full-capability successor to GPT-5

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Need volume pricing or launch support?

Contact sales

OpenAI

OpenAI: GPT-5.1 Chat

TextLive

openai/gpt-5.1-chat

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.1 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

TextFileImage

Context

128K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Need volume pricing or launch support?

Contact sales

Page 1 of 15

PreviousNext

Missing a route?

Need a model that is not in the directory yet?

Send the model name, expected traffic, and launch timing. We will confirm availability, onboarding priority, or an equivalent route already live on ImaRouter.

Support

support@imarouter.com

For model requests, onboarding priority, routing strategy, or rollout planning, reach out directly.

Request a model Send model details