Video Generation API is now live!

Models

Explore the active model market,from a local OpenRouter snapshot.

This page reads from a local JSON snapshot synced from OpenRouter, so the catalog stays fast, indexable, and stable. Use it to browse current model coverage by provider, modality, reasoning support, context window, and pricing metadata.

Reset

Results

Showing 48 of 683 matching models

Snapshot source: OpenRouter. Synced April 21, 2026 at 8:00 AM. Page 1 of 15.

This route is built from local JSON so the catalog stays stable for browsing and SEO. If you need a specific model on ImaRouter, treat this page as a discovery reference and then contact the team for availability.

Video

AtlasCloud

Kling: Video O1

Kling Video O1 is a video generation model from Kuaishou. It supports text and image inputs with video output, enabling text-to-video and image-to-video workflows. It is suited for cinematic content production, with first-frame and last-frame control for precise scene composition. It generates 5 or 10 second clips in 16:9, 9:16, or 1:1 aspect ratios.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Output: $0.0896 per second

Slug

kwaivgi/kling-video-o1

TTS

Mistral

Mistral: Voxtral Mini TTS

Voxtral Mini TTS is Mistral's text-to-speech model featuring zero-shot voice cloning and multilingual support. It converts text input into natural-sounding audio output.

TTSText

Context

4.1K

Group

Mistral

Pricing preview

Characters: $16 per 1M characters

Slug

mistralai/voxtral-mini-tts-2603

TTS

OpenAI

OpenAI: GPT-4o Mini TTS

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model. It converts text input into natural-sounding audio output, supporting a variety of voices and tones.

TTSText

Context

4.1K

Group

GPT

Pricing preview

Characters: $0.6 per 1M characters

Slug

openai/gpt-4o-mini-tts-2025-12-15

TextReasoning

Parasail

MoonshotAI: Kimi K2.6

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and can convert prompts and visual inputs into production-ready interfaces. Its agent swarm architecture scales to hundreds of parallel sub-agents for autonomous task decomposition - delivering documents, websites, and spreadsheets in a single run without human oversight.

TextImage

Context

262.1K

Group

Other

Pricing preview

Input Price: $0.6 /M tokens

Output Price: $2.8 /M tokens

Slug

moonshotai/kimi-k2.6

TextReasoning

Chutes

MoonshotAI: Kimi K2.5

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed visual and text tokens, it delivers strong performance in general reasoning, visual coding, and agentic tool-calling.

TextImage

Context

262.1K

Group

Other

Pricing preview

Input Price: $0.44 /M tokens

Output Price: $2 /M tokens

Slug

moonshotai/kimi-k2.5

Embeddings

Google AI Studio

Google: Gemini Embedding 2 Preview

Gemini Embedding 2 Preview is Google's first multimodal embedding model, mapping text, images, video, audio, and PDFs into a unified vector space for semantic search and retrieval-augmented generation (RAG). It supports input context up to 8,192 tokens and flexible output dimensions from 128 to 3,072 (recommended: 768, 1536, or 3,072). Designed for cross-modal similarity — you can embed a text query and retrieve the most relevant images, or vice versa — making it well-suited for multimodal search, recommendation, and document understanding pipelines.

EmbeddingsTextImage

Context

8.2K

Group

Gemini

Pricing preview

Text Input: $0.2 /M tokens

Image Input: $0.45 /M tokens

Slug

google/gemini-embedding-2-preview

TextReasoning

Anthropic

Anthropic: Claude Opus 4.7

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on complex, multi-step tasks and more reliable agentic execution across extended workflows. It is especially effective for asynchronous agent pipelines where tasks unfold over time - large codebases, multi-stage debugging, and end-to-end project orchestration. Beyond coding, Opus 4.7 brings improved knowledge work capabilities - from drafting documents and building presentations to analyzing data. It maintains coherence across very long outputs and extended sessions, making it a strong default for tasks that require persistence, judgment, and follow-through. For users upgrading from earlier Opus versions, see our [official migration guide here](https://openrouter.ai/docs/guides/evaluate-and-optimize/model-migrations/claude-4-7)

TextImage

Context

1M

Group

Claude

Pricing preview

Input Price: $5 /M tokens

Output Price: $25 /M tokens

Slug

anthropic/claude-opus-4.7

Image

Black Forest Labs

Black Forest Labs: FLUX.2 Klein 4B

FLUX.2 [klein] 4B is the fastest and most cost-effective model in the FLUX.2 family, optimized for high-throughput use cases while maintaining excellent image quality. Pricing is based on the output image. The first generated megapixel is charged $0.014. Each subsequent megapixel is charged $0.001.

ImageText

Context

41K

Group

Other

Pricing preview

Output Image: $0.014 per megapixel

Slug

black-forest-labs/flux.2-klein-4b

TextReasoning

Anthropic

Anthropic: Claude Opus 4.6 (Fast)

Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

TextImage

Context

1M

Group

Claude

Pricing preview

Input Price: $30 /M tokens

Output Price: $15 /M tokens

Slug

anthropic/claude-opus-4.6-fast

TextReasoning

Amazon Bedrock

Anthropic: Claude Opus 4.6

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective for large codebases, complex refactors, and multi-step debugging that unfolds over time. The model shows deeper contextual understanding, stronger problem decomposition, and greater reliability on hard engineering tasks than prior generations. Beyond coding, Opus 4.6 excels at sustained knowledge work. It produces near-production-ready documents, plans, and analyses in a single pass, and maintains coherence across very long outputs and extended sessions. This makes it a strong default for tasks that require persistence, judgment, and follow-through, such as technical design, migration planning, and end-to-end project execution. For users upgrading from earlier Opus versions, see our [official migration guide here](https://openrouter.ai/docs/guides/guides/model-migrations/claude-4-6-opus)

TextImage

Context

1M

Group

Claude

Pricing preview

Input Price: $5 /M tokens

Output Price: $25 /M tokens

Slug

anthropic/claude-opus-4.6

Video

Seed

ByteDance: Seedance 1.5 Pro

ByteDance's next-generation audio-visual generation model with a 4.5B parameter Dual-Branch Diffusion Transformer architecture. Seedance 1.5 Pro generates video and audio simultaneously in a single unified pass — eliminating the timing issues of sequential audio dubbing. Supports multi-language lip-sync (English, Mandarin, Japanese, Korean, Spanish, and more), cinematic camera control (pan, tilt, zoom, orbit), multi-character dialogue, and character consistency across shots. Produces clips from 4–12 seconds at up to 1080p. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Tokens (with audio): $2.4 /M tokens

Video Tokens (no audio): $1.2 /M tokens

Slug

bytedance/seedance-1-5-pro

Video

Seed

ByteDance: Seedance 2.0

Seedance 2.0 is a video generation model from ByteDance. It supports text-to-video, image-to-video with first and last frame control, and multimodal reference-to-video. It is particularly strong at preserving character consistency, visual style, and camera movement from reference material. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Tokens (with audio): $7 /M tokens

Video Tokens (no audio): $7 /M tokens

Slug

bytedance/seedance-2.0

Video

Seed

ByteDance: Seedance 2.0 Fast

Seedance 2.0 Fast is a video generation model from ByteDance. It supports text-to-video, image-to-video with first and last frame control, and multimodal reference-to-video. It prioritizes generation speed and lower cost over maximum output quality. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Tokens (with audio): $5.6 /M tokens

Video Tokens (no audio): $5.6 /M tokens

Slug

bytedance/seedance-2.0-fast

Video

OpenAI

OpenAI: Sora 2 Pro

OpenAI's flagship video generation model, delivering production-quality video with physics-accurate motion, synchronized audio, and world-state persistence across shots. Sora 2 Pro follows intricate multi-shot instructions while maintaining consistent spatial relationships — objects don't disappear or change shape between cuts. Supports text-to-video and image-to-video, with synchronized background soundscapes, speech, and sound effects. Includes advanced content safety with C2PA metadata provenance and SynthID-style watermarking.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Output: $0.3 per second

Slug

openai/sora-2-pro

Video

AtlasCloud

Alibaba: Wan 2.7

Wan 2.7 is a video generation model from Alibaba. It supports text-to-video, image-to-video with first and last frame control, and reference-to-video, where multiple reference images guide the style and content of the generated scene.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video Output: $0.1 per second

Slug

alibaba/wan-2.7

Text

Stealth

Elephant

Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens, function calling, structured output, and prompt caching. It is particularly well-suited for code completion and debugging, rapid document processing, and lightweight agent interactions. Note: Prompts and completions may be logged by the provider and used to improve the model.

Text

Context

262.1K

Group

Other

Pricing preview

Input Price: $0 /M tokens

Output Price: $0 /M tokens

Slug

openrouter/elephant-alpha

TextReasoning

Anthropic

Anthropic: Claude Sonnet 4.6

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with memory, polished document creation, and confident computer use for web QA and workflow automation.

TextImage

Context

1M

Group

Claude

Pricing preview

Input Price: $3 /M tokens

Output Price: $15 /M tokens

Slug

anthropic/claude-sonnet-4.6

TextReasoning

Anthropic

Anthropic: Claude Opus 4.5

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and reasoning benchmarks, and improved robustness to prompt injection. The model is designed to operate efficiently across varied effort levels, enabling developers to trade off speed, depth, and token usage depending on task requirements. It comes with a new parameter to control token efficiency, which can be accessed using the OpenRouter Verbosity parameter with low, medium, or high. Opus 4.5 supports advanced tool use, extended context management, and coordinated multi-agent setups, making it well-suited for autonomous research, debugging, multi-step planning, and spreadsheet/browser manipulation. It delivers substantial gains in structured reasoning, execution reliability, and alignment compared to prior Opus generations, while reducing token overhead and improving performance on long-running tasks.

TextFileImage

Context

200K

Group

Claude

Pricing preview

Input Price: $5 /M tokens

Output Price: $25 /M tokens

Slug

anthropic/claude-opus-4.5

TextReasoning

Google Vertex (Europe)

Anthropic: Claude Opus 4

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in software engineering, achieving leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). Opus 4 supports extended, agentic workflows, handling thousands of task steps continuously for hours without degradation. Read more at the [blog post here](https://www.anthropic.com/news/claude-4)

TextImageFile

Context

200K

Group

Claude

Pricing preview

Input Price: $15 /M tokens

Output Price: $75 /M tokens

Slug

anthropic/claude-opus-4

TextReasoning

Google Vertex (Global)

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

TextImageFileAudioVideo

Context

1M

Group

Gemini

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Slug

google/gemini-2.5-pro

Text

Unknown provider

Anthropic: Claude 3.5 Sonnet

New Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at: - Coding: Scores ~49% on SWE-Bench Verified, higher than the last best score, and without any fancy prompt scaffolding - Data science: Augments human data science expertise; navigates unstructured data while using multiple tools for insights - Visual processing: excelling at interpreting charts, graphs, and images, accurately transcribing text to derive insights beyond just the text alone - Agentic tasks: exceptional tool use, making it great at agentic tasks (i.e. complex, multi-step problem solving tasks that require engaging with other systems) #multimodal

TextImageFile

Context

200K

Group

Claude

Pricing preview

No display pricing published in the current snapshot.

Slug

anthropic/claude-3.5-sonnet

Video

AtlasCloud

Alibaba: Wan 2.6

Alibaba's most advanced video generation model, supporting over 10 visual creation capabilities in a unified system. Wan 2.6 generates 1080p video at 24fps from text, images, reference videos, or audio, with native audio-visual synchronization and precise lip-sync. Key features include reference-to-video (insert a character's appearance and voice into new scenes), multi-shot storytelling from simple prompts, synchronized sound effects and music, and support for 16:9, 9:16, and 1:1 aspect ratios with clips up to 15 seconds.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Text to Video: $0.04 per second

Image to Video: $0.1 per second

Slug

alibaba/wan-2.6

Video

Google Vertex

Google: Veo 3.1

Google's state-of-the-art video generation model, built for maximum visual fidelity in final production cuts. Veo 3.1 generates high-quality 1080p video from text or image prompts with native synchronized audio — including dialogue, ambient effects, and background sound. Supports scene extension (up to 20 chained clips for 140+ second narratives), frames-to-video transitions between two images, vertical video for Shorts, and 4K upscaling.

VideoTextImage

Context

N/A

Group

Other

Pricing preview

Video (with audio): $0.4 per second

Video (no audio): $0.2 per second

Slug

google/veo-3.1

TextReasoning

Azure

OpenAI: GPT-5.4 Nano

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and efficiency over deep reasoning, making it ideal for pipelines that require fast, reliable outputs at scale. GPT-5.4 nano is well suited for background tasks, real-time systems, and distributed agent architectures where minimizing cost and latency is essential.

TextFileImage

Context

400K

Group

GPT

Pricing preview

Input Price: $0.2 /M tokens

Output Price: $1.25 /M tokens

Slug

openai/gpt-5.4-nano

TextReasoning

Azure

OpenAI: GPT-5.4 Mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.

TextFileImage

Context

400K

Group

GPT

Pricing preview

Input Price: $0.75 /M tokens

Output Price: $4.5 /M tokens

Slug

openai/gpt-5.4-mini

TextReasoning

Azure

OpenAI: GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs. Optimized for step-by-step reasoning, instruction following, and accuracy, GPT-5.4 Pro excels at agentic coding, long-context workflows, and multi-step problem solving.

TextImageFile

Context

1.1M

Group

GPT

Pricing preview

Input Price: $30 /M tokens

Output Price: $18 /M tokens

Slug

openai/gpt-5.4-pro

TextReasoning

Azure

OpenAI: GPT-5.4

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following. It is designed as a strong default for both general-purpose tasks and software engineering, capable of generating production-quality code, synthesizing information across multiple sources, and executing complex multi-step workflows with fewer iterations and greater token efficiency.

TextImageFile

Context

1.1M

Group

GPT

Pricing preview

Input Price: $2.5 /M tokens

Output Price: $15 /M tokens

Slug

openai/gpt-5.4

Text

OpenAI

OpenAI: GPT-5.3 Chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualization and significantly reduces unnecessary refusals, caveats, and overly cautious phrasing that can interrupt conversational flow.

TextImageFile

Context

128K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Slug

openai/gpt-5.3-chat

TextReasoning

OpenAI

OpenAI: GPT-5.3-Codex

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results on SWE-Bench Pro and strong performance on Terminal-Bench 2.0 and OSWorld-Verified, reflecting improved multi-language coding, terminal proficiency, and real-world computer-use skills. The model is optimized for long-running, tool-using workflows and supports interactive steering during execution, making it suitable for complex development tasks, debugging, deployment, and iterative product work. Beyond coding, GPT-5.3-Codex performs strongly on structured knowledge-work benchmarks such as GDPval, supporting tasks like document drafting, spreadsheet analysis, slide creation, and operational research across domains. It is trained with enhanced cybersecurity awareness, including vulnerability identification capabilities, and deployed with additional safeguards for high-risk use cases. Compared to prior Codex models, it is more token-efficient and approximately 25% faster, targeting professional end-to-end workflows that span reasoning, execution, and computer interaction.

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Slug

openai/gpt-5.3-codex

Text

OpenAI

OpenAI: GPT Audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

TextAudio

Context

128K

Group

GPT

Pricing preview

Input Price: $2.5 /M tokens

Output Price: $10 /M tokens

Slug

openai/gpt-audio

Text

OpenAI

OpenAI: GPT Audio Mini

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million tokens and output is priced at $2.40 per million tokens.

TextAudio

Context

128K

Group

GPT

Pricing preview

Input Price: $0.6 /M tokens

Output Price: $2.4 /M tokens

Slug

openai/gpt-audio-mini

TextReasoning

Azure

OpenAI: GPT-5.2-Codex

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5.1-Codex, 5.2-Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter. Read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level) Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically—providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.

TextImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Slug

openai/gpt-5.2-codex

Text

Azure

OpenAI: GPT-5.2 Chat

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.2 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

TextFileImage

Context

128K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Slug

openai/gpt-5.2-chat

TextReasoning

OpenAI

OpenAI: GPT-5.2 Pro

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy in high-stakes use cases. It supports test-time routing features and advanced prompt understanding, including user-specified intent like "think hard about this." Improvements include reductions in hallucination, sycophancy, and better performance in coding, writing, and health-related tasks.

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $21 /M tokens

Output Price: $168 /M tokens

Slug

openai/gpt-5.2-pro

TextReasoning

Azure

OpenAI: GPT-5.2

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. Built for broad task coverage, GPT-5.2 delivers consistent gains across math, coding, sciende, and tool calling workloads, with more coherent long-form answers and improved tool-use reliability.

TextFileImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.75 /M tokens

Output Price: $14 /M tokens

Slug

openai/gpt-5.2

TextReasoning

OpenAI

OpenAI: GPT-5.1-Codex-Max

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic workflows spanning software engineering, mathematics, and research. GPT-5.1-Codex-Max delivers faster performance, improved reasoning, and higher token efficiency across the development lifecycle.

TextImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Slug

openai/gpt-5.1-codex-max

TextReasoning

Azure

OpenAI: GPT-5.1

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning to allocate computation dynamically, responding quickly to simple queries while spending more depth on complex tasks. The model produces clearer, more grounded explanations with reduced jargon, making it easier to follow even on technical or multi-step problems. Built for broad task coverage, GPT-5.1 delivers consistent gains across math, coding, and structured analysis workloads, with more coherent long-form answers and improved tool-use reliability. It also features refined conversational alignment, enabling warmer, more intuitive responses without compromising precision. GPT-5.1 serves as the primary full-capability successor to GPT-5

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Slug

openai/gpt-5.1

Text

OpenAI

OpenAI: GPT-5.1 Chat

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.1 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

TextFileImage

Context

128K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Slug

openai/gpt-5.1-chat

TextReasoning

OpenAI

OpenAI: GPT-5.1-Codex

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5.1, Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter. Read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level) Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically—providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.

TextImage

Context

400K

Group

GPT

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $10 /M tokens

Slug

openai/gpt-5.1-codex

TextReasoning

Azure

OpenAI: GPT-5.1-Codex-Mini

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

TextImage

Context

400K

Group

GPT

Pricing preview

Input Price: $0.25 /M tokens

Output Price: $2 /M tokens

Slug

openai/gpt-5.1-codex-mini

Embeddings

OpenAI

OpenAI: Text Embedding Ada 002

text-embedding-ada-002 is OpenAI's legacy text embedding model.

EmbeddingsText

Context

8.2K

Group

Other

Pricing preview

Input Price: $0.1 /M tokens

Slug

openai/text-embedding-ada-002

Embeddings

OpenAI

OpenAI: Text Embedding 3 Large

text-embedding-3-large is OpenAI's most capable embedding model for both english and non-english tasks. Embeddings are a numerical representation of text that can be used to measure the relatedness between two pieces of text. Embeddings are useful for search, clustering, recommendations, anomaly detection, and classification tasks.

EmbeddingsText

Context

8.2K

Group

Other

Pricing preview

Input Price: $0.13 /M tokens

Slug

openai/text-embedding-3-large

Embeddings

OpenAI

OpenAI: Text Embedding 3 Small

text-embedding-3-small is OpenAI's improved, more performant version of the ada embedding model. Embeddings are a numerical representation of text that can be used to measure the relatedness between two pieces of text. Embeddings are useful for search, clustering, recommendations, anomaly detection, and classification tasks.

EmbeddingsText

Context

8.2K

Group

Other

Pricing preview

Input Price: $0.02 /M tokens

Slug

openai/text-embedding-3-small

ImageReasoning

OpenAI

OpenAI: GPT-5 Image Mini

GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by [GPT-5 Mini](https://openrouter.ai/openai/gpt-5-mini), with GPT Image 1 Mini for efficient image generation. This natively multimodal model features superior instruction following, text rendering, and detailed image editing with reduced latency and cost. It excels at high-quality visual creation while maintaining strong text understanding, making it ideal for applications that require both efficient image generation and text processing at scale.

ImageTextFile

Context

400K

Group

GPT

Pricing preview

Input Price: $2.5 /M tokens

Output Price: $2 /M tokens

Slug

openai/gpt-5-image-mini

ImageReasoning

OpenAI

OpenAI: GPT-5 Image

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following, text rendering, and detailed image editing.

ImageTextFile

Context

400K

Group

GPT

Pricing preview

Input Price: $10 /M tokens

Output Price: $10 /M tokens

Slug

openai/gpt-5-image

TextReasoning

OpenAI

OpenAI: o3 Deep Research

o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

TextImageFile

Context

200K

Group

GPT

Pricing preview

Input Price: $10 /M tokens

Output Price: $40 /M tokens

Slug

openai/o3-deep-research

TextReasoning

OpenAI

OpenAI: o4 Mini Deep Research

o4-mini-deep-research is OpenAI's faster, more affordable deep research model—ideal for tackling complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

TextFileImage

Context

200K

Group

GPT

Pricing preview

Input Price: $2 /M tokens

Output Price: $8 /M tokens

Slug

openai/o4-mini-deep-research

TextReasoning

OpenAI

OpenAI: GPT-5 Pro

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy in high-stakes use cases. It supports test-time routing features and advanced prompt understanding, including user-specified intent like "think hard about this." Improvements include reductions in hallucination, sycophancy, and better performance in coding, writing, and health-related tasks.

TextImageFile

Context

400K

Group

GPT

Pricing preview

Input Price: $15 /M tokens

Output Price: $12 /M tokens

Slug

openai/gpt-5-pro

Page 1 of 15

PreviousNext

Need a model request?

Use the market snapshot for discovery, then ask ImaRouter for rollout.

If a model matters for your product, send the slug, expected traffic, target region, and latency expectations. The team can confirm support status, onboarding priority, or a migration path to an equivalent route on ImaRouter.

Contact

support@imarouter.com

Best for model availability questions, onboarding priority, routing strategy, and enterprise rollout planning.

Models | ImaRouter