Image APIConnected models

Text-to-Image API

Generate commercial-grade images through one API. Access Seedream 5.0, GPT Image 2, Midjourney, Nano Banana Pro, and more — unified key, unified billing, no per-model integrations.

Nano Banana Pro API GPT Image 2 API

GPT Image 2MidjourneyNano Banana ProFlux

Creative preview

One request shape, multiple image engines.

Ship batch generation, edits, and prompt-based visual production without splitting your pipeline across providers.

Image engines

Switch models without rewriting request adapters.

Batch jobs

Submit up to twenty prompts in one async request.

Output ceiling

Commercial image workflows from concept through high-res delivery.

Overview

Why one API for image generation

Image generation models are not interchangeable. Seedream 5.0 is built for commercial image generation and editing. Midjourney excels at stylized art direction. GPT Image 2 handles instruction-following and text rendering. Nano Banana Pro leads on multi-image composition and realistic edits. The right model depends on the job.

Why one API for image generation

Instead of signing up for each provider separately, managing separate API keys, and writing per-model request adapters, ImaRouter gives you a single integration that reaches all of them. Switch models by changing one parameter — your pipeline stays unchanged.

Access multiple top image models with one API key
Unified request schema — no per-provider adapter code
Output normalization: consistent image URL response across all models
Unified cost tracking and billing across all image providers
Commercial-use output supported on all integrated models

Supported models and their strengths

Each model in ImaRouter's image catalog has a distinct capability profile. Understanding which model to use for which job lets you produce better outputs at lower cost.

GPT Image 2 — Instruction-following, in-image text rendering, product mockups, UI screenshots
Seedream 5.0 — ByteDance image generation, editing, subject preservation, and commercial composition
Midjourney — Stylized art direction, editorial illustration, brand visual identity
Nano Banana Pro — Multi-image composition, image editing, relighting, 4K commercial output
Stable Diffusion XL — Fast iteration, open-weight flexibility, custom fine-tune support
Flux — Photorealistic output, fashion and lifestyle imagery

Capabilities

What you can build

The Text-to-Image API is built for product teams, not just AI experimenters. These are the integration patterns teams use it for in production.

What you can build

The Text-to-Image API is built for product teams, not just AI experimenters. These are the integration patterns teams use it for in production.

E-commerce product image generation: generate lifestyle shots, background swaps, and model-on-product imagery at scale
Ad creative automation: generate dozens of creative variants per campaign in one batch request
Social content tools: let users generate on-brand visuals from a text prompt inside your app
Design workflow acceleration: generate mood board imagery, layout comps, and icon concepts
Content moderation testing: generate edge-case image samples for training content filters

Async generation and batch support

High-quality image models can take 5–30 seconds per generation. For product workflows that generate multiple images at once, synchronous blocking requests create a bad user experience.

ImaRouter supports async generation with job polling. Submit a generation request, receive a job ID, and poll for completion — or use a webhook to receive results when the image is ready. Batch endpoints let you submit up to 20 generation jobs in one API call.

POST /v1/image/generate — synchronous, returns image URL directly for fast models
POST /v1/image/generate/async — returns job ID, poll with GET /v1/jobs/{jobId}
Webhook delivery: receive completed images at your endpoint without polling
Batch: submit up to 20 prompts in one request, receive all results when complete

Getting started

Send a standard POST request to /v1/image/generate with your prompt, model, and output size. Use your ImaRouter API key in the Authorization header. The response contains the image URL and metadata including model used, generation time, and cost.

If you already use the OpenAI images API, the schema is compatible — swap the base URL and key to get immediate access to the full model catalog.

Base URL: https://api.imarouter.com/v1/image/generate
Required params: prompt (string), model (string), size (string)
Response: url (string), model, width, height, cost_usd
SDK: compatible with OpenAI image generation client shape

FAQ

Are the generated images commercially usable?

All models accessible through ImaRouter produce commercially licensed output. You own the generated images and can use them in products, ads, and publications without additional licensing fees beyond the generation cost.

What image sizes and formats are supported?

Supported sizes vary by model, ranging from 512×512 to 4096×4096. Common aspect ratios (1:1, 4:3, 16:9, 9:16) are supported across all integrated models. Output is delivered as PNG or JPEG via CDN URL.

Can I fine-tune a model on my own images?

Fine-tuning is available for select models including Stable Diffusion XL and Flux. Upload a training dataset through the dashboard and reference your fine-tuned model ID in generation requests.

How is billing handled across multiple models?

All image generation is billed on a per-image basis at the rate listed for each model in the pricing dashboard. You receive a single monthly invoice covering all models and endpoints. There are no platform fees or minimum commitments.

Launch paths

Explore more routes from the same stack

Text-to-Video API

Ship cinematic clips from prompts with one contract. Top models: Seedance 2.5, Seedance 2.0 mini, Happy Horse 1.1, Wan 2.7 — routing across providers for latency and cost.

Learn more

LLM Routing API

One key for every frontier and open-weight LLM. Route Claude Fable 5, Claude 4.8, Claude 4.7, GPT, Deepseek, and Qwen with automatic fallback, cost caps, and unified billing — zero per-provider wiring.

Learn more

Speech & Audio API

Transcribe with Whisper, synthesize voice with ElevenLabs, generate music with Suno — all through one endpoint. Same key, same billing as your video and image workloads.

Learn more

Text-to-Image API

One request shape, multiple image engines.

Image engines

Batch jobs

Output ceiling

Why one API for image generation

Why one API for image generation

Supported models and their strengths

What you can build

What you can build

Async generation and batch support

Getting started

FAQ

Related links and launch paths

Explore more routes from the same stack

Text-to-Video API

LLM Routing API

Speech & Audio API