Model Directory

Search the modelsyour product can ship.

Browse live routes for Seedance 2.5, Seedance 2.0 mini, Seedance 2.0, Happy Horse 1.1, Seedream 5.0, Claude Fable 5, Claude 4.8, Claude 4.7, Claude 4.6, Claude 4.5, ElevenLabs, Suno, and the rest of your image, video, audio, TTS, and text stack.

Seedance 2.5Seedance 2.0 miniSeedance 2.0Happy Horse 1.1Seedream 5.0Claude Fable 5Claude 4.8Claude 4.7Claude 4.6Claude 4.5ElevenLabsSuno

Live catalogOpen the current model pricing dashboardJump to live model prices, availability, API keys, and route setup.

Search by provider, compare context and pricing signals, then route the right model through one API.

Open live pricing Request a model

Search and filter

Filter by modality, provider, reasoning, or sort order.

Modality

Provider

Sort

Reasoning only

All (683)Audio (23)Embeddings (26)File (71)Image (226)Rerank (3)Text (683)TTS (2)Video (47)

Results

Google AI Studio

Google: Gemma 3n 4B (free)

TextLive

google/gemma-3n-e4b-it

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks such as text generation, speech recognition, translation, and image analysis. Leveraging innovations like Per-Layer Embedding (PLE) caching and the MatFormer architecture, Gemma 3n dynamically manages memory usage and computational load by selectively activating model parameters, significantly reducing runtime resource requirements. This model supports a wide linguistic range (trained in over 140 languages) and features a flexible 32K token context window. Gemma 3n can selectively load parameters, optimizing memory and computational efficiency based on the task or device capabilities, making it well-suited for privacy-focused, offline-capable applications and on-device AI solutions. [Read more in the blog post](https://developers.googleblog.com/en/introducing-gemma-3n/)

Text

Context

8.2K

Group

Other

Pricing preview

Input Price: $0 /M tokens

Output Price: $0 /M tokens

Need volume pricing or launch support?

Search the modelsyour product can ship.

Google: Gemma 3n 4B (free)

Google: Gemma 3n 4B

Meta: Llama 3.3 8B Instruct

Nous: DeepHermes 3 Mistral 24B Preview

Mistral: Mistral Medium 3

Google: Gemini 2.5 Pro Preview 05-06

Arcee AI: Caller Large

Arcee AI: Spotlight

Arcee AI: Maestro Reasoning

Arcee AI: Virtuoso Large

Arcee AI: Coder Large

Arcee AI: Virtuoso Medium V2

Arcee AI: Arcee Blitz

Microsoft: Phi 4 Reasoning Plus

Microsoft: Phi 4 Reasoning

Qwen: Qwen3 0.6B

Inception: Mercury Coder

Qwen: Qwen3 1.7B

Qwen: Qwen3 4B

OpenGVLab: InternVL3 14B

OpenGVLab: InternVL3 2B

DeepSeek: DeepSeek Prover V2

Meta: Llama Guard 4 12B

Qwen: Qwen3 30B A3B

Qwen: Qwen3 8B

Qwen: Qwen3 14B

Qwen: Qwen3 32B

Qwen: Qwen3 235B A22B

TNG: DeepSeek R1T Chimera

THUDM: GLM Z1 Rumination 32B

THUDM: GLM Z1 9B

THUDM: GLM 4 9B

Microsoft: MAI DS R1

THUDM: GLM Z1 32B

THUDM: GLM 4 32B

Qwen: Qwen2.5 Coder 7B Instruct

AlfredPros: CodeLLaMa 7B Instruct Solidity

ArliAI: QwQ 32B RpR v1

Agentica: Deepcoder 14B Preview

MoonshotAI: Kimi VL A3B Thinking

xAI: Grok 3 Mini Beta

xAI: Grok 3 Beta

NVIDIA: Llama 3.1 Nemotron Nano 8B v1

NVIDIA: Llama 3.3 Nemotron Super 49B v1

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Swallow: Llama 3.1 Swallow 8B Instruct V0.3

Meta: Llama 4 Maverick

Meta: Llama 4 Scout

Need a model that is not in the directory yet?