Video Generation API is now live!

Models

Explore the active model market,from a local OpenRouter snapshot.

This page reads from a local JSON snapshot synced from OpenRouter, so the catalog stays fast, indexable, and stable. Use it to browse current model coverage by provider, modality, reasoning support, context window, and pricing metadata.

Reset

Results

Showing 48 of 683 matching models

Snapshot source: OpenRouter. Synced April 21, 2026 at 8:00 AM. Page 14 of 15.

This route is built from local JSON so the catalog stays stable for browsing and SEO. If you need a specific model on ImaRouter, treat this page as a discovery reference and then contact the team for availability.

Text

Mistral

Mistral: Mistral Large 3 2512

Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license.

TextImage

Context

262.1K

Group

Mistral

Pricing preview

Input Price: $0.5 /M tokens

Output Price: $1.5 /M tokens

Slug

mistralai/mistral-large-2512

Text

Unknown provider

AllenAI: Olmo 3 7B Instruct

Olmo 3 7B Instruct is a supervised instruction-fine-tuned variant of the Olmo 3 7B base model, optimized for instruction-following, question-answering, and natural conversational dialogue. By leveraging high-quality instruction data and an open training pipeline, it delivers strong performance across everyday NLP tasks while remaining accessible and easy to integrate. Developed by Ai2 under the Apache 2.0 license, the model offers a transparent, community-friendly option for instruction-driven applications.

Text

Context

65.5K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

allenai/olmo-3-7b-instruct

TextReasoning

Unknown provider

Sherlock Think Alpha

This model was an early snapshot of Grok 4.1 Fast with reasoning enabled. Try the official launch of Grok 4.1 Fast [here](/x-ai/grok-4.1-fast) This is a cloaked model provided to the community to gather feedback. A frontier reasoning model that excels at tool calling, with a 1.8M context window and multimodal support. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

TextImage

Context

1.8M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/sherlock-think-alpha

Text

Unknown provider

Sherlock Dash Alpha

This model was an early snapshot of Grok 4.1 Fast with reasoning disabled. Try the official launch of Grok 4.1 Fast [here](/x-ai/grok-4.1-fast) This is a cloaked model provided to the community to gather feedback. A frontier non-reasoning model that excels at tool calling, with a 1.8M context window and multimodal support. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

TextImage

Context

1.8M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/sherlock-dash-alpha

Text

Unknown provider

Polaris Alpha

This model was an early snapshot of GPT-5.1 with reasoning effort set to minimal. Try the official launch of GPT-5.1 [here](/openai/gpt-5.1) This is a cloaked model provided to the community to gather feedback. A powerful, general-purpose model that excels across real-world tasks, with standout performance in coding, tool calling, and instruction following. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

TextImage

Context

256K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/polaris-alpha

TextReasoning

Together

Deep Cogito: Cogito v2.1 671B

Cogito v2.1 671B MoE represents one of the strongest open models globally, matching performance of frontier closed and open models. This model is trained using self play with reinforcement learning to reach state-of-the-art performance on multiple categories (instruction following, coding, longer queries and creative writing). This advanced system demonstrates significant progress toward scalable superintelligence through policy improvement.

Text

Context

128K

Group

Other

Pricing preview

Input Price: $1.25 /M tokens

Output Price: $1.25 /M tokens

Slug

deepcogito/cogito-v2.1-671b

Embeddings

DeepInfra

Intfloat: Multilingual-E5-Large

The multilingual-e5-large embedding model encodes sentences, paragraphs, and documents across over 90 languages into a 1024-dimensional dense vector space, delivering robust semantic embeddings optimized for multilingual retrieval, cross-language similarity, and large-scale data search.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.01 /M tokens

Slug

intfloat/multilingual-e5-large

Embeddings

DeepInfra

Intfloat: E5-Base-v2

The e5-base-v2 embedding model encodes English sentences and paragraphs into a 768-dimensional dense vector space, producing efficient and high-quality semantic embeddings optimized for tasks such as semantic search, similarity scoring, retrieval and clustering.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

intfloat/e5-base-v2

Embeddings

DeepInfra

Intfloat: E5-Large-v2

The e5-large-v2 embedding model maps English sentences, paragraphs, and documents into a 1024-dimensional dense vector space, delivering high-accuracy semantic embeddings optimized for retrieval, semantic search, reranking, and similarity-scoring tasks.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.01 /M tokens

Slug

intfloat/e5-large-v2

Embeddings

DeepInfra

Thenlper: GTE-Large

The gte-large embedding model converts English sentences, paragraphs and moderate-length documents into a 1024-dimensional dense vector space, delivering high-quality semantic embeddings optimized for information retrieval, semantic textual similarity, reranking and clustering tasks. Trained via multi-stage contrastive learning on a large domain-diverse relevance corpus, it offers excellent performance across general-purpose embedding use-cases.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.01 /M tokens

Slug

thenlper/gte-large

Embeddings

DeepInfra

Thenlper: GTE-Base

The gte-base embedding model encodes English sentences and paragraphs into a 768-dimensional dense vector space, delivering efficient and effective semantic embeddings optimized for textual similarity, semantic search, and clustering applications.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

thenlper/gte-base

Embeddings

DeepInfra

BAAI: bge-m3

The bge-m3 embedding model encodes sentences, paragraphs, and long documents into a 1024-dimensional dense vector space, delivering high-quality semantic embeddings optimized for multilingual retrieval, semantic search, and large-context applications.

EmbeddingsText

Context

8.2K

Group

Other

Pricing preview

Input Price: $0.01 /M tokens

Slug

baai/bge-m3

Embeddings

DeepInfra

BAAI: bge-large-en-v1.5

The bge-large-en-v1.5 embedding model maps English sentences, paragraphs, and documents into a 1024-dimensional dense vector space, delivering high-fidelity semantic embeddings optimized for semantic search, document retrieval, and downstream NLP tasks in English.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.01 /M tokens

Slug

baai/bge-large-en-v1.5

Embeddings

DeepInfra

Sentence Transformers: multi-qa-mpnet-base-dot-v1

The multi-qa-mpnet-base-dot-v1 embedding model transforms sentences and short paragraphs into a 768-dimensional dense vector space, generating high-quality semantic embeddings optimized for question-and-answer retrieval, semantic search, and similarity-scoring across diverse content.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

sentence-transformers/multi-qa-mpnet-base-dot-v1

Embeddings

DeepInfra

BAAI: bge-base-en-v1.5

The bge-base-en-v1.5 embedding model converts English sentences and paragraphs into 768-dimensional dense vectors, delivering efficient, high-quality semantic embeddings optimized for retrieval, semantic search, and document-matching workflows. This version (v1.5) features improved similarity-score distribution and stronger retrieval performance out of the box.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

baai/bge-base-en-v1.5

Embeddings

DeepInfra

Sentence Transformers: all-MiniLM-L12-v2

The all-MiniLM-L12-v2 embedding model maps sentences and short paragraphs into a 384-dimensional dense vector space, producing efficient and high-quality semantic embeddings optimized for tasks such as semantic search, clustering, and similarity-scoring.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

sentence-transformers/all-minilm-l12-v2

Embeddings

DeepInfra

Sentence Transformers: paraphrase-MiniLM-L6-v2

The paraphrase-MiniLM-L6-v2 embedding model converts sentences and short paragraphs into a 384-dimensional dense vector space, producing high-quality semantic embeddings optimized for paraphrase detection, semantic similarity scoring, clustering, and lightweight retrieval tasks.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

sentence-transformers/paraphrase-minilm-l6-v2

Embeddings

DeepInfra

Sentence Transformers: all-MiniLM-L6-v2

The all-MiniLM-L6-v2 embedding model maps sentences and short paragraphs into a 384-dimensional dense vector space, enabling high-quality semantic representations that are ideal for downstream tasks such as information retrieval, clustering, similarity scoring, and text ranking.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

sentence-transformers/all-minilm-l6-v2

Embeddings

DeepInfra

Sentence Transformers: all-mpnet-base-v2

The all-mpnet-base-v2 embedding model encodes sentences and short paragraphs into a 768-dimensional dense vector space, providing high-fidelity semantic embeddings well suited for tasks like information retrieval, clustering, similarity scoring, and text ranking.

EmbeddingsText

Context

512

Group

Other

Pricing preview

Input Price: $0.005 /M tokens

Slug

sentence-transformers/all-mpnet-base-v2

Embeddings

Unknown provider

Qwen: Qwen3 Embedding 0.6B

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. This series inherits the exceptional multilingual capabilities, long-text understanding, and reasoning skills of its foundational model. The Qwen3 Embedding series represents significant advancements in multiple text embedding and ranking tasks, including text retrieval, code retrieval, text classification, text clustering, and bitext mining.

EmbeddingsText

Context

8.2K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

qwen/qwen3-embedding-0.6b

Text

Amazon Bedrock

Amazon: Nova Premier 1.0

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.

TextImage

Context

1M

Group

Nova

Pricing preview

Input Price: $2.5 /M tokens

Output Price: $12.5 /M tokens

Slug

amazon/nova-premier-v1

Embeddings

Mistral

Mistral: Mistral Embed 2312

Mistral Embed is a specialized embedding model for text data, optimized for semantic search and RAG applications. Developed by Mistral AI in late 2023, it produces 1024-dimensional vectors that effectively capture semantic relationships in text.

EmbeddingsText

Context

8.2K

Group

Mistral

Pricing preview

Input Price: $0.1 /M tokens

Slug

mistralai/mistral-embed-2312

Embeddings

Google AI Studio

Google: Gemini Embedding 001

gemini-embedding-001 provides a unified cutting edge experience across domains, including science, legal, finance, and coding. This embedding model has consistently held a top spot on the Massive Text Embedding Benchmark (MTEB) Multilingual leaderboard since the experimental launch in March.

EmbeddingsText

Context

20K

Group

Gemini

Pricing preview

Input Price: $0.15 /M tokens

Slug

google/gemini-embedding-001

Embeddings

Mistral

Mistral: Codestral Embed 2505

Mistral Codestral Embed is specially designed for code, perfect for embedding code databases, repositories, and powering coding assistants with state-of-the-art retrieval.

EmbeddingsText

Context

8.2K

Group

Mistral

Pricing preview

Input Price: $0.15 /M tokens

Slug

mistralai/codestral-embed-2505

Text

Mistral

Mistral: Voxtral Small 24B 2507

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translation and audio understanding. Input audio is priced at $100 per million seconds.

TextAudio

Context

32K

Group

Mistral

Pricing preview

Input Price: $0.1 /M tokens

Output Price: $0.3 /M tokens

Slug

mistralai/voxtral-small-24b-2507

Embeddings

Nebius Token Factory

Qwen: Qwen3 Embedding 8B

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. This series inherits the exceptional multilingual capabilities, long-text understanding, and reasoning skills of its foundational model. The Qwen3 Embedding series represents significant advancements in multiple text embedding and ranking tasks, including text retrieval, code retrieval, text classification, text clustering, and bitext mining.

EmbeddingsText

Context

32K

Group

Other

Pricing preview

Input Price: $0.01 /M tokens

Slug

qwen/qwen3-embedding-8b

Embeddings

DeepInfra

Qwen: Qwen3 Embedding 4B

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. This series inherits the exceptional multilingual capabilities, long-text understanding, and reasoning skills of its foundational model. The Qwen3 Embedding series represents significant advancements in multiple text embedding and ranking tasks, including text retrieval, code retrieval, text classification, text clustering, and bitext mining.

EmbeddingsText

Context

32.8K

Group

Other

Pricing preview

Input Price: $0.02 /M tokens

Slug

qwen/qwen3-embedding-4b

TextReasoning

Unknown provider

Andromeda Alpha

This model has been revealed as NVIDIA Nemotron Nano 2 VL. It continues to be offered for free by NVIDIA [here](https://openrouter.ai/nvidia/nemotron-nano-12b-v2-vl:free). This is a small reasoning VLM trained for image understanding. It's strengths include multi-image comprehension (6+ images), especially those containing charts and text. This is a cloaked model provided to the community to gather feedback. Note: All prompts and output are logged to improve the provider’s model and its product and services. Please do not upload any personal, confidential, or otherwise sensitive information. This is a trial use only. Do not use for production or business-critical systems.

TextImage

Context

128K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/andromeda-alpha

Text

Cloudflare

IBM: Granite 4.0 Micro

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long context tool calling.

Text

Context

131K

Group

Other

Pricing preview

Input Price: $0.017 /M tokens

Output Price: $0.11 /M tokens

Slug

ibm-granite/granite-4.0-h-micro

TextReasoning

Unknown provider

Deep Cogito: Cogito V2 Preview Llama 405B

Cogito v2 405B is a dense hybrid reasoning model that combines direct answering capabilities with advanced self-reflection. It represents a significant step toward frontier intelligence with dense architecture delivering performance competitive with leading closed models. This advanced reasoning system combines policy improvement with massive scale for exceptional capabilities.

Text

Context

131.1K

Group

Llama3

Pricing preview

No display pricing published in the current snapshot.

Slug

deepcogito/cogito-v2-preview-llama-405b

Text

NovitaAI

Qwen: Qwen3 VL 8B Instruct

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon temporal reasoning, DeepStack for fine-grained visual-text alignment, and text-timestamp alignment for precise event localization. The model supports a native 256K-token context window, extensible to 1M tokens, and handles both static and dynamic media inputs for tasks like document parsing, visual question answering, spatial reasoning, and GUI control. It achieves text understanding comparable to leading LLMs while expanding OCR coverage to 32 languages and enhancing robustness under varied visual conditions.

TextImage

Context

131.1K

Group

Qwen3

Pricing preview

Input Price: $0.08 /M tokens

Output Price: $0.5 /M tokens

Slug

qwen/qwen3-vl-8b-instruct

Text

Relace

Relace: Relace Apply 3

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at 10,000 tokens/sec on average. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update> Zero Data Retention is enabled for Relace. Learn more about this model in their [documentation](https://docs.relace.ai/api-reference/instant-apply/apply)

Text

Context

256K

Group

Other

Pricing preview

Input Price: $0.85 /M tokens

Output Price: $1.25 /M tokens

Slug

relace/relace-apply-3

Text

Unknown provider

Sonoma Dusk Alpha

This is a cloaked model provided to the community to gather feedback. A fast and intelligent general-purpose frontier model with a 2 million token context window. Supports image inputs and parallel tool calling. Note: It’s free to use during this testing period, and prompts and completions are logged by the model creator for feedback and training.

TextImage

Context

2M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/sonoma-dusk-alpha

TextReasoning

Unknown provider

Sonoma Sky Alpha

This is a cloaked model provided to the community to gather feedback. A maximally intelligent general-purpose frontier model with a 2 million token context window. Supports image inputs and parallel tool calling. Note: It’s free to use during this testing period, and prompts and completions are logged by the model creator for feedback and training.

TextImage

Context

2M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/sonoma-sky-alpha

TextReasoning

Unknown provider

Deep Cogito: Cogito V2 Preview Llama 70B

Cogito v2 70B is a dense hybrid reasoning model that combines direct answering capabilities with advanced self-reflection. Built with iterative policy improvement, it delivers strong performance across reasoning tasks while maintaining efficiency through shorter reasoning chains and improved intuition.

Text

Context

131.1K

Group

Llama3

Pricing preview

No display pricing published in the current snapshot.

Slug

deepcogito/cogito-v2-preview-llama-70b

Text

Unknown provider

Horizon Beta

This is a cloaked model provided to the community to gather feedback. This is an improved version of [Horizon Alpha](/openrouter/horizon-alpha) Note: It’s free to use during this testing period, and prompts and completions are logged by the model creator for feedback and training.

TextImage

Context

256K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/horizon-beta

Text

Unknown provider

Horizon Alpha

This was a cloaked model provided to the community to gather feedback. It has been deprecated - see [Horizon Beta](/openrouter/horizon-beta). Note: It’s free to use during this testing period, and prompts and completions are logged by the model creator for feedback and training.

TextImage

Context

256K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/horizon-alpha

TextReasoning

Switchpoint

Switchpoint Router

Switchpoint AI's router instantly analyzes your request and directs it to the optimal AI from an ever-evolving library. As the world of LLMs advances, our router gets smarter, ensuring you always benefit from the industry's newest models without changing your workflow. This model is configured for a simple, flat rate per response here on OpenRouter. It's powered by the full routing engine from [Switchpoint AI](https://www.switchpoint.dev).

Text

Context

131.1K

Group

Other

Pricing preview

Input Price: $0.85 /M tokens

Output Price: $3.4 /M tokens

Slug

switchpoint/router

Text

Morph

Morph: Morph V3 Large

Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update> Zero Data Retention is enabled for Morph. Learn more about this model in their [documentation](https://docs.morphllm.com/quickstart)

Text

Context

262.1K

Group

Other

Pricing preview

Input Price: $0.9 /M tokens

Output Price: $1.9 /M tokens

Slug

morph/morph-v3-large

Text

Morph

Morph: Morph V3 Fast

Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update> Zero Data Retention is enabled for Morph. Learn more about this model in their [documentation](https://docs.morphllm.com/quickstart)

Text

Context

81.9K

Group

Other

Pricing preview

Input Price: $0.8 /M tokens

Output Price: $1.2 /M tokens

Slug

morph/morph-v3-fast

TextReasoning

Unknown provider

Cypher Alpha

This is a cloaked model provided to the community to gather feedback. It's an all-purpose model supporting real-world, long-context tasks including code generation. Note: All prompts and completions for this model are logged by the provider and may be used to improve the model and other products and services. You remain responsible for any required end user notices and consents and for ensuring that no personal, confidential, or otherwise sensitive information, including data from individuals under the age of 18, is submitted.

Text

Context

1M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/cypher-alpha

Text

Unknown provider

Morph: Fast Apply

Morph Apply is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, Claude, and others into your files at 4000+ tokens per second. The model requires the prompt to be in the following format: <code>${originalCode}</code>\n<update>${updateSnippet}</update> Learn more about this model in their [documentation](https://docs.morphllm.com/)

Text

Context

32K

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

morph/morph-v2

Text

Unknown provider

Optimus Alpha

This is a cloaked model provided to the community to gather feedback. It's geared toward real world use cases, including programming. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

TextImage

Context

1M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/optimus-alpha

Text

Unknown provider

Quasar Alpha

This is a cloaked model provided to the community to gather feedback. It’s a powerful, all-purpose model supporting long-context tasks, including code generation. **Note:** All prompts and completions for this model are logged by the provider and may be used to improve the model.

TextImage

Context

1M

Group

Other

Pricing preview

No display pricing published in the current snapshot.

Slug

openrouter/quasar-alpha

TextReasoning

Perplexity

Perplexity: Sonar Reasoning Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for advanced use cases, it supports in-depth, multi-step queries with a larger context window and can surface more citations per search, enabling more comprehensive and extensible responses.

TextImage

Context

128K

Group

Other

Pricing preview

Input Price: $2 /M tokens

Output Price: $8 /M tokens

Slug

perplexity/sonar-reasoning-pro

Text

Perplexity

Perplexity: Sonar Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) For enterprises seeking more advanced capabilities, the Sonar Pro API can handle in-depth, multi-step queries with added extensibility, like double the number of citations per search as Sonar on average. Plus, with a larger context window, it can handle longer and more nuanced searches and follow-up questions.

TextImage

Context

200K

Group

Other

Pricing preview

Input Price: $3 /M tokens

Output Price: $15 /M tokens

Slug

perplexity/sonar-pro

TextReasoning

Perplexity

Perplexity: Sonar Deep Research

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers information. This enables comprehensive report generation across domains like finance, technology, health, and current events. Notes on Pricing ([Source](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-deep-research)) - Input tokens comprise of Prompt tokens (user prompt) + Citation tokens (these are processed tokens from running searches) - Deep Research runs multiple searches to conduct exhaustive research. Searches are priced at $5/1000 searches. A request that does 30 searches will cost $0.15 in this step. - Reasoning is a distinct step in Deep Research since it does extensive automated reasoning through all the material it gathers during its research phase. Reasoning tokens here are a bit different than the CoTs in the answer - these are tokens that we use to reason through the research material prior to generating the outputs via the CoTs. Reasoning tokens are priced at $3/1M tokens

Text

Context

128K

Group

Other

Pricing preview

Input Price: $2 /M tokens

Output Price: $8 /M tokens

Slug

perplexity/sonar-deep-research

TextReasoning

Unknown provider

Perplexity: R1 1776

R1 1776 is a version of DeepSeek-R1 that has been post-trained to remove censorship constraints related to topics restricted by the Chinese government. The model retains its original reasoning capabilities while providing direct responses to a wider range of queries. R1 1776 is an offline chat model that does not use the perplexity search subsystem. The model was tested on a multilingual dataset of over 1,000 examples covering sensitive topics to measure its likelihood of refusal or overly filtered responses. [Evaluation Results](https://cdn-uploads.huggingface.co/production/uploads/675c8332d01f593dc90817f5/GiN2VqC5hawUgAGJ6oHla.png) Its performance on math and reasoning benchmarks remains similar to the base R1 model. [Reasoning Performance](https://cdn-uploads.huggingface.co/production/uploads/675c8332d01f593dc90817f5/n4Z9Byqp2S7sKUvCvI40R.png) Read more on the [Blog Post](https://perplexity.ai/hub/blog/open-sourcing-r1-1776)

Text

Context

128K

Group

DeepSeek

Pricing preview

No display pricing published in the current snapshot.

Slug

perplexity/r1-1776

Page 14 of 15

Need a model request?

Use the market snapshot for discovery, then ask ImaRouter for rollout.

If a model matters for your product, send the slug, expected traffic, target region, and latency expectations. The team can confirm support status, onboarding priority, or a migration path to an equivalent route on ImaRouter.

Contact

support@imarouter.com

Best for model availability questions, onboarding priority, routing strategy, and enterprise rollout planning.

Models | ImaRouter