Chinese startup says DeepSeek-V4-Pro beats all rival open models for maths and coding.
Model details
DeepSeek V4 Pro is a large Mixture-of-Experts language model built for high-level reasoning and agentic work. Its defining feature is a hybrid attention mechanism that combines Compressed Sparse Attention with Heavily Compressed Attention, substantially reducing inference cost and KV-cache usage over long context windows. To keep signal propagation stable at depth, the design reinforces conventional residual pathways with manifold-constrained hyper-connections, helping the model hold up under demanding, large-scale workloads.
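To make the KV-cache claim concrete, here is a minimal sketch of one common way attention compression saves cache memory: projecting hidden states into a shared low-rank latent and caching only that latent instead of full keys and values. All dimensions, weight names, and the specific scheme are illustrative assumptions, not DeepSeek's actual design.

```python
import numpy as np

# Hypothetical illustration of low-rank KV-cache compression.
# Dimensions and names are assumptions for this sketch only.
d_model, d_latent, n_tokens = 512, 64, 1024
rng = np.random.default_rng(0)

W_down = rng.standard_normal((d_model, d_latent)) / np.sqrt(d_model)   # compress
W_up_k = rng.standard_normal((d_latent, d_model)) / np.sqrt(d_latent)  # expand to K
W_up_v = rng.standard_normal((d_latent, d_model)) / np.sqrt(d_latent)  # expand to V

hidden = rng.standard_normal((n_tokens, d_model))

# Instead of caching full K and V (2 * n_tokens * d_model floats),
# cache only the shared latent (n_tokens * d_latent floats).
latent_cache = hidden @ W_down        # (n_tokens, d_latent)

# Reconstruct K and V on the fly at attention time.
K = latent_cache @ W_up_k
V = latent_cache @ W_up_v

full_cache = 2 * n_tokens * d_model
compressed = n_tokens * d_latent
print(f"cache reduction: {full_cache / compressed:.0f}x")  # 16x with these sizes
```

The trade-off is extra matrix multiplies at decode time in exchange for a much smaller cache, which is what makes very long context windows tractable.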
The model undergoes a two-stage post-training pipeline. First, domain experts are cultivated independently with supervised fine-tuning and group relative policy optimization (GRPO), giving the model nuanced proficiency across diverse subjects. Second, a unified consolidation phase uses on-policy distillation to merge those expert capabilities into a single model that balances specialized knowledge with general-purpose utility.
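The group-relative part of GRPO can be sketched briefly: for each prompt, several completions are sampled and each one's reward is normalized against its own group, removing the need for a separate value model. The reward values below are made up for illustration; nothing here reflects DeepSeek's actual training code.

```python
import numpy as np

def group_relative_advantages(rewards):
    """Core idea of GRPO's advantage estimate: normalize each sampled
    completion's reward within its group,
    advantage_i = (r_i - mean(group)) / std(group)."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)

# One prompt, a group of 4 sampled completions scored by a reward model
# (illustrative scores).
rewards = [0.2, 0.9, 0.5, 0.4]
adv = group_relative_advantages(rewards)
print(np.round(adv, 3))  # completions above the group mean get positive advantage
```

These advantages then weight the policy-gradient update, so the model is pushed toward completions that beat their own sampling group rather than an absolute baseline.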
In practice, the model excels at STEM, mathematics, and complex coding tasks, frequently outperforming other open-weight models and approaching the performance of top-tier closed-source systems. It offers clear advantages in cost efficiency and long-context handling, but it is released as a preview, and the available documentation does not detail the composition of its pre-training data or the full extent of its safety alignment. So while it represents a major step forward for open-source AI, its behavior in highly sensitive or adversarial contexts remains a subject for ongoing evaluation.
According to @deepseek_ai, the DeepSeek API now supports the new deepseek-v4-pro and deepseek-v4-flash models with 1M context windows and dual Thinking and...
DeepSeek V4 is live with two models. V4-Pro approaches Claude Opus 4.6; V4-Flash is faster and cheaper. Here's which to use, how to migrate your API, and what the Huawei chip story actually means.