DeepSeek
DeepSeek V4 Flash
前沿3.4M下载量1.3K点赞Apr 2026发布日期1.0M tokens上下文MIT许可证98 卓越质量
DeepSeek V4 Flash (284B parameters) requires approximately 160.8 GB of VRAM with NVFP4 quantization. As a Mixture of Experts model with 13B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 185 GB of VRAM.
快速开始
— 复制粘贴即可本地运行Copy-paste commands to run DeepSeek V4 Flash on your machine.
Run
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
--hf-repo "deepseek-ai/DeepSeek-V4-Flash" \
--hf-file "DeepSeek-V4-Flash-NVFP4.gguf" \
-c 4096 -ngl 99Quick specs
Parameters284B (13B active)
Architecturemoe (MoE)
Context1.0M tokens
Modalitytext
Min RAM110.8 GB
Rec. RAM159 GB (NVFP4)
LicenseMIT
FamilyDeepSeek
✓ Code✓ Reasoning
About this model
- •284B total / 13B active sparse MoE — 256 routed + 1 shared expert
- •Native FP4 experts: ~158 GB on disk
- •1M-token context with near-frontier coding quality
- •Runs on a single 192 GB unified-memory box or a small GPU server
相关模型
快速推荐
最佳硬件
DeepSeek V4 Flash 的最佳选择
运行此模型
量化选项
各量化级别的 VRAM 估算
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 110.8 GB | Low | — |
Q3_K_S | 3 | 139.2 GB | Low | — |
NVFP4 | 4 | 159.0 GB | Medium | — |
Q4_K_M | 4 | 173.2 GB | Medium | — |
Q5_K_M | 5 | 204.5 GB | High | — |
Q6_K | 6 | 232.9 GB | High | — |
Q8_0 | 8 | 303.9 GB | Very High | — |
F16 | 16 | 582.2 GB | Maximum | — |
Quality benchmarks
DeepSeek V4 Flash benchmark scores
Coding
SWE-bench Verified—
HumanEval+—
Aider Polyglot—
LiveCodeBench91.6%
Reasoning
MMLU-Pro86.2%
GPQA Diamond—
MATH-500—
ARC Challenge—
Source: vendor-reported · 2026-04-24
硬件兼容性
全部硬件的适配估算
Computing compatibility...
内存详细分析
Reference: RTX 2060 6GB
Weights158.0 GB
KV Cache1.3 GB
Runtime0.9 GB
Headroom0.6 GB
常见问题
FAQ — DeepSeek V4 Flash
另请参阅