DeepSeek
DeepSeek R1 671B
Frontier · 5.2M downloads · 13.4K likes · Released Jan 2025 · 131K tokens context · MIT license · Quality: 84 (Excellent)
DeepSeek R1 671B requires approximately 470.4 GB of VRAM with Q4_K_M quantization: 409.3 GB for the weights, plus KV cache and runtime overhead. As a Mixture of Experts model, it activates only 37B of its 671B parameters per token, which reduces per-token compute, although all of the weights must still be held in memory. For the best balance of quality and speed, we recommend hardware with at least 541 GB of VRAM.
Quick start
Copy-paste commands to run DeepSeek R1 671B on your machine.
Run
ollama run deepseek-r1
Quick specs
- Parameters: 671B (37B active)
- Architecture: Mixture of Experts (MoE)
- Context: 131K tokens
- Modality: text
- Min RAM: 261.7 GB
- Rec. RAM: 409.3 GB (Q4_K_M)
- License: MIT
- Family: DeepSeek
- Capabilities: ✓ Chat, ✓ Reasoning
About this model
- We directly apply reinforcement learning (RL) to the base model without relying on supervised fine-tuning (SFT) as a preliminary step. This...
- We introduce our pipeline to develop DeepSeek-R1. The pipeline incorporates two RL stages aimed at discovering improved reasoning patterns and...
Quantization options
VRAM estimates for each quantization level
| Quant | Bits | VRAM | Quality |
|---|---|---|---|
| Q2_K | 2 | 261.7 GB | Low |
| Q3_K_S | 3 | 328.8 GB | Low |
| NVFP4 | 4 | 375.8 GB | Medium |
| Q4_K_M | 4 | 409.3 GB | Medium |
| Q5_K_M | 5 | 483.1 GB | High |
| Q6_K | 6 | 550.2 GB | High |
| Q8_0 | 8 | 718.0 GB | Very High |
| F16 | 16 | 1375.6 GB | Maximum |
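The table can be turned into a small lookup for choosing a quantization level. The sketch below is only illustrative: the function names are hypothetical, the figures are copied from the table, and the bits-per-weight calculation assumes 1 GB = 10^9 bytes, which the page does not state. Note that the Q4_K_M figure matches the weights-only line in the memory breakdown, so these estimates appear to exclude KV cache and runtime overhead.

```python
# VRAM estimates in GB, copied from the quantization table above.
QUANT_VRAM_GB = {
    "Q2_K": 261.7,
    "Q3_K_S": 328.8,
    "NVFP4": 375.8,
    "Q4_K_M": 409.3,
    "Q5_K_M": 483.1,
    "Q6_K": 550.2,
    "Q8_0": 718.0,
    "F16": 1375.6,
}

TOTAL_PARAMS_BILLION = 671  # total parameter count from the quick specs


def largest_quant_that_fits(available_gb):
    """Return the largest listed quantization whose estimate fits, or None.

    Hypothetical helper: it compares against the table figures only and
    leaves no room for KV cache or runtime overhead.
    """
    fitting = {q: gb for q, gb in QUANT_VRAM_GB.items() if gb <= available_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def effective_bits_per_weight(quant):
    """Estimated size divided by parameter count (assumes 1 GB = 1e9 bytes)."""
    return QUANT_VRAM_GB[quant] * 8 / TOTAL_PARAMS_BILLION


if __name__ == "__main__":
    # Example: a hypothetical node with 8 accelerators of 80 GB each.
    print(largest_quant_that_fits(8 * 80))
    print(round(effective_bits_per_weight("Q4_K_M"), 2))
```

The effective bits per weight come out somewhat higher than the nominal "Bits" column for most rows, which is expected for block-based formats that store scales alongside the quantized values.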
Quality benchmarks
DeepSeek R1 671B benchmark scores
Coding
- SWE-bench Verified: 49.2%
- HumanEval+: 85.0%
- Aider Polyglot: 53.3%
- LiveCodeBench: 65.9%
Reasoning
- MMLU-Pro: 84.0%
- GPQA Diamond: 71.5%
- MATH-500: 97.3%
- ARC Challenge: n/a
General
- Chatbot Arena: n/a
- IFEval: 83.3%
Source: official · 2025-01-20
Memory breakdown
Reference: RTX 2060 6GB
- Weights: 409.3 GB
- KV Cache: 59.6 GB
- Runtime: 0.9 GB
- Headroom: 0.6 GB
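The four components above add up to the 470.4 GB figure quoted in the summary. The following sketch reproduces that arithmetic. The 15% safety margin is an assumption: it happens to land close to the 541 GB recommendation, but the page does not say how that number was derived, and the function names are hypothetical.

```python
# Memory components in GB for Q4_K_M, copied from the breakdown above.
COMPONENTS_GB = {
    "weights": 409.3,
    "kv_cache": 59.6,
    "runtime": 0.9,
    "headroom": 0.6,
}


def total_memory_gb(components):
    """Sum the components, rounded to one decimal place like the page."""
    return round(sum(components.values()), 1)


def with_margin(total_gb, margin=0.15):
    """Add a safety margin on top of the total.

    The 15% default is an assumption chosen for illustration, not a
    documented rule.
    """
    return total_gb * (1 + margin)


if __name__ == "__main__":
    total = total_memory_gb(COMPONENTS_GB)
    print(f"Total: {total} GB")
    print(f"With 15% margin: {with_margin(total):.0f} GB")
```

Under these assumptions the total is dominated by the weights, so changing the quantization level shifts the requirement far more than changing the context length would.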