DeepSeek
DeepSeek R1 671B
Frontier5.2MDownloads13.4KCurtidasJan 2025Publicado131K tokensContextoMITLicença84 ForteQualidade
DeepSeek R1 671B (671B parameters) requires approximately 470.4 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 37B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 541 GB of VRAM.
Comece agora
— copie e cole para rodar localmenteCopy-paste commands to run DeepSeek R1 671B on your machine.
Run
ollama run deepseek-r1Quick specs
Parameters671B (37B active)
Architecturemoe (MoE)
Context131K tokens
Modalitytext
Min RAM261.7 GB
Rec. RAM409.3 GB (Q4_K_M)
LicenseMIT
FamilyDeepSeek
✓ Chat✓ Reasoning
About this model
- •We directly apply reinforcement learning (RL) to the base model without relying on supervised fine-tuning (SFT) as a preliminary step. This...
- •We introduce our pipeline to develop DeepSeek-R1. The pipeline incorporates two RL stages aimed at discovering improved reasoning patterns and...
Modelos relacionados
Opções de quantização
Estimativas de VRAM por nível de quantização
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 261.7 GB | Low | — |
Q3_K_S | 3 | 328.8 GB | Low | — |
NVFP4 | 4 | 375.8 GB | Medium | — |
Q4_K_M | 4 | 409.3 GB | Medium | — |
Q5_K_M | 5 | 483.1 GB | High | — |
Q6_K | 6 | 550.2 GB | High | — |
Q8_0 | 8 | 718.0 GB | Very High | — |
F16 | 16 | 1375.6 GB | Maximum | — |
Quality benchmarks
DeepSeek R1 671B benchmark scores
Coding
SWE-bench Verified49.2%
HumanEval+85.0%
Aider Polyglot53.3%
LiveCodeBench65.9%
Reasoning
MMLU-Pro84.0%
GPQA Diamond71.5%
MATH-50097.3%
ARC Challenge—
General
Chatbot Arena—
IFEval83.3%
Source: official · 2025-01-20
Compatibilidade de hardware
Estimativas de compatibilidade para todo o hardware
Computing compatibility...
Detalhamento de memória
Reference: RTX 2060 6GB
Weights409.3 GB
KV Cache59.6 GB
Runtime0.9 GB
Headroom0.6 GB
Perguntas frequentes
FAQ — DeepSeek R1 671B
Veja também