DeepSeek
DeepSeek R1 671B
Frontera5.2MDescargas13.4KMe gustaJan 2025Publicado131K tokensContextoMITLicencia84 FuerteCalidad
DeepSeek R1 671B (671B parameters) requires approximately 470.4 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 37B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 541 GB of VRAM.
Comenzar
— copia y pega para ejecutar en localCopy-paste commands to run DeepSeek R1 671B on your machine.
Run
ollama run deepseek-r1Quick specs
Parameters671B (37B active)
Architecturemoe (MoE)
Context131K tokens
Modalitytext
Min RAM261.7 GB
Rec. RAM409.3 GB (Q4_K_M)
LicenseMIT
FamilyDeepSeek
✓ Chat✓ Reasoning
About this model
- •We directly apply reinforcement learning (RL) to the base model without relying on supervised fine-tuning (SFT) as a preliminary step. This...
- •We introduce our pipeline to develop DeepSeek-R1. The pipeline incorporates two RL stages aimed at discovering improved reasoning patterns and...
Modelos relacionados
Opciones de cuantización
Estimaciones de VRAM por nivel de cuantización
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 261.7 GB | Low | — |
Q3_K_S | 3 | 328.8 GB | Low | — |
NVFP4 | 4 | 375.8 GB | Medium | — |
Q4_K_M | 4 | 409.3 GB | Medium | — |
Q5_K_M | 5 | 483.1 GB | High | — |
Q6_K | 6 | 550.2 GB | High | — |
Q8_0 | 8 | 718.0 GB | Very High | — |
F16 | 16 | 1375.6 GB | Maximum | — |
Quality benchmarks
DeepSeek R1 671B benchmark scores
Coding
SWE-bench Verified49.2%
HumanEval+85.0%
Aider Polyglot53.3%
LiveCodeBench65.9%
Reasoning
MMLU-Pro84.0%
GPQA Diamond71.5%
MATH-50097.3%
ARC Challenge—
General
Chatbot Arena—
IFEval83.3%
Source: official · 2025-01-20
Compatibilidad de hardware
Estimaciones de encaje en todo el hardware
Computing compatibility...
Desglose de memoria
Reference: RTX 2060 6GB
Weights409.3 GB
KV Cache59.6 GB
Runtime0.9 GB
Headroom0.6 GB
Preguntas frecuentes
FAQ — DeepSeek R1 671B
Ver también