DeepSeek
DeepSeek R1 671B
Frontier · 5.2M downloads · 13.4K likes · Released Jan 2025 · 131K-token context · MIT license · Quality 84 (Strong)
DeepSeek R1 671B (671B total parameters) requires approximately 470.4 GB of VRAM with Q4_K_M quantization: 409.3 GB of weights plus KV cache, runtime, and headroom. As a Mixture of Experts model it activates only 37B parameters per token, which lowers the compute per token, but the memory estimates on this page scale with the full 671B parameter count. For the best balance of quality and speed, we recommend hardware with at least 541 GB of VRAM.
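As a rough illustration of how the two figures above might be used, the sketch below sorts a multi-GPU setup into tiers. The thresholds (470.4 GB and 541 GB) come from this page. The function name `vram_tier`, the tier labels, and the assumption that VRAM pools evenly across GPUs with no per-device overhead are illustrative simplifications, not part of the page.

```python
# Thresholds copied from this page (Q4_K_M quantization).
MIN_VRAM_GB = 470.4          # estimated total requirement
RECOMMENDED_VRAM_GB = 541.0  # recommended capacity

def vram_tier(gpu_count: int, vram_per_gpu_gb: float) -> str:
    """Classify a setup by pooled VRAM (hypothetical helper).

    Assumes memory pools perfectly across GPUs, which ignores
    per-device overhead and sharding constraints.
    """
    total_gb = gpu_count * vram_per_gpu_gb
    if total_gb >= RECOMMENDED_VRAM_GB:
        return "recommended"
    if total_gb >= MIN_VRAM_GB:
        return "minimum"
    return "insufficient"

# Margin implied by the page's two figures (about 15%).
MARGIN = RECOMMENDED_VRAM_GB / MIN_VRAM_GB - 1

if __name__ == "__main__":
    for count in (4, 6, 8):
        print(f"{count} x 80 GB: {vram_tier(count, 80.0)}")
    print(f"implied margin: {MARGIN:.1%}")
```

The recommended figure sits roughly 15% above the estimated requirement, which suggests it is a safety margin on top of the estimate rather than a separate measurement.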
Get started
Copy-paste commands to run DeepSeek R1 671B on your machine.
Run
`ollama run deepseek-r1`
Quick specs
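| Spec | Value |
|---|---|
| Parameters | 671B (37B active) |
| Architecture | Mixture of Experts (MoE) |
| Context | 131K tokens |
| Modality | Text |
| Min RAM | 261.7 GB |
| Rec. RAM | 409.3 GB (Q4_K_M) |
| License | MIT |
| Family | DeepSeek |

✓ Chat · ✓ Reasoning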
About this model
- We directly apply reinforcement learning (RL) to the base model without relying on supervised fine-tuning (SFT) as a preliminary step. This...
- We introduce our pipeline to develop DeepSeek-R1. The pipeline incorporates two RL stages aimed at discovering improved reasoning patterns and...
Quantization options
VRAM estimates by quantization level
| Quant | Bits | VRAM | Quality |
|---|---|---|---|
| Q2_K | 2 | 261.7 GB | Low |
| Q3_K_S | 3 | 328.8 GB | Low |
| NVFP4 | 4 | 375.8 GB | Medium |
| Q4_K_M | 4 | 409.3 GB | Medium |
| Q5_K_M | 5 | 483.1 GB | High |
| Q6_K | 6 | 550.2 GB | High |
| Q8_0 | 8 | 718.0 GB | Very High |
| F16 | 16 | 1375.6 GB | Maximum |
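The nominal bit widths in the table do not map directly onto the VRAM column: 671B parameters at exactly 4 bits would be about 335.5 GB, not 409.3 GB. The sketch below backs out the effective bits per weight implied by each row. It assumes 1 GB = 10^9 bytes and that the VRAM column reflects weights only. Both are assumptions, and the helper name `effective_bits_per_weight` is illustrative.

```python
TOTAL_PARAMS = 671e9  # total parameter count from this page

# (nominal bits, VRAM in GB) per quantization level, copied from the table.
QUANTS = {
    "Q2_K": (2, 261.7),
    "Q3_K_S": (3, 328.8),
    "NVFP4": (4, 375.8),
    "Q4_K_M": (4, 409.3),
    "Q5_K_M": (5, 483.1),
    "Q6_K": (6, 550.2),
    "Q8_0": (8, 718.0),
    "F16": (16, 1375.6),
}

def effective_bits_per_weight(vram_gb: float, params: float = TOTAL_PARAMS) -> float:
    """Bits per parameter implied by a VRAM figure (assumes 1 GB = 1e9 bytes)."""
    return vram_gb * 1e9 * 8 / params

if __name__ == "__main__":
    for name, (nominal, vram_gb) in QUANTS.items():
        eff = effective_bits_per_weight(vram_gb)
        print(f"{name:7s} nominal {nominal:2d} bits, effective {eff:5.2f} bits/weight")
```

Under these assumptions every row comes out above its nominal width. One plausible reason is that block-quantized formats store scale metadata alongside the weights and keep some tensors at higher precision, though the page does not state how its estimates were derived.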
Quality benchmarks
DeepSeek R1 671B benchmark scores
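| Category | Benchmark | Score |
|---|---|---|
| Coding | SWE-bench Verified | 49.2% |
| Coding | HumanEval+ | 85.0% |
| Coding | Aider Polyglot | 53.3% |
| Coding | LiveCodeBench | 65.9% |
| Reasoning | MMLU-Pro | 84.0% |
| Reasoning | GPQA Diamond | 71.5% |
| Reasoning | MATH-500 | 97.3% |
| Reasoning | ARC Challenge | n/a |
| General | Chatbot Arena | n/a |
| General | IFEval | 83.3% |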
Source: official · 2025-01-20
Memory breakdown
Reference: RTX 2060 6GB

| Component | Size |
|---|---|
| Weights | 409.3 GB |
| KV Cache | 59.6 GB |
| Runtime | 0.9 GB |
| Headroom | 0.6 GB |
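A quick check shows that the four components add up to the 470.4 GB requirement quoted at the top of the page. The sketch below sums them and reports each component's share. The variable names are illustrative, and the figures are copied from the breakdown.

```python
# Memory breakdown at Q4_K_M, in GB, as listed on this page.
BREAKDOWN_GB = {
    "Weights": 409.3,
    "KV Cache": 59.6,
    "Runtime": 0.9,
    "Headroom": 0.6,
}

# Round to one decimal to match the page's precision.
total_gb = round(sum(BREAKDOWN_GB.values()), 1)

# Fraction of the total taken by each component.
shares = {name: gb / total_gb for name, gb in BREAKDOWN_GB.items()}

if __name__ == "__main__":
    for name, gb in BREAKDOWN_GB.items():
        print(f"{name:9s} {gb:6.1f} GB  ({shares[name]:.1%})")
    print(f"{'Total':9s} {total_gb:6.1f} GB")
```

Weights account for roughly 87% of the total and the KV cache for most of the remainder, so the choice of quantization level is the main lever on the overall footprint.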