DeepSeek
DeepSeek V2.5 236B
Actual8.9KDescargas734Me gustaSep 2024Publicado131K tokensContextoDeepSeekLicencia80 FuerteCalidad
DeepSeek V2.5 236B (236B parameters) requires approximately 204.1 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 21B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 235 GB of VRAM.
Comenzar
— copia y pega para ejecutar en localCopy-paste commands to run DeepSeek V2.5 236B on your machine.
Run
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
--hf-repo "deepseek-ai/DeepSeek-V2.5" \
--hf-file "DeepSeek-V2.5-Q4_K_M.gguf" \
-c 4096 -ngl 99Quick specs
Parameters236B (21B active)
Architecturemoe (MoE)
Context131K tokens
Modalitytext
Min RAM92 GB
Rec. RAM144 GB (Q4_K_M)
LicenseDeepSeek
FamilyDeepSeek
✓ Chat✓ Reasoning
About this model
Modelos relacionados
Selecciones rápidas
Mejor hardware
Mejores opciones para DeepSeek V2.5 236B
Ejecutar este modelo
Opciones de cuantización
Estimaciones de VRAM por nivel de cuantización
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 92.0 GB | Low | — |
Q3_K_S | 3 | 115.6 GB | Low | — |
NVFP4 | 4 | 132.2 GB | Medium | — |
Q4_K_M | 4 | 144.0 GB | Medium | — |
Q5_K_M | 5 | 169.9 GB | High | — |
Q6_K | 6 | 193.5 GB | High | — |
Q8_0 | 8 | 252.5 GB | Very High | — |
F16 | 16 | 483.8 GB | Maximum | — |
Quality benchmarks
DeepSeek V2.5 236B benchmark scores
Reasoning
MMLU-Pro—
GPQA Diamond—
MATH-50082.8%
ARC Challenge—
Source: official · 2024-09-05
Compatibilidad de hardware
Estimaciones de encaje en todo el hardware
Computing compatibility...
Desglose de memoria
Reference: RTX 2060 6GB
Weights144.0 GB
KV Cache58.6 GB
Runtime0.9 GB
Headroom0.6 GB
Preguntas frecuentes
FAQ — DeepSeek V2.5 236B
Ver también