Alibaba
Qwen 2.5 Math 72B
Frontier · 1.2K downloads · 30 likes · Published Sep 2024 · Context: 4K tokens · License: Apache 2.0 · Quality: 34 (Basic)
Qwen 2.5 Math 72B (72B parameters) needs approximately 50.3 GB of VRAM with Q4_K_M quantization: 43.9 GB for the weights, plus KV cache, runtime overhead, and headroom. For the best balance of quality and speed, we recommend hardware with at least 58 GB of VRAM.
Get started
Copy-paste commands to run Qwen 2.5 Math 72B on your machine.
Run
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full --run \
--hf-repo "Qwen/Qwen2.5-Math-72B-Instruct" \
--hf-file "Qwen2.5-Math-72B-Instruct-Q4_K_M.gguf" \
-c 4096 -ngl 99
Quick specs
Parameters: 72B
Architecture: dense
Context: 4K tokens
Modality: text
Min RAM: 28.1 GB
Rec. RAM: 43.9 GB (Q4_K_M)
License: Apache 2.0
Family: Qwen
✓ Reasoning
Quantization options
VRAM estimates by quantization level
| Quant | Bits | VRAM | Quality |
|---|---|---|---|
| Q2_K | 2 | 28.1 GB | Low |
| Q3_K_S | 3 | 35.3 GB | Low |
| NVFP4 | 4 | 40.3 GB | Medium |
| Q4_K_M | 4 | 43.9 GB | Medium |
| Q5_K_M | 5 | 51.8 GB | High |
| Q6_K | 6 | 59.0 GB | High |
| Q8_0 | 8 | 77.0 GB | Very High |
| F16 | 16 | 147.6 GB | Maximum |
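As a rough sketch of how these estimates might be used, the snippet below picks the largest listed quantization that fits a given VRAM budget. The function name `best_quant` is hypothetical, and the fixed 6.4 GB overhead (4.9 GB KV cache + 0.9 GB runtime + 0.6 GB headroom, taken from this page's memory breakdown for Q4_K_M) is an assumption; real overhead will likely vary with context length and runtime.

```python
# Weight-size estimates (GB) copied from the quantization table on this page.
QUANTS = [
    ("Q2_K", 28.1),
    ("Q3_K_S", 35.3),
    ("NVFP4", 40.3),
    ("Q4_K_M", 43.9),
    ("Q5_K_M", 51.8),
    ("Q6_K", 59.0),
    ("Q8_0", 77.0),
    ("F16", 147.6),
]

# Assumption: reuse the Q4_K_M overhead (KV cache + runtime + headroom)
# for every quantization level.
OVERHEAD_GB = 4.9 + 0.9 + 0.6


def best_quant(vram_budget_gb, overhead_gb=OVERHEAD_GB):
    """Return the largest listed quant whose weights plus overhead fit, else None."""
    fitting = [(name, gb) for name, gb in QUANTS if gb + overhead_gb <= vram_budget_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda q: q[1])[0]


if __name__ == "__main__":
    for budget in (24, 58, 80):
        print(budget, "GB ->", best_quant(budget))
```

Under these assumptions, a 58 GB budget selects Q4_K_M, which lines up with the hardware recommendation at the top of the page; a 24 GB budget fits none of the listed levels.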
Quality benchmarks
Qwen 2.5 Math 72B benchmark scores
Reasoning
MMLU-Pro: 42.4%
GPQA Diamond: 10.9%
MATH-500: 87.8%
ARC Challenge: n/a
General
Chatbot Arena: n/a
IFEval: 40.0%
Source: official · 2024-09-19
Hardware compatibility
Fit estimates across all hardware
Memory breakdown
Reference: RTX 2060 6GB
Weights: 43.9 GB
KV Cache: 4.9 GB
Runtime: 0.9 GB
Headroom: 0.6 GB
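The breakdown above can be checked with a few lines of arithmetic. This is a minimal sketch: the helper names `total_vram_gb` and `shortfall_gb` are hypothetical, and the figures are simply the ones listed on this page for Q4_K_M.

```python
# Memory components (GB) copied from the breakdown above.
BREAKDOWN_GB = {
    "weights": 43.9,
    "kv_cache": 4.9,
    "runtime": 0.9,
    "headroom": 0.6,
}


def total_vram_gb(parts):
    """Sum the components, rounded to one decimal place like the page's figures."""
    return round(sum(parts.values()), 1)


def shortfall_gb(parts, gpu_vram_gb):
    """How much VRAM is missing on a given GPU (0.0 if the model fits)."""
    return round(max(0.0, total_vram_gb(parts) - gpu_vram_gb), 1)


if __name__ == "__main__":
    print("total:", total_vram_gb(BREAKDOWN_GB), "GB")
    # The reference card on this page is an RTX 2060 with 6 GB of VRAM.
    print("shortfall on 6 GB:", shortfall_gb(BREAKDOWN_GB, 6.0), "GB")
```

The components sum to 50.3 GB, matching the Q4_K_M requirement quoted at the top of the page, so the 6 GB reference card falls short by 44.3 GB.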