Alibaba
Qwen 2.5 Math 72B
Frontier1.2KDownloads30CurtidasSep 2024Publicado4K tokensContextoApache 2.0Licença34 BásicoQualidade
Qwen 2.5 Math 72B (72B parameters) requires approximately 50.3 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 58 GB of VRAM.
Comece agora
— copie e cole para rodar localmenteCopy-paste commands to run Qwen 2.5 Math 72B on your machine.
Run
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
--hf-repo "Qwen/Qwen2.5-Math-72B-Instruct" \
--hf-file "Qwen2.5-Math-72B-Instruct-Q4_K_M.gguf" \
-c 4096 -ngl 99Quick specs
Parameters72B
Architecturedense
Context4K tokens
Modalitytext
Min RAM28.1 GB
Rec. RAM43.9 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen
✓ Reasoning
About this model
Modelos relacionados
Escolhas rápidas
Melhor hardware
Melhores opções para Qwen 2.5 Math 72B
Rodar este modelo
Opções de quantização
Estimativas de VRAM por nível de quantização
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 28.1 GB | Low | — |
Q3_K_S | 3 | 35.3 GB | Low | — |
NVFP4 | 4 | 40.3 GB | Medium | — |
Q4_K_M | 4 | 43.9 GB | Medium | — |
Q5_K_M | 5 | 51.8 GB | High | — |
Q6_K | 6 | 59.0 GB | High | — |
Q8_0 | 8 | 77.0 GB | Very High | — |
F16 | 16 | 147.6 GB | Maximum | — |
Quality benchmarks
Qwen 2.5 Math 72B benchmark scores
Reasoning
MMLU-Pro42.4%
GPQA Diamond10.9%
MATH-50087.8%
ARC Challenge—
General
Chatbot Arena—
IFEval40.0%
Source: official · 2024-09-19
Compatibilidade de hardware
Estimativas de compatibilidade para todo o hardware
Computing compatibility...
Detalhamento de memória
Reference: RTX 2060 6GB
Weights43.9 GB
KV Cache4.9 GB
Runtime0.9 GB
Headroom0.6 GB
Perguntas frequentes
FAQ — Qwen 2.5 Math 72B
Veja também