AlibabaAlibaba

Qwen 2.5 72B

Aktuell
887.8KDownloads942LikesSep 2024Veröffentlicht131K TokenKontextApache 2.0Lizenz73 StarkQualität

Qwen 2.5 72B (72B parameters) requires approximately 50.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 59 GB of VRAM.

Loslegen

— kopieren & einfügen, um lokal auszuführen

Copy-paste commands to run Qwen 2.5 72B on your machine.

Run

ollama run qwen2.5:72b

Quick specs

Parameters72B
Architecturedense
Context131K tokens
Modalitytext
Min RAM28.1 GB
Rec. RAM43.9 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen
Chat Reasoning

About this model

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:

  • Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models...
  • Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g,...
  • Long-context Support: up to 128K tokens and can generate up to 8K tokens
  • Multilingual support: for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean,...

Verwandte Modelle

Deine Hardware

Erkennung...

Schnellauswahl

Beste Hardware

Top-Empfehlungen für Qwen 2.5 72B

Dieses Modell ausführen

Quantisierungsoptionen

VRAM-Schätzungen nach Quantisierungsstufe

No hardware detected — fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
28.1 GB
Low
Q3_K_S
3
35.3 GB
Low
NVFP4
4
40.3 GB
Medium
Q4_K_M
4
43.9 GB
Medium
Q5_K_M
5
51.8 GB
High
Q6_K
6
59.0 GB
High
Q8_0
8
77.0 GB
Very High
F16
16
147.6 GB
Maximum

Quality benchmarks

Qwen 2.5 72B benchmark scores

Benchmark verified

Coding

SWE-bench Verified32.8%
HumanEval+86.6%
Aider Polyglot
LiveCodeBench55.5%

Reasoning

MMLU-Pro71.1%
GPQA Diamond49.0%
MATH-50083.1%
ARC Challenge

General

Chatbot Arena
IFEval84.1%

Source: official · 2024-09-19

Hardware-Kompatibilität

Eignungsschätzungen für alle Hardware

Rechner öffnen

Computing compatibility...

Speicheraufschlüsselung

Reference: RTX 2060 6GB

Weights43.9 GB
KV Cache4.9 GB
Runtime1.2 GB
Headroom0.6 GB

Häufig gestellte Fragen

FAQ — Qwen 2.5 72B

Siehe auch