AlibabaAlibaba

Qwen 2.5 32B

Aktuell
939.4KDownloads351LikesSep 2024Veröffentlicht131K TokenKontextApache 2.0Lizenz79 StarkQualität

Qwen 2.5 32B (32B parameters) requires approximately 25.2 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 29 GB of VRAM.

Loslegen

— kopieren & einfügen, um lokal auszuführen

Copy-paste commands to run Qwen 2.5 32B on your machine.

Run

ollama run qwen2.5

Quick specs

Parameters32B
Architecturedense
Context131K tokens
Modalitytext
Min RAM12.5 GB
Rec. RAM19.5 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen
Code Chat Reasoning

About this model

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:

  • Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models...
  • Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g,...
  • Long-context Support: up to 128K tokens and can generate up to 8K tokens
  • Multilingual support: for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean,...

Verwandte Modelle

Deine Hardware

Erkennung...

Schnellauswahl

Beste Hardware

Top-Empfehlungen für Qwen 2.5 32B

Dieses Modell ausführen

Quantisierungsoptionen

VRAM-Schätzungen nach Quantisierungsstufe

No hardware detected — fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
12.5 GB
Low
Q3_K_S
3
15.7 GB
Low
NVFP4
4
17.9 GB
Medium
Q4_K_M
4
19.5 GB
Medium
Q5_K_M
5
23.0 GB
High
Q6_K
6
26.2 GB
High
Q8_0
8
34.2 GB
Very High
F16
16
65.6 GB
Maximum

Quality benchmarks

Qwen 2.5 32B benchmark scores

Benchmark verified

Coding

SWE-bench Verified
HumanEval+88.4%
Aider Polyglot
LiveCodeBench51.2%

Reasoning

MMLU-Pro69.0%
GPQA Diamond49.5%
MATH-50083.1%
ARC Challenge

General

Chatbot Arena
IFEval79.5%

Source: official · 2024-09-19

Hardware-Kompatibilität

Eignungsschätzungen für alle Hardware

Rechner öffnen

Computing compatibility...

Speicheraufschlüsselung

Reference: RTX 2060 6GB

Weights19.5 GB
KV Cache3.9 GB
Runtime1.2 GB
Headroom0.6 GB

Häufig gestellte Fragen

FAQ — Qwen 2.5 32B

Siehe auch