Alibaba
Qwen3-VL 30B A3B Instruct
Frontier1.2MDownloads574LikesSep 2025Veröffentlicht256K TokenKontextApache 2.0Lizenz98 HerausragendQualität
Qwen3-VL 30B A3B Instruct (30B parameters) requires approximately 22.8 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 3B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 27 GB of VRAM.
Loslegen
— kopieren & einfügen, um lokal auszuführenCopy-paste commands to run Qwen3-VL 30B A3B Instruct on your machine.
Run
lms load Qwen3-VL-30B-A3B-Instruct && lms server startQuick specs
Parameters30B (3B active)
Architecturemoe (MoE)
Context256K tokens
Modalitytext+vision
Min RAM11.7 GB
Rec. RAM18.3 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen VL
✓ Vision✓ Reasoning
About this model
- •Visual Agent: Operates PC/mobile GUIs—recognizes elements, understands functions, invokes tools, completes tasks
- •Visual Coding Boost: Generates Draw.io/HTML/CSS/JS from images/videos
- •Advanced Spatial Perception: Judges object positions, viewpoints, and occlusions; provides stronger 2D grounding and enables 3D grounding for...
- •Long Context & Video Understanding: Native 256K context, expandable to 1M; handles books and hours-long video with full recall and second-level...
- •Enhanced Multimodal Reasoning: Excels in STEM/Math—causal analysis and logical, evidence-based answers
Schnellauswahl
Beste Hardware
Top-Empfehlungen für Qwen3-VL 30B A3B Instruct
Dieses Modell ausführen
Quantisierungsoptionen
VRAM-Schätzungen nach Quantisierungsstufe
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 11.7 GB | Low | — |
Q3_K_S | 3 | 14.7 GB | Low | — |
NVFP4 | 4 | 16.8 GB | Medium | — |
Q4_K_M | 4 | 18.3 GB | Medium | — |
Q5_K_M | 5 | 21.6 GB | High | — |
Q6_K | 6 | 24.6 GB | High | — |
Q8_0 | 8 | 32.1 GB | Very High | — |
F16 | 16 | 61.5 GB | Maximum | — |
Hardware-Kompatibilität
Eignungsschätzungen für alle Hardware
Computing compatibility...
Speicheraufschlüsselung
Reference: RTX 2060 6GB
Weights18.3 GB
KV Cache1.5 GB
Runtime2.4 GB
Headroom0.6 GB
Häufig gestellte Fragen
FAQ — Qwen3-VL 30B A3B Instruct
Siehe auch