Unsloth

Qwen3.5 122B A10B

Name: Qwen3.5 122B A10B
Rating: 51 (29 reviews)
Author: Unsloth

HuggingFace

可用数据有限 — 部分规格可能不完整或为估算值。

431.9K下载量199点赞0K tokens上下文Unknown许可证5 入门质量

Qwen3.5 122B A10B (122B parameters) requires approximately 75.9 GB of VRAM with Q3_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 88 GB of VRAM.

快速开始

— 复制粘贴即可本地运行

Copy-paste commands to run Qwen3.5 122B A10B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --hf-repo "unsloth/Qwen3.5-122B-A10B-GGUF" \
  --hf-file "Qwen3.5-122B-A10B-GGUF-Q3_K_M.gguf" \
  -c 4096 -ngl 99

Quick specs

Parameters122B

Architecturedense

Context0K tokens

Modalitytext

Min RAM47.6 GB

Rec. RAM74.4 GB (Q4_K_M)

LicenseUnknown

FamilyQwen

✓ Vision✓ Chat

Qwen3.5 122B A10B 的最佳选择

AMD Instinct MI300A 128GBB

128 GB

NVIDIA H200 141GBC

141 GB

NVIDIA H200 PCIe 141GBC

141 GB

Gaudi 3 128GBC

128 GB

AMD Instinct MI250X 128GBC

128 GB

运行此模型

Qwen3.5 122B A10B on AMD Instinct MI300A 128GB Qwen3.5 122B A10B on NVIDIA H200 141GB Qwen3.5 122B A10B on NVIDIA H200 PCIe 141GB

量化选项

各量化级别的 VRAM 估算

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	47.6 GB	Low	—
Q3_K_S	3	59.8 GB	Low	—
NVFP4	4	68.3 GB	Medium	—
Q4_K_M	4	74.4 GB	Medium	—
Q5_K_M	5	87.8 GB	High	—
Q6_K	6	100.0 GB	High	—
Q8_0	8	130.5 GB	Very High	—
F16	16	250.1 GB	Maximum	—

硬件兼容性

全部硬件的适配估算

打开计算器

Computing compatibility...

内存详细分析

Reference: RTX 2060 6GB

Weights59.8 GB

KV Cache14.3 GB

Runtime1.2 GB

Headroom0.6 GB

常见问题

FAQ — Qwen3.5 122B A10B

另请参阅

量化指南评分方法打开计算器