MaziyarPanahi
Llama 3.3 70B Instruct
可用数据有限 — 部分规格可能不完整或为估算值。
0K tokens上下文Unknown许可证4 入门质量
Llama 3.3 70B Instruct (70B parameters) requires approximately 52.7 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 61 GB of VRAM.
Quick specs
Parameters70B
Architecturedense
Context0K tokens
Modalitytext
Min RAM27.3 GB
Rec. RAM42.7 GB (Q4_K_M)
LicenseUnknown
FamilyLlama
✓ Chat
相关模型
快速推荐
最佳硬件
Llama 3.3 70B Instruct 的最佳选择
运行此模型
量化选项
各量化级别的 VRAM 估算
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 27.3 GB | Low | — |
Q3_K_S | 3 | 34.3 GB | Low | — |
NVFP4 | 4 | 39.2 GB | Medium | — |
Q4_K_M | 4 | 42.7 GB | Medium | — |
Q5_K_M | 5 | 50.4 GB | High | — |
Q6_K | 6 | 57.4 GB | High | — |
Q8_0 | 8 | 74.9 GB | Very High | — |
F16 | 16 | 143.5 GB | Maximum | — |
硬件兼容性
全部硬件的适配估算
Computing compatibility...
内存详细分析
Reference: RTX 2060 6GB
Weights42.7 GB
KV Cache8.2 GB
Runtime1.2 GB
Headroom0.6 GB
常见问题
FAQ — Llama 3.3 70B Instruct
另请参阅