NVIDIA
Nemotron Cascade 2 30B A3B
FrontierNemotron Cascade 2 30B A3B (30B parameters) requires approximately 23.0 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 3B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 27 GB of VRAM.
Comece agora
— copie e cole para rodar localmenteCopy-paste commands to run Nemotron Cascade 2 30B A3B on your machine.
Run
ollama run nemotron-cascade-2Quick specs
About this model
- •MoE: 30B total / 3B active — runs on single RTX 4090
- •Mamba-2 + Transformer hybrid architecture
- •Gold medal IMO 2025, IOI 2025, ICPC 2025
- •92.4% AIME 2025, 87.2% LiveCodeBench v6
- •262K context window
- •Thinking + instruct dual mode
Modelos relacionados
Escolhas rápidas
Melhor hardware
Melhores opções para Nemotron Cascade 2 30B A3B
Rodar este modelo
Opções de quantização
Estimativas de VRAM por nível de quantização
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 11.7 GB | Low | — |
Q3_K_S | 3 | 14.7 GB | Low | — |
NVFP4 | 4 | 16.8 GB | Medium | — |
Q4_K_M | 4 | 18.3 GB | Medium | — |
Q5_K_M | 5 | 21.6 GB | High | — |
Q6_K | 6 | 24.6 GB | High | — |
Q8_0 | 8 | 32.1 GB | Very High | — |
F16 | 16 | 61.5 GB | Maximum | — |
Quality benchmarks
Nemotron Cascade 2 30B A3B benchmark scores
Coding
Reasoning
Source: official · 2026-03-19
Compatibilidade de hardware
Estimativas de compatibilidade para todo o hardware
Computing compatibility...
Detalhamento de memória
Reference: RTX 2060 6GB
Perguntas frequentes
FAQ — Nemotron Cascade 2 30B A3B
Veja também