NVIDIA
Nemotron Cascade 2 30B A3B
Frontier
Nemotron Cascade 2 30B A3B (30B parameters) requires approximately 23.0 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model, it activates only 3B parameters per token, so it generates faster than a dense 30B model, although the weights for all 30B parameters still have to be loaded. For the best balance of quality and speed, we recommend hardware with at least 27 GB of VRAM.
Get started
Copy-paste commands to run Nemotron Cascade 2 30B A3B on your machine.
Run
ollama run nemotron-cascade-2
Quick specs
About this model
- MoE: 30B total / 3B active; runs on a single RTX 4090
- Mamba-2 + Transformer hybrid architecture
- Gold medal at IMO 2025, IOI 2025, and ICPC 2025
- 92.4% on AIME 2025, 87.2% on LiveCodeBench v6
- 262K context window
- Thinking + instruct dual mode
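The `ollama run` command above opens an interactive session. To call the model from a script instead, a minimal sketch follows. It assumes an Ollama server is already running on its default local port (11434) and that the model has been pulled under the `nemotron-cascade-2` tag shown above. The helper names `build_request` and `generate` are illustrative, not part of any library.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Model tag taken from the `ollama run` command on this page.
MODEL_TAG = "nemotron-cascade-2"


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, timeout: float = 300.0) -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt), timeout=timeout) as resp:
        body = json.loads(resp.read().decode("utf-8"))
    # With "stream": False the reply is a single JSON object whose
    # "response" field holds the full completion.
    return body["response"]
```

With the server running, `generate("Explain Mixture of Experts in two sentences.")` returns the model's reply as a string. The generous default timeout is a guess to allow for the first call, which also has to load the weights.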
Related models
Quick picks
Best hardware
Top picks for Nemotron Cascade 2 30B A3B
Run this model
Quantization options
VRAM estimates by quantization level
VRAM figures are raw estimates; no hardware was detected to check fit against.
| Quant | Bits | VRAM | Quality |
|---|---|---|---|
| Q2_K | 2 | 11.7 GB | Low |
| Q3_K_S | 3 | 14.7 GB | Low |
| NVFP4 | 4 | 16.8 GB | Medium |
| Q4_K_M | 4 | 18.3 GB | Medium |
| Q5_K_M | 5 | 21.6 GB | High |
| Q6_K | 6 | 24.6 GB | High |
| Q8_0 | 8 | 32.1 GB | Very High |
| F16 | 16 | 61.5 GB | Maximum |
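As a rough illustration of how to read the table above, the sketch below picks the largest quantization whose estimated VRAM fits a given card. The figures are copied from the table. The function name `best_quant_for` and the optional `headroom_gb` parameter (memory set aside for context and other overhead) are assumptions made for this example, not something the page defines.

```python
from typing import Optional

# (quant name, estimated VRAM in GB), copied from the table on this page.
QUANT_VRAM_GB = [
    ("Q2_K", 11.7),
    ("Q3_K_S", 14.7),
    ("NVFP4", 16.8),
    ("Q4_K_M", 18.3),
    ("Q5_K_M", 21.6),
    ("Q6_K", 24.6),
    ("Q8_0", 32.1),
    ("F16", 61.5),
]


def best_quant_for(vram_gb: float, headroom_gb: float = 0.0) -> Optional[str]:
    """Return the largest quant that fits in vram_gb minus headroom_gb.

    Returns None when even the smallest quant does not fit.
    """
    budget = vram_gb - headroom_gb
    fitting = [(need, name) for name, need in QUANT_VRAM_GB if need <= budget]
    if not fitting:
        return None
    # The largest estimate that still fits corresponds to the
    # highest-quality option in the table.
    return max(fitting)[1]


print(best_quant_for(24.0))                   # Q5_K_M
print(best_quant_for(24.0, headroom_gb=4.0))  # Q4_K_M
print(best_quant_for(6.0))                    # None
```

The 24 GB case corresponds to a single RTX 4090, and the 6 GB case to the RTX 2060 6GB used as the reference card further down the page. How much headroom to reserve depends on context length and runtime, so the 4 GB value here is only a placeholder.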
Quality benchmarks
Nemotron Cascade 2 30B A3B benchmark scores
Coding
Reasoning
Source: official · 2026-03-19
Hardware compatibility
Fit estimates across all hardware
Memory breakdown
Reference: RTX 2060 6GB
Frequently asked questions
FAQ: Nemotron Cascade 2 30B A3B
See also