NVIDIA

Nemotron Nano 9B v2

Name: Nemotron Nano 9B v2
Rating: 80 (151 reviews)
Author: NVIDIA

前沿

HuggingFace

Ollama

Jun 2025发布日期131K tokens上下文NVIDIA Open Model许可证70 良好质量

Nemotron Nano 9B v2 (9B parameters) requires approximately 9.7 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 12 GB of VRAM.

快速开始

— 复制粘贴即可本地运行

Copy-paste commands to run Nemotron Nano 9B v2 on your machine.

Run

ollama run nemotron-nano:9b-v2

Quick specs

Parameters9B

Architecturedense

Context131K tokens

Modalitytext

Min RAM3.5 GB

Rec. RAM5.5 GB (Q4_K_M)

LicenseNVIDIA Open Model

FamilyNemotron

✓ Code✓ Chat✓ Reasoning

About this model

Nemotron Nano 9B v2 is an updated version of NVIDIA's compact reasoning model with improved instruction following, coding, and math capabilities.

•Improved reasoning and coding over v1
•Switchable thinking mode for detailed step-by-step reasoning
•Fits comfortably on 8 GB VRAM GPUs at Q4_K_M

Nemotron Nano 9B v2 的最佳选择

RTX 4070 Ti Super 16GBS

RTX 5080 Laptop 16GBS

16 GB

运行此模型

Nemotron Nano 9B v2 on RTX 4070 Ti Super 16GB Nemotron Nano 9B v2 on RTX 4080 Super 16GB Nemotron Nano 9B v2 on RTX 5070 Ti 16GB

量化选项

各量化级别的 VRAM 估算

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	3.5 GB	Low	—
Q3_K_S	3	4.4 GB	Low	—
NVFP4	4	5.0 GB	Medium	—
Q4_K_M	4	5.5 GB	Medium	—
Q5_K_M	5	6.5 GB	High	—
Q6_K	6	7.4 GB	High	—
Q8_0	8	9.6 GB	Very High	—
F16	16	18.5 GB	Maximum	—

Quality benchmarks

Nemotron Nano 9B v2 benchmark scores

Benchmark verified

Coding

SWE-bench Verified—

HumanEval+58.5%

Aider Polyglot—

LiveCodeBench—

Reasoning

MMLU-Pro59.4%

GPQA Diamond64.0%

MATH-50097.8%

ARC Challenge—

Source: official · 2025-09-02

硬件兼容性

全部硬件的适配估算

打开计算器

Computing compatibility...

内存详细分析

Reference: RTX 2060 6GB

Weights5.5 GB

KV Cache2.4 GB

Runtime1.2 GB

Headroom0.6 GB

常见问题

FAQ — Nemotron Nano 9B v2

另请参阅

量化指南评分方法打开计算器