Google

Gemma 4 26B A4B

Name: Gemma 4 26B A4B
Rating: 86 (90 reviews)
Author: Google

前沿

HuggingFace

Ollama

Apr 2026发布日期256K tokens上下文Apache-2.0许可证82 优秀质量

Gemma 4 26B A4B (25.200000762939453B parameters) requires approximately 20.8 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 3.799999952316284B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 24 GB of VRAM.

快速开始

— 复制粘贴即可本地运行

Copy-paste commands to run Gemma 4 26B A4B on your machine.

Run

ollama run gemma4:26b

Quick specs

Parameters25.2B (3.8B active)

Architecturemoe (MoE)

Context256K tokens

Modalitytext

Min RAM9.8 GB

Rec. RAM15.4 GB (Q4_K_M)

LicenseApache-2.0

FamilyGemma

✓ Code✓ Chat✓ Reasoning

About this model

Gemma 4 26B-A4B is Google's MoE model with 25.2B total parameters, 3.8B active per token (128 experts, 8 active). Matches much larger dense models at a fraction of the compute. 256K context. Apache 2.0.

•MoE: 128 experts, 8 active per token
•256K context window
•#3 open model on Arena
•Apache 2.0 license
•89% AIME 2026

Gemma 4 26B A4B 的最佳选择

RTX 5090 32GBS

32 GB

RTX PRO 4500 Blackwell 32GBS

32 GB

NVIDIA V100 32GBS

32 GB

AMD Instinct MI100 32GBS

32 GB

AMD Instinct MI60 32GBS

32 GB

运行此模型

Gemma 4 26B A4B on RTX 5090 32GB Gemma 4 26B A4B on RTX PRO 4500 Blackwell 32GB Gemma 4 26B A4B on NVIDIA V100 32GB

量化选项

各量化级别的 VRAM 估算

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	9.8 GB	Low	—
Q3_K_S	3	12.3 GB	Low	—
NVFP4	4	14.1 GB	Medium	—
Q4_K_M	4	15.4 GB	Medium	—
Q5_K_M	5	18.1 GB	High	—
Q6_K	6	20.7 GB	High	—
Q8_0	8	27.0 GB	Very High	—
F16	16	51.7 GB	Maximum	—

Quality benchmarks

Gemma 4 26B A4B benchmark scores

Benchmark verified

Coding

SWE-bench Verified—

HumanEval+—

Aider Polyglot—

LiveCodeBench77.1%

Reasoning

MMLU-Pro82.6%

GPQA Diamond82.3%

MATH-500—

ARC Challenge—

Source: official · 2026-04-02

硬件兼容性

全部硬件的适配估算

打开计算器

Computing compatibility...

内存详细分析

Reference: RTX 2060 6GB

Weights15.4 GB

KV Cache3.7 GB

Runtime1.2 GB

Headroom0.6 GB

常见问题

FAQ — Gemma 4 26B A4B

另请参阅

量化指南评分方法打开计算器