HuggingFace

SmolLM3 3B

Name: SmolLM3 3B
Rating: 56 (193 reviews)
Author: HuggingFace

721.8K下载量983点赞Jul 2025发布日期128K tokens上下文Apache 2.0许可证21 入门质量

SmolLM3 3B (3B parameters) requires approximately 5.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 7 GB of VRAM.

快速开始

— 复制粘贴即可本地运行

Copy-paste commands to run SmolLM3 3B on your machine.

Run

lms load SmolLM3-3B && lms server start

Quick specs

Parameters3B

Architecturedense

Context128K tokens

Modalitytext

Min RAM1.2 GB

Rec. RAM1.8 GB (Q4_K_M)

LicenseApache 2.0

FamilySmolLM

✓ Chat✓ Reasoning

About this model

SmolLM3 is a fully open 3B-parameter language model with dual-mode reasoning, 128K context via YARN extrapolation, and native support for 6 languages. Pretrained on 11.2T tokens with a staged curriculum of web, code, math, and reasoning data. Post-trained with 140B reasoning tokens and Anchored Preference Optimization.

•Dual-mode reasoning: extended thinking can be toggled on/off
•128K context via YARN extrapolation from 64K training
•6 natively supported languages: English, French, Spanish, German, Italian, Portuguese
•Fully open: weights, training details, and public data mixture

快速推荐

最佳性价比B

Intel Arc A380 6GB~$139 — 42 tok/s

最佳综合B

RTX 5060 8GB~$299 — 57 tok/s

最佳硬件

SmolLM3 3B 的最佳选择

运行此模型

SmolLM3 3B on RTX 5060 8GB SmolLM3 3B on RTX 5060 Ti 8GB SmolLM3 3B on RTX 5050 8GB

量化选项

各量化级别的 VRAM 估算

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	1.2 GB	Low	—
Q3_K_S	3	1.5 GB	Low	—
NVFP4	4	1.7 GB	Medium	—
Q4_K_M	4	1.8 GB	Medium	—
Q5_K_M	5	2.2 GB	High	—
Q6_K	6	2.5 GB	High	—
Q8_0	8	3.2 GB	Very High	—
F16	16	6.1 GB	Maximum	—

Quality benchmarks

SmolLM3 3B benchmark scores

Benchmark verified

Coding

SWE-bench Verified—

HumanEval+30.5%

Aider Polyglot—

LiveCodeBench—

Reasoning

MMLU-Pro32.7%

GPQA Diamond35.7%

MATH-500—

ARC Challenge—

General

Chatbot Arena—

IFEval76.7%

Source: official · 2025-07-02

硬件兼容性

全部硬件的适配估算

打开计算器

Computing compatibility...

内存详细分析

Reference: RTX 2060 6GB

Weights1.8 GB

KV Cache2.0 GB

Runtime1.2 GB

Headroom0.6 GB

常见问题

FAQ — SmolLM3 3B

另请参阅

量化指南评分方法打开计算器