Will It Run AI

HuggingFaceHuggingFace

SmolLM3 3B

423.9K下载量961点赞Jul 2025发布日期128K tokens上下文Apache 2.0许可证21 入门质量

SmolLM3 3B (3B parameters) requires approximately 5.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 7 GB of VRAM.

快速开始

— 复制粘贴即可本地运行

Copy-paste commands to run SmolLM3 3B on your machine.

Run

lms load SmolLM3-3B && lms server start

Quick specs

Parameters3B
Architecturedense
Context128K tokens
Modalitytext
Min RAM1.2 GB
Rec. RAM1.8 GB (Q4_K_M)
LicenseApache 2.0
FamilySmolLM
Chat Reasoning

About this model

SmolLM3 is a fully open 3B-parameter language model with dual-mode reasoning, 128K context via YARN extrapolation, and native support for 6 languages. Pretrained on 11.2T tokens with a staged curriculum of web, code, math, and reasoning data. Post-trained with 140B reasoning tokens and Anchored Preference Optimization.

  • Dual-mode reasoning: extended thinking can be toggled on/off
  • 128K context via YARN extrapolation from 64K training
  • 6 natively supported languages: English, French, Spanish, German, Italian, Portuguese
  • Fully open: weights, training details, and public data mixture

你的硬件

检测中...

快速推荐

最佳硬件

SmolLM3 3B 的最佳选择

运行此模型

量化选项

各量化级别的 VRAM 估算

No hardware detected — fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
1.2 GB
Low
Q3_K_S
3
1.5 GB
Low
NVFP4
4
1.7 GB
Medium
Q4_K_M
4
1.8 GB
Medium
Q5_K_M
5
2.2 GB
High
Q6_K
6
2.5 GB
High
Q8_0
8
3.2 GB
Very High
F16
16
6.1 GB
Maximum

Quality benchmarks

SmolLM3 3B benchmark scores

Benchmark verified

Coding

SWE-bench Verified
HumanEval+30.5%
Aider Polyglot
LiveCodeBench

Reasoning

MMLU-Pro32.7%
GPQA Diamond35.7%
MATH-500
ARC Challenge

General

Chatbot Arena
IFEval76.7%

Source: official · 2025-07-02

硬件兼容性

全部硬件的适配估算

打开计算器

Computing compatibility...

内存详细分析

Reference: RTX 2060 6GB

Weights1.8 GB
KV Cache2.0 GB
Runtime1.2 GB
Headroom0.6 GB

常见问题

FAQ — SmolLM3 3B

另请参阅