Leanstral 119B A6B

Name: Leanstral 119B A6B
Rating: 90 (16 reviews)
Author: Mistral

Current

HuggingFace

Downloads: 136
Likes: 168
Released: Mar 2026
Context: 256K tokens
License: Apache 2.0
Quality: 84 (Strong)

Leanstral 119B A6B (119B parameters) requires approximately 84.4 GB of VRAM with Q4_K_M quantization: 72.6 GB for the weights, plus KV cache, runtime overhead, and headroom. As a Mixture of Experts model, it activates only 6.5B parameters per token, so each token costs far less compute than in a dense 119B model, but all 119B parameters must still be held in memory. For the best balance of quality and speed, we recommend hardware with at least 98 GB of VRAM.

Get started

Copy-paste commands to run Leanstral 119B A6B on your machine.

Run

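# The "full" image dispatches on its first argument; --run starts llama-cli.
# -ngl only offloads layers when the image and container have GPU access.
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --run \
  --hf-repo "mistralai/Leanstral-2603" \
  --hf-file "Leanstral-2603-Q4_K_M.gguf" \
  -c 4096 -ngl 99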

Quick specs

Parameters: 119B (6.5B active)
Architecture: Mixture of Experts (MoE)
Context: 256K tokens
Modality: text + vision
Min RAM: 46.4 GB
Rec. RAM: 72.6 GB (Q4_K_M)
License: Apache 2.0
Family: Mistral Small

✓ Vision ✓ Code ✓ Reasoning

About this model

Leanstral is Mistral's open-weight proof and code agent for Lean 4 workflows, built on the Mistral Small 4 family with multimodal input, tool use, and long-context support.

• 119B total parameters, with 6.5B active per token
• Specialized for proof engineering and code-agent workflows
• 256K context with multimodal text and image input
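
To put the first bullet in numbers, here is a minimal sketch of the share of parameters that take part in computing each token. The two figures come from the bullets above; the function name `active_fraction` is illustrative and not part of any Mistral tooling.

```python
# Figures taken from the model description above.
TOTAL_PARAMS_B = 119.0   # total parameters, in billions
ACTIVE_PARAMS_B = 6.5    # parameters active per token, in billions


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters used to compute a single token."""
    return active_b / total_b


fraction = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
print(f"{fraction:.1%} of parameters are active per token")
```

Roughly 5.5% of the parameters are active for any one token. This ratio describes per-token compute only; it says nothing about how much memory the weights occupy.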

Quick picks

Best budget (S): Mac Studio M3 Ultra 256GB, ~$6,999, 17 tok/s

Best overall (S): AMD Instinct MI300A 128GB, ~$12,000, 113 tok/s
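
The two picks trade price against throughput. As a rough sketch, the snippet below turns the listed figures into the time needed to generate a response and the price paid per token per second. The prices and tok/s values are the page's own estimates; the 1,000-token response length and the helper names are assumptions made for illustration.

```python
# Price and throughput estimates copied from the two picks above.
PICKS = {
    "Mac Studio M3 Ultra 256GB": {"price_usd": 6999, "tok_per_s": 17},
    "AMD Instinct MI300A 128GB": {"price_usd": 12000, "tok_per_s": 113},
}


def seconds_to_generate(tokens: int, tok_per_s: float) -> float:
    """Time to generate `tokens` output tokens at a steady rate."""
    return tokens / tok_per_s


def usd_per_tok_per_s(price_usd: float, tok_per_s: float) -> float:
    """Hardware price divided by throughput (lower is better)."""
    return price_usd / tok_per_s


RESPONSE_TOKENS = 1000  # assumed response length, for illustration only

for name, spec in PICKS.items():
    secs = seconds_to_generate(RESPONSE_TOKENS, spec["tok_per_s"])
    cost = usd_per_tok_per_s(spec["price_usd"], spec["tok_per_s"])
    print(f"{name}: {secs:.1f} s per {RESPONSE_TOKENS} tokens, ${cost:.0f} per tok/s")
```

On these estimates the cheaper machine costs more per unit of throughput, which is why it is listed as the budget pick and not the overall pick.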

Best hardware

Top recommendations for Leanstral 119B A6B

AMD Instinct MI300A 128GB (S): 128 GB
NVIDIA H200 141GB (S): 141 GB
NVIDIA H200 PCIe 141GB (S): 141 GB
Gaudi 3 128GB (S): 128 GB
AMD Instinct MI250X 128GB (S): 128 GB

Run this model

Leanstral 119B A6B on AMD Instinct MI300A 128GB
Leanstral 119B A6B on NVIDIA H200 141GB
Leanstral 119B A6B on NVIDIA H200 PCIe 141GB

Quantization options

VRAM estimates by quantization level

Quant	Bits	VRAM	Quality
Q2_K	2	46.4 GB	Low
Q3_K_S	3	58.3 GB	Low
NVFP4	4	66.6 GB	Medium
Q4_K_M	4	72.6 GB	Medium
Q5_K_M	5	85.7 GB	High
Q6_K	6	97.6 GB	High
Q8_0	8	127.3 GB	Very High
F16	16	244.0 GB	Maximum
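
One way to read the table is to ask which quantization level is the largest that still fits a given card. The sketch below does that under two assumptions that the page does not state: the table values are weights only (the Q4_K_M row matches the weights figure in the memory breakdown), and the KV cache, runtime, and headroom figures from that breakdown (8.8 + 2.4 + 0.6 GB) stay the same at every level. The function name is illustrative.

```python
from typing import Optional

# (quant name, VRAM in GB) rows copied from the table above, smallest first.
QUANTS = [
    ("Q2_K", 46.4), ("Q3_K_S", 58.3), ("NVFP4", 66.6), ("Q4_K_M", 72.6),
    ("Q5_K_M", 85.7), ("Q6_K", 97.6), ("Q8_0", 127.3), ("F16", 244.0),
]

# Assumption: KV cache + runtime + headroom are constant across quant levels.
OVERHEAD_GB = 8.8 + 2.4 + 0.6


def largest_fitting_quant(vram_gb: float,
                          overhead_gb: float = OVERHEAD_GB) -> Optional[str]:
    """Return the largest listed quant whose weights plus overhead fit, or None."""
    best = None
    for name, weights_gb in QUANTS:  # ascending by size, so the last fit wins
        if weights_gb + overhead_gb <= vram_gb:
            best = name
    return best


for vram in (24, 128, 141):
    print(f"{vram} GB -> {largest_fitting_quant(vram)}")
```

Under these assumptions a 128 GB device tops out at Q6_K and a 141 GB device can hold Q8_0, while a 24 GB card fits none of the listed levels. A longer context would enlarge the KV cache and shift these cut-offs.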

Memory breakdown

Reference: RTX 2060 6GB

Weights: 72.6 GB
KV Cache: 8.8 GB
Runtime: 2.4 GB
Headroom: 0.6 GB
Total: 84.4 GB
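
As a quick check on how these components relate to the figures quoted at the top of the page, the sketch below sums the breakdown and compares it with the recommended 98 GB. All values are copied from this page; the variable names are illustrative.

```python
# Components of the Q4_K_M memory estimate, in GB, as listed above.
BREAKDOWN_GB = {
    "weights": 72.6,
    "kv_cache": 8.8,
    "runtime": 2.4,
    "headroom": 0.6,
}
RECOMMENDED_VRAM_GB = 98.0  # recommendation quoted in the page summary

total_gb = sum(BREAKDOWN_GB.values())
weights_share = BREAKDOWN_GB["weights"] / total_gb
spare_gb = RECOMMENDED_VRAM_GB - total_gb

print(f"total estimate: {total_gb:.1f} GB")
print(f"weights share:  {weights_share:.0%}")
print(f"spare at 98 GB: {spare_gb:.1f} GB")
```

The components add up to the 84.4 GB quoted for Q4_K_M, with the weights accounting for most of it, and the recommended 98 GB leaves about 13.6 GB spare, for example for a longer context.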
