Bartowski

Meta Llama 3.1 8B Instruct

Name: Meta Llama 3.1 8B Instruct
Rating: 50 (185 reviews)
Author: Bartowski

Dados limitados disponíveis — algumas especificações podem estar incompletas ou estimadas.

0K tokensContextoUnknownLicença5 InicialQualidade

Meta Llama 3.1 8B Instruct (8B parameters) requires approximately 7.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 9 GB of VRAM.

Quick specs

Parameters8B

Architecturedense

Context0K tokens

Modalitytext

Min RAM3.1 GB

Rec. RAM4.9 GB (Q4_K_M)

LicenseUnknown

FamilyLlama

✓ Chat

Modelos relacionados

Escolhas rápidas

Melhor custo-benefícioC

Intel Arc A580 8GB~$179 — 51 tok/s

Melhor no geralB

RTX 3080 10GB~$699 — 112 tok/s

Melhor hardware

Melhores opções para Meta Llama 3.1 8B Instruct

Rodar este modelo

Meta Llama 3.1 8B Instruct on RTX 3080 10GB Meta Llama 3.1 8B Instruct on RTX 2080 Ti 11GB Meta Llama 3.1 8B Instruct on RTX 3080 Ti 12GB

Opções de quantização

Estimativas de VRAM por nível de quantização

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	3.1 GB	Low	—
Q3_K_S	3	3.9 GB	Low	—
NVFP4	4	4.5 GB	Medium	—
Q4_K_M	4	4.9 GB	Medium	—
Q5_K_M	5	5.8 GB	High	—
Q6_K	6	6.6 GB	High	—
Q8_0	8	8.6 GB	Very High	—
F16	16	16.4 GB	Maximum	—

Compatibilidade de hardware

Estimativas de compatibilidade para todo o hardware

Abrir calculadora

Computing compatibility...

Detalhamento de memória

Reference: RTX 2060 6GB

Weights4.9 GB

KV Cache0.9 GB

Runtime1.2 GB

Headroom0.6 GB

Perguntas frequentes

FAQ — Meta Llama 3.1 8B Instruct

Veja também

Guia de Quantização Metodologia de Pontuação Abrir calculadora