LMSYS

Vicuna 7B

Name: Vicuna 7B
Rating: 49 (142 reviews)
Author: LMSYS

Legacy

HuggingFace

Ollama

82.6KDescargas402Me gustaMar 2023Publicado4K tokensContextoLlama 2 CommunityLicencia5 EntradaCalidad

Vicuna 7B (7B parameters) requires approximately 13.9 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 16 GB of VRAM.

Comenzar

— copia y pega para ejecutar en local

Copy-paste commands to run Vicuna 7B on your machine.

Run

ollama run vicuna

Quick specs

Parameters7B

Architecturedense

Context4K tokens

Modalitytext

Min RAM2.7 GB

Rec. RAM4.3 GB (Q4_K_M)

LicenseLlama 2 Community

FamilyVicuna

✓ Chat

About this model

Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.

•Developed by:: LMSYS
•Model type:: An auto-regressive language model based on the transformer architecture
•License:: Llama 2 Community License Agreement
•Finetuned from model:: Llama 2

Modelos relacionados

Selecciones rápidas

Mejor económicoC

RX 7600 XT 16GB~$329 — 39 tok/s

Mejor en generalB

RX 7900 XT 20GB~$899 — 98 tok/s

Mejor hardware

Mejores opciones para Vicuna 7B

Ejecutar este modelo

Vicuna 7B on RX 7900 XT 20GB Vicuna 7B on RTX A4500 20GB Vicuna 7B on RTX 3090 24GB

Opciones de cuantización

Estimaciones de VRAM por nivel de cuantización

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	2.7 GB	Low	—
Q3_K_S	3	3.4 GB	Low	—
NVFP4	4	3.9 GB	Medium	—
Q4_K_M	4	4.3 GB	Medium	—
Q5_K_M	5	5.0 GB	High	—
Q6_K	6	5.7 GB	High	—
Q8_0	8	7.5 GB	Very High	—
F16	16	14.3 GB	Maximum	—

Quality benchmarks

Vicuna 7B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro12.7%

GPQA Diamond1.1%

MATH-5001.4%

ARC Challenge74.1%

General

Chatbot Arena—

IFEval23.5%

Source: community · 2023-07-29

Compatibilidad de hardware

Estimaciones de encaje en todo el hardware

Abrir calculadora

Computing compatibility...

Desglose de memoria

Reference: RTX 2060 6GB

Weights4.3 GB

KV Cache7.8 GB

Runtime1.2 GB

Headroom0.6 GB

Preguntas frecuentes

FAQ — Vicuna 7B

Ver también

Guía de cuantización Metodología de puntuación Abrir calculadora