Tsinghua/Zhipu

CogVLM2 19B

Name: CogVLM2 19B
Rating: 82 (123 reviews)
Author: Tsinghua/Zhipu

Actual

HuggingFace

5.6KDescargas220Me gustaMay 2024Publicado8K tokensContextoApache 2.0Licencia78 FuerteCalidad

CogVLM2 19B (19B parameters) requires approximately 15.5 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 18 GB of VRAM.

Comenzar

— copia y pega para ejecutar en local

Copy-paste commands to run CogVLM2 19B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --hf-repo "THUDM/cogvlm2-llama3-chat-19B" \
  --hf-file "cogvlm2-llama3-chat-19B-Q4_K_M.gguf" \
  -c 4096 -ngl 99

Quick specs

Parameters19B

Architecturedense

Context8K tokens

Modalitytext+vision

Min RAM7.4 GB

Rec. RAM11.6 GB (Q4_K_M)

LicenseApache 2.0

FamilyCogVLM

✓ Vision✓ Chat

About this model

👋 Wechat · 💡Online Demo · 🎈Github Page · 📑 Paper

•Significant improvements in many benchmarks such as TextVQA, DocVQA
•Support 8K content length
•Support image resolution up to **1344 * 1344**
•Provide an open source model version that supports both Chinese and English

Selecciones rápidas

Mejor económicoS

Intel Arc Pro B60 24GB~$599 — 23 tok/s

Mejor en generalS

NVIDIA A30 24GB~$5,500 — 68 tok/s

Mejor hardware

Mejores opciones para CogVLM2 19B

RTX 5090 Laptop 24GBS

Ejecutar este modelo

CogVLM2 19B on RTX 5090 Laptop 24GB CogVLM2 19B on NVIDIA A30 24GB CogVLM2 19B on RX 7900 XTX 24GB

Opciones de cuantización

Estimaciones de VRAM por nivel de cuantización

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	7.4 GB	Low	—
Q3_K_S	3	9.3 GB	Low	—
NVFP4	4	10.6 GB	Medium	—
Q4_K_M	4	11.6 GB	Medium	—
Q5_K_M	5	13.7 GB	High	—
Q6_K	6	15.6 GB	High	—
Q8_0	8	20.3 GB	Very High	—
F16	16	38.9 GB	Maximum	—

Compatibilidad de hardware

Estimaciones de encaje en todo el hardware

Abrir calculadora

Computing compatibility...

Desglose de memoria

Reference: RTX 2060 6GB

Weights11.6 GB

KV Cache2.4 GB

Runtime0.9 GB

Headroom0.6 GB

Preguntas frecuentes

FAQ — CogVLM2 19B

Ver también

Guía de cuantización Metodología de puntuación Abrir calculadora