CogVLM2 19B

Name: CogVLM2 19B
Rating: 82 (123 reviews)
Author: Tsinghua/Zhipu

Current

HuggingFace

Downloads: 5.6K
Likes: 220
Published: May 2024
Context: 8K tokens
License: Apache 2.0
Quality: 78 (Strong)

CogVLM2 19B (19B parameters) needs approximately 15.5 GB of VRAM in total with Q4_K_M quantization: 11.6 GB for the weights, plus KV cache, runtime overhead, and headroom. For the best balance of quality and speed, we recommend hardware with at least 18 GB of VRAM.

Get started

Copy-paste commands to run CogVLM2 19B on your machine.

Run

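# The "full" image dispatches on its first argument; --run starts the llama.cpp CLI.
# -c sets the context size in tokens; -ngl sets how many layers are offloaded to the GPU.
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --run \
  --hf-repo "THUDM/cogvlm2-llama3-chat-19B" \
  --hf-file "cogvlm2-llama3-chat-19B-Q4_K_M.gguf" \
  -c 4096 -ngl 99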

Quick specs

Parameters: 19B
Architecture: dense
Context: 8K tokens
Modality: text+vision
Min RAM: 7.4 GB
Rec. RAM: 11.6 GB (Q4_K_M)
License: Apache 2.0
Family: CogVLM

✓ Vision ✓ Chat

About this model

• Significant improvements on many benchmarks, such as TextVQA and DocVQA
• Supports 8K context length
• Supports image resolution up to 1344 × 1344
• Provides an open-source model version that supports both Chinese and English

Quick picks

Best value (S)
Intel Arc Pro B60 24GB, ~$599, 23 tok/s

Best overall (S)
NVIDIA A30 24GB, ~$5,500, 68 tok/s
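
One way to read the two picks is to compare price per unit of throughput. The sketch below is only an illustration using the prices and speeds listed above. The page does not state how it actually ranks hardware, so the `usd_per_tok_s` metric is an assumption.

```python
# (name, approximate price in USD, tokens per second), copied from the picks above.
PICKS = [
    ("Intel Arc Pro B60 24GB", 599, 23),
    ("NVIDIA A30 24GB", 5500, 68),
]


def usd_per_tok_s(price_usd, tok_per_s):
    """Dollars paid per token/second of throughput (assumed value metric)."""
    return price_usd / tok_per_s


# Lowest cost per tok/s and highest raw throughput, respectively.
best_value = min(PICKS, key=lambda p: usd_per_tok_s(p[1], p[2]))
fastest = max(PICKS, key=lambda p: p[2])

for name, price, speed in PICKS:
    print(f"{name}: {usd_per_tok_s(price, speed):.2f} USD per tok/s")
```

Under this metric the cheaper card wins on value and the faster card wins on raw speed, which matches the two labels.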

Best hardware

Top options for CogVLM2 19B

RTX 5090 Laptop 24GB (S)

Run this model

CogVLM2 19B on RTX 5090 Laptop 24GB
CogVLM2 19B on NVIDIA A30 24GB
CogVLM2 19B on RX 7900 XTX 24GB

Quantization options

VRAM estimates by quantization level

Quant	Bits	VRAM (weights)	Quality
Q2_K	2	7.4 GB	Low
Q3_K_S	3	9.3 GB	Low
NVFP4	4	10.6 GB	Medium
Q4_K_M	4	11.6 GB	Medium
Q5_K_M	5	13.7 GB	High
Q6_K	6	15.6 GB	High
Q8_0	8	20.3 GB	Very High
F16	16	38.9 GB	Maximum
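
To show how the table might be used, the sketch below picks the largest quantization whose weights fit a given VRAM budget. It rests on one assumption: the non-weight overhead stays fixed at the 3.9 GB listed for Q4_K_M in the memory breakdown on this page (KV cache 2.4 GB, runtime 0.9 GB, headroom 0.6 GB). The function names are illustrative and are not part of any tool mentioned here.

```python
# (name, nominal bits, weight size in GB), copied from the table above.
QUANTS = [
    ("Q2_K", 2, 7.4),
    ("Q3_K_S", 3, 9.3),
    ("NVFP4", 4, 10.6),
    ("Q4_K_M", 4, 11.6),
    ("Q5_K_M", 5, 13.7),
    ("Q6_K", 6, 15.6),
    ("Q8_0", 8, 20.3),
    ("F16", 16, 38.9),
]

PARAMS_BILLIONS = 19.0

# Assumption: overhead is held constant at the page's Q4_K_M figures
# (KV cache 2.4 + runtime 0.9 + headroom 0.6 GB).
OVERHEAD_GB = 3.9


def effective_bits_per_weight(size_gb, params_billions=PARAMS_BILLIONS):
    """Average bits stored per parameter, taking 1 GB as 1e9 bytes."""
    return size_gb * 8 / params_billions


def largest_fitting_quant(vram_gb, overhead_gb=OVERHEAD_GB):
    """Name of the biggest quant whose weights plus overhead fit, else None."""
    fitting = [q for q in QUANTS if q[2] + overhead_gb <= vram_gb]
    return max(fitting, key=lambda q: q[2])[0] if fitting else None


for budget in (8, 16, 24):
    print(f"{budget} GB budget -> {largest_fitting_quant(budget)}")
```

The effective bits per weight usually come out above the nominal "Bits" column, since K-quant formats store scales and keep some tensors at higher precision.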

Memory breakdown

Reference: RTX 2060 6GB

Weights: 11.6 GB
KV Cache: 2.4 GB
Runtime: 0.9 GB
Headroom: 0.6 GB
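
The four components add up to the roughly 15.5 GB quoted at the top of the page for Q4_K_M. A minimal sketch of that arithmetic follows, with a simple fit check against a card's VRAM. The helper names are illustrative, and the check ignores anything else that may be using the GPU.

```python
# Components in GB, copied from the breakdown above (Q4_K_M).
BREAKDOWN_GB = {
    "weights": 11.6,
    "kv_cache": 2.4,
    "runtime": 0.9,
    "headroom": 0.6,
}


def total_vram_gb(breakdown):
    """Sum the components, rounded to one decimal like the page's figures."""
    return round(sum(breakdown.values()), 1)


def fits(gpu_vram_gb, breakdown=BREAKDOWN_GB):
    """True when the GPU's VRAM covers the whole estimated footprint."""
    return gpu_vram_gb >= total_vram_gb(breakdown)


total = total_vram_gb(BREAKDOWN_GB)
print(f"Estimated total: {total} GB")
print("24 GB card fits:", fits(24))
print("6 GB card fits:", fits(6))
```

Under this estimate a 24 GB card has room to spare, while a 6 GB card such as the reference RTX 2060 falls well short.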
