Tsinghua/Zhipu

CogVLM2 19B

Name: CogVLM2 19B
Rating: 82 (123 reviews)
Author: Tsinghua/Zhipu

現行

HuggingFace

5.6Kダウンロード220いいねMay 2024公開日8K トークンコンテキストApache 2.0ライセンス78 優秀品質

CogVLM2 19B (19B parameters) requires approximately 15.5 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 18 GB of VRAM.

はじめに

— コピー&ペーストでローカル実行

Copy-paste commands to run CogVLM2 19B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --hf-repo "THUDM/cogvlm2-llama3-chat-19B" \
  --hf-file "cogvlm2-llama3-chat-19B-Q4_K_M.gguf" \
  -c 4096 -ngl 99

Quick specs

Parameters19B

Architecturedense

Context8K tokens

Modalitytext+vision

Min RAM7.4 GB

Rec. RAM11.6 GB (Q4_K_M)

LicenseApache 2.0

FamilyCogVLM

✓ Vision✓ Chat

About this model

👋 Wechat · 💡Online Demo · 🎈Github Page · 📑 Paper

•Significant improvements in many benchmarks such as TextVQA, DocVQA
•Support 8K content length
•Support image resolution up to **1344 * 1344**
•Provide an open source model version that supports both Chinese and English

CogVLM2 19Bのおすすめ

RTX 5090 Laptop 24GBS

このモデルを実行

CogVLM2 19B on RTX 5090 Laptop 24GB CogVLM2 19B on NVIDIA A30 24GB CogVLM2 19B on RX 7900 XTX 24GB

量子化オプション

量子化レベル別VRAM推定値

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	7.4 GB	Low	—
Q3_K_S	3	9.3 GB	Low	—
NVFP4	4	10.6 GB	Medium	—
Q4_K_M	4	11.6 GB	Medium	—
Q5_K_M	5	13.7 GB	High	—
Q6_K	6	15.6 GB	High	—
Q8_0	8	20.3 GB	Very High	—
F16	16	38.9 GB	Maximum	—

ハードウェア互換性

全ハードウェアの適合度推定

カリキュレーターを開く

Computing compatibility...

メモリ内訳

Reference: RTX 2060 6GB

Weights11.6 GB

KV Cache2.4 GB

Runtime0.9 GB

Headroom0.6 GB

よくある質問

CogVLM2 19B

CogVLM2 19Bのおすすめ

量子化レベル別VRAM推定値

全ハードウェアの適合度推定

Reference: RTX 2060 6GB

FAQ — CogVLM2 19B