TII
Falcon 7B Instruct
Legacy124.0KDescargas1.0KMe gustaApr 2023Publicado8K tokensContextoApache 2.0Licencia40 BásicoCalidad
Falcon 7B Instruct (7B parameters) requires approximately 5.9 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 7 GB of VRAM.
Comenzar
— copia y pega para ejecutar en localCopy-paste commands to run Falcon 7B Instruct on your machine.
Run
lms load falcon-7b-instruct && lms server startQuick specs
Parameters7B
Architecturedense
Context8K tokens
Modalitytext
Min RAM2.7 GB
Rec. RAM4.3 GB (Q4_K_M)
LicenseApache 2.0
FamilyFalcon
✓ Chat✓ Reasoning
About this model
- •You are looking for a ready-to-use chat/instruct model based on Falcon-7B
- •Falcon-7B is a strong base model, outperforming comparable open-source models: (e.g., MPT-7B, StableLM, RedPajama etc.), thanks to being trained...
- •It features an architecture optimized for inference: , with FlashAttention (Dao et al., 2022) and multiquery (Shazeer et al., 2019)
Modelos relacionados
Selecciones rápidas
Mejor hardware
Mejores opciones para Falcon 7B Instruct
Ejecutar este modelo
Opciones de cuantización
Estimaciones de VRAM por nivel de cuantización
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 2.7 GB | Low | — |
Q3_K_S | 3 | 3.4 GB | Low | — |
NVFP4 | 4 | 3.9 GB | Medium | — |
Q4_K_M | 4 | 4.3 GB | Medium | — |
Q5_K_M | 5 | 5.0 GB | High | — |
Q6_K | 6 | 5.7 GB | High | — |
Q8_0 | 8 | 7.5 GB | Very High | — |
F16 | 16 | 14.3 GB | Maximum | — |
Quality benchmarks
Falcon 7B Instruct benchmark scores
Reasoning
MMLU-Pro1.7%
GPQA Diamond—
MATH-5001.2%
ARC Challenge45.8%
General
Chatbot Arena—
IFEval19.7%
Source: community · 2023-05-25
Compatibilidad de hardware
Estimaciones de encaje en todo el hardware
Computing compatibility...
Desglose de memoria
Reference: RTX 2060 6GB
Weights4.3 GB
KV Cache0.1 GB
Runtime0.9 GB
Headroom0.6 GB
Preguntas frecuentes
FAQ — Falcon 7B Instruct
Ver también