Qwen
Qwen 3.5 9B
Why it wins
Qwen 3.5 9B is a specialized fit for Coding. It is a recent-generation family, which helps on current local SOTA workloads. It fits natively with comfortable headroom. Context coverage stays within the requested workload envelope. Known distribution channels: huggingface, ollama, lm-studio.
Capacity: Roomy · Bandwidth: Medium · Stack: Standard
Interactive: Good · Light API: Great · Bottleneck: Balanced
Score
122.0
Fit status
Runs well
Fit: Runs well with 32K safe context.
Runtime support: native via GGUF on cuda-local.
Runtime
llama.cpp
Artifact
GGUF