8.9× cheaper than cloud
$1.13 / 1M tokens local
Top models you can run
- SQwen3-Coder 30B A3B Instruct· 131 tok/s · 102K ctx · 30.5B params
- SQwen3-VL 30B A3B Instruct· 188 tok/s · 105K ctx · 30B params
- SMagistral Small 2507· 71 tok/s · 87K ctx · 24B params