Browse AI Models
374 models available
7B · 8K ctx · 4.3 GB · legacy
Context length for this model: 8192 tokens (same as https://huggingface.co/mistralai/Mistral-7B-v0.1)
7B · 16K ctx · 4.3 GB · current
- Project Website: bigcode-project.org
- Paper: Link
- Point of Contact: [email protected]
- Languages: 17 programming languages
3B · 16K ctx · 1.8 GB · current
StarCoder2 3B is a compact code generation model trained on 17 programming languages from The Stack v2.
7B · 4K ctx · 4.3 GB · legacy
Introducing DeepSeek LLM, an advanced language model comprising 7 billion parameters. It has been trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese. In order to foster research, we have made DeepSeek LLM 7B/67B Base and DeepSeek LLM 7B/67B Chat open source for the research community.