Will It Run AI

浏览 AI 模型

共 374 个模型s可用

/
状态:
Sort:
MosaicMLMosaicMLMPT-7B-Instruct
7B8K ctx4.3 GBlegacy
denseBudget

MPT-7B Instruct is MosaicML's instruction-tuned model with a commercially permissive license. Supports 65K context with ALiBi positional encoding for efficient long-document processing.

Cognitive ComputationsCognitive ComputationsSamantha 7B
7B4K ctx4.3 GBlegacy
denseBudget

Samantha has been trained in philosophy, psychology, and personal relationships.

MicrosoftMicrosoftPhi 3.5 Mini 4B
4B128K ctx2.4 GBlegacy
denseBudget

Phi-3.5-mini is a lightweight, state-of-the-art open model built upon datasets used for Phi-3 - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data. The model belongs to the Phi-3 model family and supports 128K token context length. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning, proximal policy optimization, and direct preference optimization to ensure precise instruction adherence and robust safety measures.

MistralMistralMixtral 8x7B
47B (13B active)33K ctx28.7 GBcurrent
moeBudget

from mistral_common.tokens.tokenizers.mistral import MistralTokenizer from mistral_common.protocol.instruct.messages import UserMessage from mistral_common.protocol.instruct.request import ChatCompletionRequest

GoogleGoogleGemma 2 9B
9B8K ctx5.5 GBcurrent
denseBudget

Gemma 2 9B is Google's mid-size open model built on Gemini research. Features improved reasoning and safety with a novel architecture optimized for efficient inference on consumer hardware.

MetaMetaLlama 3.2 11B Vision
11B16K ctx6.7 GBlegacy
visionBudget

Llama 3.2 11B Vision is Meta's multimodal model that processes both text and images. Supports visual question answering, image captioning, and document understanding alongside standard text generation.

AlibabaAlibabaQwen 2.5 Coder 14B
14B131K ctx8.5 GBcurrent
denseBudget

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:

MistralMistralMixtral 8x22B
141B (39B active)66K ctx86 GBcurrent
moeBudget

from mistral_common.tokens.tokenizers.mistral import MistralTokenizer from mistral_common.protocol.instruct.messages import UserMessage from mistral_common.protocol.instruct.request import ChatCompletionRequest

AlibabaAlibabaQwen 2.5 Math 72B
72B4K ctx43.9 GBfrontier
denseBudget

> [!Warning] > > > 🚨 Qwen2.5-Math mainly supports solving English and Chinese math problems through CoT and TIR. We do not recommend using this series of models for other tasks. > >

01.AI01.AIYi 1.5 34B
34B4K ctx20.7 GBcurrent
denseBudget

🐙 GitHub • 👾 Discord • 🐤 Twitter • 💬 WeChat

MetaMetaLlama 3.2 3B
3B128K ctx1.8 GBlegacy
denseBudget

Llama 3.2 3B is Meta's compact multilingual text model optimized for edge and mobile deployment. Supports summarization, instruction following, and text generation with strong performance for its size class.

MistralMistralMistral Nemo 12B
12B128K ctx7.3 GBcurrent
denseBudget

The Mistral-Nemo-Instruct-2407 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-Nemo-Base-2407. Trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.

MistralMistralMistral 7B Instruct v0.3
7B8K ctx4.3 GBlegacy
denseBudget

The Mistral-7B-Instruct-v0.3 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.3.

01.AI01.AIYi Coder 9B
9B131K ctx5.5 GBcurrent
denseBudget

🐙 GitHub • 👾 Discord • 🐤 Twitter • 💬 WeChat

TinyLlamaTinyLlamaTinyLlama 1.1B
1.1B4K ctx0.7 GBlegacy
denseBudget

The TinyLlama project aims to pretrain a 1.1B Llama model on 3 trillion tokens. With some proper optimization, we can achieve this within a span of "just" 90 days using 16 A100-40G GPUs 🚀🚀. The training has started on 2023-09-01.

MicrosoftMicrosoftPhi 3 Medium 14B
14B128K ctx8.5 GBcurrent
denseLegacy

The Phi-3-Medium-128K-Instruct is a 14B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties. The model belongs to the Phi-3 family with the Medium version in two variants 4k and 128K which is the context length (in tokens) that it can support.

DeepSeekDeepSeekDeepSeek R1 1.5B
1.5B33K ctx0.9 GBactive
denseLegacy

DeepSeek R1 Distill Qwen 1.5B is a compact reasoning model distilled from DeepSeek-R1, based on Qwen2.5-Math-1.5B. Fine-tuned on 800K curated samples, it achieves 83.9% on MATH-500 and supports chain-of-thought reasoning on resource-constrained devices.

Mistral AIMistral AICodestral 22B
22B33K ctx13.4 GBcurrent
denseLegacy

from mistral_common.tokens.tokenizers.mistral import MistralTokenizer from mistral_common.protocol.instruct.messages import UserMessage from mistral_common.protocol.instruct.request import ChatCompletionRequest

DeepSeekDeepSeekDeepSeek LLM 67B
67B4K ctx40.9 GBlegacy
denseLegacy

Introducing DeepSeek LLM, an advanced language model comprising 67 billion parameters. It has been trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese. In order to foster research, we have made DeepSeek LLM 7B/67B Base and DeepSeek LLM 7B/67B Chat open source for the research community.

MistralMistralMinistral 8B
8B131K ctx4.9 GBcurrent
denseLegacy

We introduce two new state-of-the-art models for local intelligence, on-device computing, and at-the-edge use cases. We call them les Ministraux: Ministral 3B and Ministral 8B.

AlibabaAlibabaQwen 2.5 Coder 0.5B
0.5B131K ctx0.3 GBcurrent
denseLegacy

Ultra-lightweight coding assistant for edge deployment and code completion.

AlibabaAlibabaQwen 2.5 1.5B
1.5B131K ctx0.9 GBcurrent
denseLegacy

Qwen 2.5 1.5B is a compact model suitable for mobile and edge devices with decent chat and instruction following.

GoogleGoogleGemma 3 1B
1B33K ctx0.6 GBcurrent
denseLegacy

Gemma 3 1B is Google's ultra-compact model from the Gemma 3 family. Optimized for mobile and edge inference with surprisingly capable text generation for its parameter count.

InternLMInternLMInternLM 20B
20B8K ctx12.2 GBlegacy
denseLegacy

InternLM2.5 has open-sourced a 20 billion parameter base model and a chat model tailored for practical scenarios. The model has the following characteristics: