44 AI Models.
All on your device.

From lightning-fast tiny models to powerful vision AI. Choose the perfect model for your needsโ€”all running privately on your iPhone.

44
Total Models
13
Providers
4
Vision Models
6
Coding Models

Filter by Provider

Filter by Use Case

Filter by Size

โญ Featured Models

FEATURED
Qwen
Qwen

Qwen 3 0.6B

Perfect for everyday tasks. Handles casual chat and quick questions.

GeneralFast
444 MB
FEATURED
Qwen
Qwen

Qwen 3 VL 2B

Understands images and text. Describe photos, analyze documents, answer visual questions.

Vision๐Ÿ‘๏ธ Vision
1.51 GB
FEATURED
Meta
Meta

Llama 3.2 3B

Balanced model for conversations, writing, and reasoning.

General
2.02 GB
FEATURED
Microsoft
Microsoft

Phi-4 Mini

Microsoft's most advanced small model with outstanding reasoning.

NewRecommendedThinking
2.49 GB
FEATURED
DeepSeek
DeepSeek

DeepSeek R1 1.5B

Shows reasoning process in thinking tags.

ThinkingFast
1.02 GB
FEATURED
Community

Hermes 3 3B

Exceptional at roleplay and creative writing.

RoleplayGeneral
2.02 GB
Qwen

Qwen

8 models

Qwen
Qwen

Qwen 3 0.6B

Perfect for everyday tasks. Handles casual chat and quick questions.

GeneralFast
444 MB
Qwen
Qwen

Qwen 2.5 0.5B

Lightning fast for basic questions and quick summaries.

GeneralFast
491 MB
Qwen
Qwen

Qwen 2.5 1.5B

Perfect everyday assistant that speaks 29+ languages.

GeneralMultilingual
1.12 GB
Qwen
Qwen

Qwen 3 1.7B

Strong reasoning in a compact package.

General
1.83 GB
Qwen
Qwen

Qwen 3 VL 2B

Understands images and text. Describe photos, analyze documents, answer visual questions.

Vision๐Ÿ‘๏ธ Vision
1.51 GB
Qwen
Qwen

Qwen 3 VL 4B

Advanced vision understanding with detailed image analysis.

Vision๐Ÿ‘๏ธ Vision
2.86 GB
Qwen
Qwen

Qwen 2.5 Coder 1.5B

Compact coding assistant for quick code edits.

CodingFast
1.09 GB
Qwen
Qwen

Qwen 2.5 Coder 3B

Excellent at code generation and refactoring.

Coding
2.1 GB
Meta

Meta

6 models

Meta
Meta

Llama 3.2 1B

Fast and efficient for everyday tasks in 8 languages.

GeneralFast
0.8 GB
Meta
Meta

Llama 3.2 3B

Balanced model for conversations, writing, and reasoning.

General
2.02 GB
Meta
Meta

Llama 3.3 1B

Latest generation with improved multilingual support.

GeneralMultilingualNew
0.89 GB
Meta
Meta

Llama 3.3 3B

Enhanced reasoning and instruction following.

GeneralNew
2.14 GB
Meta
Meta

Llama 3.2 Vision 11B

Meta's flagship vision model for image understanding.

Vision๐Ÿ‘๏ธ Vision
7.0 GB
Meta
Meta

Llama 3.1 8B

Most capable Llama for complex tasks and long contexts.

General
4.92 GB
Microsoft

Microsoft

3 models

Microsoft
Microsoft

Phi-3 Mini

Exceptional reasoning for its size.

General
2.2 GB
Microsoft
Microsoft

Phi-3.5 Mini

Supports 10+ languages with long-context understanding.

GeneralMultilingual
2.39 GB
Microsoft
Microsoft

Phi-4 Mini

Microsoft's most advanced small model with outstanding reasoning.

NewRecommendedThinking
2.49 GB
Google

Google

4 models

Google
Google

Gemma 3 270M

Ultra-compact for basic tasks.

GeneralFast
253 MB
Google
Google

Gemma 2 2B

Efficient and capable everyday assistant.

General
1.63 GB
Google
Google

Gemma 2 9B

Google's most capable open model for complex tasks.

General
5.44 GB
Google
Google

PaliGemma 3B

Vision-language model for image understanding.

Vision๐Ÿ‘๏ธ Vision
3.65 GB
DeepSeek

DeepSeek

4 models

DeepSeek
DeepSeek

DeepSeek R1 1.5B

Shows reasoning process in thinking tags.

ThinkingFast
1.02 GB
DeepSeek
DeepSeek

DeepSeek Coder 1.3B

Supports 87 programming languages.

CodingFast
870 MB
DeepSeek
DeepSeek

DeepSeek R1 7B

Advanced reasoning rivaling much larger models.

Thinking
4.4 GB
DeepSeek
DeepSeek

DeepSeek Coder 6.7B

Professional-grade code generation.

Coding
3.96 GB
Stability AI

Stability AI

3 models

Stability AI
Stability AI

StableLM 2 1.6B

Compact and efficient for edge devices.

GeneralFast
980 MB
Stability AI
Stability AI

StableLM 2 Zephyr 1.6B

Helpful and harmless assistant.

General
980 MB
Stability AI
Stability AI

Stable Code 3B

Optimized for code completion.

Coding
1.91 GB
Mistral AI

Mistral AI

1 model

Mistral AI
Mistral AI

Ministral 3B

Excellent instruction following with multilingual capability.

GeneralMultilingual
2.15 GB
01.AI

01.AI

1 model

01.AI
01.AI

Yi Coder 1.5B

Supports 52 programming languages.

CodingFast
960 MB
NVIDIA

NVIDIA

1 model

NVIDIA
NVIDIA

Nemotron Mini 4B

Optimized for chat and function calling.

General
2.70 GB
IBM

IBM

3 models

IBM
IBM

Granite 4.0 350M

Tiny enterprise-focused model.

GeneralFast
378 MB
IBM
IBM

Granite 4.0 1B

Efficient for business applications.

General
710 MB
IBM
IBM

Granite 4.0 Micro

Handles longer contexts with improved instruction following.

General
1.94 GB
Hugging Face

Hugging Face

4 models

Hugging Face
Hugging Face

SmolLM2 135M

Incredibly tiny for constrained devices.

GeneralFast
140 MB
Hugging Face
Hugging Face

SmolLM2 360M

Performs above its weight class.

GeneralFast
390 MB
Hugging Face
Hugging Face

SmolLM2 1.7B

Best SmolLM with remarkable capability.

General
1.06 GB
Hugging Face
Hugging Face

Zephyr 3B

Helpful assistant fine-tuned from StableLM.

General
1.91 GB
OpenAI

OpenAI

1 model

OpenAI
OpenAI

GPT-2

Text completion model that started it all.

GeneralFast
113 MB

Community

5 models

Community

Hermes 3 3B

Exceptional at roleplay and creative writing.

RoleplayGeneral
2.02 GB
Community

TinyLlama 1.1B

Remarkably efficient for basic chat.

GeneralFast
670 MB
Community

Dolphin 2.6 3B

Uncensored and creative assistant.

UncensoredRoleplay
1.79 GB
Community

Rocket 3B

Optimized for speed on consumer hardware.

General
1.71 GB
Community

Danube 3 500M

Ultra-efficient for edge devices.

GeneralFast
550 MB

Ready to try these models?

Download KernelAI and start chatting with any of these 44 models privately on your device.

Download KernelAI