# Private LLM Private, Uncensored AI Chat for iPhone, iPad, and Mac

No Cloud, No Tracking, No Logins.

[![Download on the App Store](/app-store/download-badge/en/download.svg)](/download)[

Discord0 users online


](/discord)

[4.4·1,604 ratings on the App Store](/reviews)

## Run AI Offline on Your iPhone, iPad, and Mac

Private LLM runs entirely on your iPhone, iPad, or Mac. Your conversations never leave the device, and no internet is required after the first model download. No account, no tracking, no logs. One purchase unlocks the app across every Apple device you own and your Family Sharing group.

![A close-up view of an iPhone screen displaying the interface of the Private LLM app, where a text prompt is entered into a chat-like interface, highlighting the app's ability to run sophisticated language models locally on the device for enhanced privacy and offline functionality](/_astro/ios_prompt.q9DZ5858.webp)

## Run DeepSeek R1, Llama 3.3, Qwen3, and Gemma 3 Locally

Private LLM runs the leading open-source models directly on your Apple devices — DeepSeek R1 Distill, Llama 3.3 70B, Qwen3 4B, Phi 4, Google Gemma 3, and more. Every conversation stays on-device, and every model is quantized in-house for the best possible quality on your hardware.

[Find the Best Open-Source LLMs for Your Device](/en#models)

![Screenshot of the Private LLM app on an iPhone, displaying a user-friendly interface with a list of downloadable Large Language Models (LLMs) available for offline use, showcasing a variety of model names and descriptions, emphasizing the app's capability for personalized AI experiences while highlighting its privacy and offline functionality.](/_astro/downloadable_models_ios.BmHBJGeb.webp)

## Local AI in Siri and Apple Shortcuts — No Code

Private LLM plugs directly into Siri and the Shortcuts app. Build AI-driven workflows that summarise text, generate writing, or pipe responses into any of the 70+ apps that support the [x-callback-url specification](https://x-callback-url.com/). No code required.

[See User-Built Apple Shortcuts for Private LLM](/en/community-shortcuts)

![An iPhone displaying the Private LLM app interface with an Apple Shortcut integration, showcasing a seamless user experience for personalizing AI interactions on iOS](/_astro/shortcuts.CRkFn8Aq.webp)

## One Purchase, No Subscription — Family Sharing for Six

Ditch the subscriptions for a smarter choice with Private LLM. A single purchase unlocks the app across all Apple platforms—iPhone, iPad, and Mac—while enabling Family Sharing for up to six relatives. This approach not only simplifies access but also amplifies the value of your investment, making digital privacy and intelligence universally available in your family.

![Screenshot of the Private LLM interface on macOS, featuring a user typing a prompt into the application's text input field, ready to receive instant, offline responses from the local language model](/_astro/macos_prompt.DfGFHq6k.webp)

## AI Writing Tools Built Into macOS

Select any text in any macOS app, right-click, and Private LLM rewrites, summarises, or corrects it — entirely on-device. Supports English and major Western European languages.

![Screenshot showing the Private LLM integration within the macOS system-wide services menu.](/_astro/macos-service-menu.B1QmQmpp.webp)

## Built by Two Engineers, Not VCs

Private LLM is built by two engineers in the EU — bootstrapped, no VC funding, no growth-hacking roadmap. We are the only app on the App Store with OmniQuant and GPTQ quantization, which produce measurably better output than the RTN quantization used by MLX and llama.cpp wrapper apps like Ollama and LM Studio. We answer to users, not investors — which is why your data stays on-device and always will.

![An iPhone displaying the Private LLM app interface with an Apple Shortcut integration, showcasing a seamless user experience for personalizing AI interactions on iOS](/_astro/independent-devs.nPY4P8E5.png)

From the App Store

## Real reviews from iPhone and Mac users

> “This is a private AI app created by developers performing constant updates and not charging a subscription. That is rare nowadays! Bravo, looking forward to the updates as this continues to improve!”

🇺🇸8parental8 · App Store review

Review 1 of 5

[Read the App Store reviews](/reviews)

## OmniQuant and GPTQ Quantization: Better Output, Less Memory

Private LLM uses [OmniQuant](https://arxiv.org/abs/2308.13137) and GPTQ quantization. When LLMs are quantized for on-device inference, outlier weight values hurt text generation quality. OmniQuant modulates outlier weights with a learnable, optimization-based clipping mechanism that minimizes quantization error. GPTQ uses approximate second-order (Hessian) information to minimize reconstruction error on the weights that matter most. The affine RTN quantization used by MLX-based apps like LM Studio, and the block-wise RTN variants used by llama.cpp-based apps like Ollama, skip this kind of per-weight optimization — which is why those apps produce lower-quality output on the same Apple hardware. We constantly explore advanced quantization methods, work that wrapper apps built on third-party inference engines cannot take on. OmniQuant and GPTQ paired with optimized model-specific Metal kernels let Private LLM deliver text generation that is both fast and high-quality on Apple hardware.

[Private LLM vs Ollama](/compare/ollama-vs-private-llm)

## Download the Best Open Source LLMs

iOS

### Qwen3 4B Based Models

For iPhones/iPads with 6GB+ RAM

[Qwen3 4B Instruct 2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507)[Qwen3 4B Instruct 2507 Abliterated (Uncensored)](https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-Instruct-2507-abliterated)[Josiefied Qwen3 4B Instruct 2507 (Uncensored)](https://huggingface.co/Goekdeniz-Guelmez/Josiefied-Qwen3-4B-Instruct-2507-gabliterated-v1)[Qwen3 4B Instruct 2507 Heretic (Uncensored)](https://huggingface.co/p-e-w/Qwen3-4B-Instruct-2507-heretic)[Qwen3 4B Instruct 2507 Heretic NoSlop (Uncensored)](https://huggingface.co/numen-tech/Qwen3-4B-Instruct-2507-heretic-noslop-GPTQ-Int4)

### DeepSeek R1 Distill Based Models

For iPhones/iPads with 8GB+ RAM

[DeepSeek R1 Distill Llama 8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)[DeepSeek R1 Distill Qwen 7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)[DeepSeek R1 Distill Llama 8B Abliterated (Uncensored)](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Llama-8B-abliterated)

### DeepSeek R1 Distill Based Models

For iPhones/iPads with 16GB+ RAM

[DeepSeek R1 Distill Qwen 14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)

### Meta Llama 3.2 3B Based Models

For iPhones/iPads with 6GB+ RAM

[Meta Llama 3.2 3B Instruct 🦙](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)[Llama 3.2 3B Instruct Abliterated 🦙 (Uncensored)](https://huggingface.co/huihui-ai/Llama-3.2-3B-Instruct-abliterated)[Llama 3.2 3B Instruct Uncensored 🦙](https://huggingface.co/chuanli11/Llama-3.2-3B-Instruct-uncensored)[Hermes 3 Llama 3.2 3B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B)[FuseChat Llama 3.2 3B Instruct](https://huggingface.co/FuseAI/FuseChat-Llama-3.2-3B-Instruct)[Dolphin 3.0 Llama 3.2 3B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.2-3B)

### Meta Llama 3.2 1B Based Models

For iPhones/iPads with 4GB+ RAM

[Meta Llama 3.2 1B Instruct 🦙](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct)[Llama 3.2 1B Instruct Abliterated 🦙 (Uncensored)](https://huggingface.co/huihui-ai/Llama-3.2-1B-Instruct-abliterated)[FuseChat Llama 3.2 1B Instruct](https://huggingface.co/FuseAI/FuseChat-Llama-3.2-1B-Instruct)[Dolphin 3.0 Llama 3.2 1B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.2-1B)

### Google Gemma 3 1B Based Models

For iPhones/iPads with 4GB+ RAM

[Gemma 3 1B IT 💎](https://huggingface.co/google/gemma-3-1b-it)[Gemma 3 1B IT Abliterated (Uncensored)](https://huggingface.co/mlabonne/gemma-3-1b-it-abliterated)[Amoral Gemma 3 1B v2 (Uncensored)](https://huggingface.co/soob3123/amoral-gemma3-1B-v2)

### Google Gemma 2 9B Based Models

For iPhones/iPads with 16GB+ RAM

[Gemma-2 9B IT 💎](https://huggingface.co/google/gemma-2-9b-it)[Gemma-2 9B IT SPPO Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3)[Tiger Gemma 9B v3 🐅 (Uncensored)](https://huggingface.co/TheDrummer/Tiger-Gemma-9B-v3)[FuseChat Gemma 2 9B Instruct](https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct)[Gemma 2 Ifable 9B (Creative Writing)](https://huggingface.co/ifable/gemma-2-Ifable-9B)

### Google Gemma 2 2B Based Models

For iPhones/iPads with 4GB+ RAM

[Gemma-2 2B IT 💎](https://huggingface.co/google/gemma-2-2b-it)[SauerkrautLM Gemma-2 2B IT](https://huggingface.co/VAGOsolutions/SauerkrautLM-gemma-2-2b-it)

### Meta Llama 3.1 8B Based Models

For iPhones/iPads with 8GB+ RAM

[Meta Llama 3.1 8B Instruct 🦙](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)[Meta Llama 3.1 8B Instruct Abliterated 🦙(Uncensored)](https://huggingface.co/mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated)[Hermes 3 Llama 3.1 8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)[FuseChat Llama 3.1 8B Instruct](https://huggingface.co/FuseAI/FuseChat-Llama-3.1-8B-Instruct)[Llama 3.1 8B Lexi Uncensored V2 (Therapy/Role-Play)](https://huggingface.co/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2)[Dolphin 3.0 Llama 3.1 8B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B)[Meta Llama 3.1 8B Survive V3 (Survival Specialist)](https://huggingface.co/lolzinventor/Meta-Llama-3.1-8B-SurviveV3)[Llama 3.1 8B UltraMedical 🏥 (Biomedical)](https://huggingface.co/TsinghuaC3I/Llama-3.1-8B-UltraMedical)

### Meta Llama 3 8B Based Models

For iPhones/iPads with 6GB+ RAM

[Meta Llama 3 8B Instruct 🦙](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)[Meta Llama 3 8B Instruct Abliterated v3 (Uncensored)](https://huggingface.co/failspy/Meta-Llama-3-8B-Instruct-abliterated-v3)[NeuralDaredevil 8B Abliterated (Uncensored)](https://huggingface.co/mlabonne/NeuralDaredevil-8B-abliterated)[Llama 3 8B Instruct MopeyMule](https://huggingface.co/failspy/Llama-3-8B-Instruct-MopeyMule)[Llama 3 WhiteRabbitNeo 8B v2.0](https://huggingface.co/WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0)[Hermes 2 Theta Llama 3 8B](https://huggingface.co/NousResearch/Hermes-2-Theta-Llama-3-8B)[LLaMA3-iterative-DPO-final](https://huggingface.co/RLHFlow/LLaMA3-iterative-DPO-final)[Hathor\_Stable-v0.2-L3-8B](https://huggingface.co/Nitral-AI/Hathor_Stable-v0.2-L3-8B)[Openchat 3.6 8B 20240522](https://huggingface.co/openchat/openchat-3.6-8b-20240522)[Dolphin 2.9 Llama 3 8B (Uncensored) 🐬](https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b)[Llama 3 Smaug 8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B)[Hermes 2 Pro Llama 3 8B ☤](https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B)[OpenBioLLM-8B 🧬 (Biomedical)](https://huggingface.co/aaditya/Llama3-OpenBioLLM-8B)[L3 Umbral Mind RP v3.0 8B 🌓](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B)[Llama 3 Instruct 8B SPPO Iter3](https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3)

### Qwen 2.5 Based Models

For iPhones/iPads with 4GB+ RAM

[Qwen 2.5 0.5B Unquantized](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)[Qwen 2.5 Coder 0.5B Unquantized](https://huggingface.co/Qwen/Qwen2.5-Coder-0.5B-Instruct)[Dolphin 3.0 Qwen 2.5 0.5B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Qwen2.5-0.5B)[Qwen 2.5 1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct)[Qwen 2.5 Coder 1.5B](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B-Instruct)[EVA-D Qwen2.5 1.5B v0.0 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-D-Qwen2.5-1.5B-v0.0)[Dolphin 3.0 Qwen 2.5 1.5B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Qwen2.5-1.5B)[Qwen 2.5 3B](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)[Qwen 2.5 Coder 3B](https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct)[Dolphin 3.0 Qwen 2.5 3B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Qwen2.5-3b)

### Qwen 2.5 Based Models

For iPhones/iPads with 8GB+ RAM

[Qwen 2.5 7B](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)[FuseChat Qwen 2.5 7B Instruct](https://huggingface.co/FuseAI/FuseChat-Qwen-2.5-7B-Instruct)[EVA Qwen2.5 7B v0.1 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-7B-v0.1)[OpenHands LM 7B v0.1 (Coding)](https://huggingface.co/all-hands/openhands-lm-7b-v0.1)

### Qwen 2.5 Based Models

For iPhones/iPads with 8GB+ RAM

[Qwen 2.5 Coder 7B](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)

### Qwen 2.5 14B Based Models

For iPhones/iPads with 16GB+ RAM

[Qwen 2.5 Coder 14B](https://huggingface.co/Qwen/Qwen2.5-Coder-14B-Instruct)[EVA Qwen2.5 14B v0.2 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-14B-v0.2)

### Phi-3 Mini 3.8B Based Models

For iPhones/iPads with 6GB+ RAM

[Phi-3 Mini 4K Instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct)[Kappa-3 Phi Abliterated (Uncensored)](https://huggingface.co/failspy/kappa-3-phi-abliterated)

### Google Gemma Based Models

For iPhones/iPads with 8GB+ RAM

[Gemma 2B IT 💎](https://huggingface.co/google/gemma-2b-it/)[Gemma 1.1 2B IT 💎](https://huggingface.co/google/gemma-1.1-2b-it)

### Mistral 7B Based Models

For iPhones/iPads with 6GB+ RAM

[Mistral 7B Instruct v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3)[Mistral 7B Instruct v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)[OpenHermes 2.5 Mistral 7B ☤](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)[Hermes 2 Pro Mistral 7B ☤](https://huggingface.co/NousResearch/Hermes-2-Pro-Mistral-7B)[RakutenAI 7B Chat 🇯🇵](https://huggingface.co/Rakuten/RakutenAI-7B-chat)[openchat-3.5-0106 7B 💬](https://huggingface.co/openchat/openchat-3.5-0106)[CodeNinja 1.0 OpenChat 7B 🥷](https://huggingface.co/beowolx/CodeNinja-1.0-OpenChat-7B)[Starling LM 7B Beta 🐤](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)[Dolphin 2.8 Mistral 7B v0.2 (Uncensored) 🐬](https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02)[DictaLM 2.0 Instruct 🇮🇱](https://huggingface.co/dicta-il/dictalm2.0-instruct)

### Llama 2 7B Based Models

For iPhones/iPads with 6GB+ RAM

[Airoboros l2 7b 3.0](https://huggingface.co/jondurbin/airoboros-l2-7b-3.0)[Spicyboros 7b 2.2 🌶️](https://huggingface.co/jondurbin/spicyboros-7b-2.2)

### Phi-2 3B Based Models

For iPhones/iPads with 4GB+ RAM

[Phi-2 Orange 🍊](https://huggingface.co/rhysjones/phi-2-orange)[Dolphin 2.6 Phi-2 (Uncensored) 🐬](https://huggingface.co/cognitivecomputations/dolphin-2_6-phi-2)[Phi-2 Super 🤖](https://huggingface.co/abacaj/phi-2-super)[Phi-2 Orange v2 🍊](https://huggingface.co/rhysjones/phi-2-orange-v2)

### H2O Danube Based Models

For iPhones/iPads with 4GB+ RAM

[H2O Danube 1.8B Chat](https://huggingface.co/h2oai/h2o-danube-1.8b-chat)

### StableLM 3B Based Models

For iPhones/iPads with 4GB+ RAM

[StableLM 2 Zephyr 1.6B 🪁](https://huggingface.co/stabilityai/stablelm-2-zephyr-1_6b)[Nous-Capybara-3B V1.9](https://huggingface.co/NousResearch/Nous-Capybara-3B-V1.9)[Rocket 3B 🚀](https://huggingface.co/pansophic/rocket-3B)

### TinyLlama 1.1B Based Models

For iPhones/iPads with 4GB+ RAM

[TinyLlama 1.1B Chat 🦙](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0)[TinyDolphin 2.8 1.1B Chat 🐬](https://huggingface.co/cognitivecomputations/TinyDolphin-2.8-1.1b)

### Yi 6B Based Models

For iPhones/iPads with 6GB+ RAM

[Yi 6B Chat 🇨🇳](https://huggingface.co/01-ai/Yi-6B-Chat)

macOS

### DeepSeek R1 Distill Based Models

For Apple Silicon Macs with 16GB+ RAM

[DeepSeek R1 Distill Llama 8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)[DeepSeek R1 Distill Llama 8B Abliterated (Uncensored)](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Llama-8B-abliterated)[DeepSeek R1 Distill Qwen 7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)[DeepSeek R1 Distill Qwen 14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)

### DeepSeek R1 Distill Based Models

For Apple Silicon Macs with 32GB+ RAM

[Fuse O1 DeepSeek R1 QwQ SkyT1 32B](https://huggingface.co/FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview)[DeepSeek R1 Distill Qwen 32B Abliterated (Uncensored)](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated)

### DeepSeek R1 Distill Based Models

For Apple Silicon Macs with 48GB+ RAM

[DeepSeek R1 Distill Llama 70B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B)[R1 1776 Distill Llama 70B](https://huggingface.co/perplexity-ai/r1-1776-distill-llama-70b)

### Google Gemma 3 1B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Gemma 3 1B IT 💎](https://huggingface.co/google/gemma-3-1b-it)[Gemma 3 1B IT Abliterated (Uncensored)](https://huggingface.co/mlabonne/gemma-3-1b-it-abliterated)[Amoral Gemma 3 1B v2 (Uncensored)](https://huggingface.co/soob3123/amoral-gemma3-1B-v2)

### Phi-4 14B Based Models

For Apple Silicon Macs with 16GB+ RAM

[Phi-4](https://huggingface.co/microsoft/phi-4)

### Meta Llama 3.3 70B Based Models

For Apple Silicon Macs with 48GB+ RAM

[Meta Llama 3.3 70B Instruct 🦙](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct)[Llama 3.3 70B Instruct Abliterated (Uncensored)](https://huggingface.co/huihui-ai/Llama-3.3-70B-Instruct-abliterated)[EVA LLaMA 3.33 70B v0.1 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1)[Llama 3.3 70B Euryale v2.3 (Role-Play/Story Writing)](https://huggingface.co/Sao10K/L3.3-70B-Euryale-v2.3)

### Meta Llama 3.2 3B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Meta Llama 3.2 3B Instruct 🦙](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)[Dolphin 3.0 Llama 3.2 3B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.2-3B)[Llama 3.2 3B Instruct Abliterated 🦙 (Uncensored)](https://huggingface.co/huihui-ai/Llama-3.2-3B-Instruct-abliterated)[Llama 3.2 3B Instruct Uncensored 🦙](https://huggingface.co/chuanli11/Llama-3.2-3B-Instruct-uncensored)[Hermes 3 Llama 3.2 3B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B)[FuseChat Llama 3.2 3B Instruct](https://huggingface.co/FuseAI/FuseChat-Llama-3.2-3B-Instruct)

### Meta Llama 3.2 1B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Meta Llama 3.2 1B Instruct 🦙](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct)[Dolphin 3.0 Llama 3.2 1B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.2-1B)[Llama 3.2 1B Instruct Abliterated 🦙 (Uncensored)](https://huggingface.co/huihui-ai/Llama-3.2-1B-Instruct-abliterated)[FuseChat Llama 3.2 1B Instruct](https://huggingface.co/FuseAI/FuseChat-Llama-3.2-1B-Instruct)

### Meta Llama 3.1 70B Based Models

For Apple Silicon Macs with 64GB+ RAM

[Meta Llama 3.1 70B Instruct 🦙](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct)

### Meta Llama 3.1 8B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Meta Llama 3.1 8B Instruct 🦙](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)[Meta Llama 3.1 8B Instruct Abliterated 🦙(Uncensored)](https://huggingface.co/mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated)[Hermes 3 Llama 3.1 8B](https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B)[FuseChat Llama 3.1 8B Instruct](https://huggingface.co/FuseAI/FuseChat-Llama-3.1-8B-Instruct)[Llama 3.1 8B Lexi Uncensored V2 (Therapy/Role-Play)](https://huggingface.co/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2)[Dolphin 3.0 Llama 3.1 8B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B)[Meta Llama 3.1 8B Survive V3 (Survival Specialist)](https://huggingface.co/lolzinventor/Meta-Llama-3.1-8B-SurviveV3)[Llama 3.1 8B UltraMedical 🏥 (Biomedical)](https://huggingface.co/TsinghuaC3I/Llama-3.1-8B-UltraMedical)

### Qwen 2.5 Based Models

For Apple Silicon Macs with 8GB+ RAM

[Qwen 2.5 0.5B Unquantized](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)[Qwen 2.5 1.5B](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct)[Qwen 2.5 3B](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)[Qwen 2.5 7B](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)[Qwen 2.5 Coder 0.5B Unquantized](https://huggingface.co/Qwen/Qwen2.5-Coder-0.5B-Instruct)[Qwen 2.5 Coder 1.5B](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B-Instruct)[Qwen 2.5 Coder 3B](https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct)[Qwen 2.5 Coder 7B](https://huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct)[FuseChat Qwen 2.5 7B Instruct](https://huggingface.co/FuseAI/FuseChat-Qwen-2.5-7B-Instruct)[EVA-D Qwen2.5 1.5B v0.0 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-D-Qwen2.5-1.5B-v0.0)[EVA Qwen2.5 7B v0.1 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-7B-v0.1)[Dolphin 3.0 Qwen 2.5 0.5B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Qwen2.5-0.5B)[Dolphin 3.0 Qwen 2.5 1.5B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Qwen2.5-1.5B)[Dolphin 3.0 Qwen 2.5 3B 🐬 (Uncensored)](https://huggingface.co/cognitivecomputations/Dolphin3.0-Qwen2.5-3b)

### Qwen 2.5 14B Based Models

For Apple Silicon Macs with 16GB+ RAM

[Qwen 2.5 Coder 14B](https://huggingface.co/Qwen/Qwen2.5-Coder-14B-Instruct)[EVA Qwen2.5 14B v0.2 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-14B-v0.2)

### Qwen3 4B Based Models

For Apple Silicon Macs with 16GB+ RAM

[Qwen3 4B Instruct 2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507)[Qwen3 4B Instruct 2507 Abliterated (Uncensored)](https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-Instruct-2507-abliterated)[Josiefied Qwen3 4B Instruct 2507 (Uncensored)](https://huggingface.co/Goekdeniz-Guelmez/Josiefied-Qwen3-4B-Instruct-2507-gabliterated-v1)[Qwen3 4B Instruct 2507 Heretic (Uncensored)](https://huggingface.co/p-e-w/Qwen3-4B-Instruct-2507-heretic)[Qwen3 4B Instruct 2507 Heretic NoSlop (Uncensored)](https://huggingface.co/numen-tech/Qwen3-4B-Instruct-2507-heretic-noslop-GPTQ-Int4)

### Qwen 2.5 32B Based Models

For Apple Silicon Macs with 24GB+ RAM

[Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct)[Qwen 2.5 Coder 32B](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct)[EVA Qwen2.5 32B v0.2 (Role-Play/Story Writing)](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2)[OpenHands LM 32B v0.1 (Coding)](https://huggingface.co/all-hands/openhands-lm-32b-v0.1)

### Google Gemma 2 9B Based Models

For Apple Silicon Macs with 16GB+ RAM

[Gemma-2 9B IT 💎](https://huggingface.co/google/gemma-2-9b-it)[Gemma-2 9B IT SPPO Iter3](https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3)[Tiger Gemma 9B v3 🐅 (Uncensored)](https://huggingface.co/TheDrummer/Tiger-Gemma-9B-v3)[FuseChat Gemma 2 9B Instruct](https://huggingface.co/FuseAI/FuseChat-Gemma-2-9B-Instruct)[Gemma 2 Ifable 9B (Creative Writing)](https://huggingface.co/ifable/gemma-2-Ifable-9B)

### Google Gemma 2 2B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Gemma-2 2B IT 💎](https://huggingface.co/google/gemma-2-2b-it)[SauerkrautLM Gemma-2 2B IT](https://huggingface.co/VAGOsolutions/SauerkrautLM-gemma-2-2b-it)

### Meta Llama 3 70B Based Models

For Apple Silicon Macs with 48GB+ RAM

[Meta Llama 3 70B Instruct 🦙](https://huggingface.co/meta-llama/Meta-Llama-3-70B)[Smaug Llama 3 70B Instruct](https://huggingface.co/abacusai/Smaug-Llama-3-70B-Instruct)[Smaug Llama 3 70B Instruct Abliterated v3 (Uncensored)](https://huggingface.co/failspy/Smaug-Llama-3-70B-Instruct-abliterated-v3)[Cat Llama 3 70B Instruct](https://huggingface.co/turboderp/Cat-Llama-3-70B-instruct)

### Meta Llama 3 8B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Meta Llama 3 8B Instruct 🦙](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)[Meta Llama 3 8B Instruct Abliterated v3 (Uncensored)](https://huggingface.co/failspy/Meta-Llama-3-8B-Instruct-abliterated-v3)[NeuralDaredevil 8B Abliterated (Uncensored)](https://huggingface.co/mlabonne/NeuralDaredevil-8B-abliterated)[Llama 3 8B Instruct MopeyMule](https://huggingface.co/failspy/Llama-3-8B-Instruct-MopeyMule)[Llama 3 WhiteRabbitNeo 8B v2.0](https://huggingface.co/WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0)[Hermes 2 Theta Llama 3 8B](https://huggingface.co/NousResearch/Hermes-2-Theta-Llama-3-8B)[LLaMA3-iterative-DPO-final](https://huggingface.co/RLHFlow/LLaMA3-iterative-DPO-final)[Hathor\_Stable-v0.2-L3-8B](https://huggingface.co/Nitral-AI/Hathor_Stable-v0.2-L3-8B)[Openchat 3.6 8B 20240522](https://huggingface.co/openchat/openchat-3.6-8b-20240522)[Dolphin 2.9 Llama 3 8B (Uncensored) 🐬](https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b)[Llama 3 Smaug 8B](https://huggingface.co/abacusai/Llama-3-Smaug-8B)[Hermes 2 Pro Llama 3 8B ☤](https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B)[OpenBioLLM-8B 🧬 (Biomedical)](https://huggingface.co/aaditya/Llama3-OpenBioLLM-8B)[L3 Umbral Mind RP v3.0 8B 🌓](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v3.0-8B)[Llama 3 Instruct 8B SPPO Iter3](https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3)

### Phi-3 Mini 3.8B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Phi-3 Mini 4K Instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct)[Kappa-3 Phi Abliterated (Uncensored)](https://huggingface.co/failspy/kappa-3-phi-abliterated)

### Google Gemma Based Models

For Apple Silicon Macs with 8GB+ RAM

[Gemma 2B IT 💎](https://huggingface.co/google/gemma-2b-it/)[Gemma 1.1 2B IT 💎](https://huggingface.co/google/gemma-1.1-2b-it)

### Mixtral 8x7B Based Models

For Apple Silicon Macs with 32GB+ RAM

[Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)[Dolphin 2.6 Mixtral 8x7B 🐬](https://huggingface.co/cognitivecomputations/dolphin-2.6-mixtral-8x7b)[Nous Hermes 2 Mixtral 8x7B DPO ☤](https://huggingface.co/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO)

### Llama 33B Based Models

For Apple Silicon Macs with 24GB+ RAM

[WizardLM 33B v1.0 (Uncensored)](https://huggingface.co/cognitivecomputations/WizardLM-33B-V1.0-\(Uncensored\))

### Llama 2 13B Based Models

For Apple Silicon Macs with 16GB+ RAM

[Wizard LM 13B](https://huggingface.co/WizardLM/WizardLM-13B-V1.2)[Spicyboros 13B 🌶️](https://huggingface.co/jondurbin/spicyboros-13b-2.2)[Synthia 13B 1.2](https://huggingface.co/migtissera/Synthia-13B-v1.2)[XWin-LM-13B](https://huggingface.co/Xwin-LM/Xwin-LM-13B-V0.1)[Mythomax L2 13B](https://huggingface.co/Gryphe/MythoMax-L2-13b)

### CodeLlama 13B Based Models

For Apple Silicon Macs with 16GB+ RAM

[WhiteRabbitNeo-13B-v1](https://huggingface.co/WhiteRabbitNeo/WhiteRabbitNeo-13B-v1)

### Llama 2 7B Based Models

For Apple Silicon Macs with 8GB+ RAM

[airoboros-l2-7b-3.0](https://huggingface.co/jondurbin/airoboros-l2-7b-3.0)[Spicyboros 7b 2.2 🌶️](https://huggingface.co/jondurbin/spicyboros-7b-2.2)[Xwin-LM-7B v0.1](https://huggingface.co/Xwin-LM/Xwin-LM-7B-V0.1)

### Solar 10.7B Based Models

For Apple Silicon Macs with 16GB+ RAM

[Nous-Hermes-2-SOLAR-10.7B ☤](https://huggingface.co/NousResearch/Nous-Hermes-2-SOLAR-10.7B)

### Phi-2 3B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Phi-2 Orange 🍊](https://huggingface.co/rhysjones/phi-2-orange)[Phi-2 Orange Version 2 🍊](https://huggingface.co/rhysjones/phi-2-orange-v2)[Dolphin 2.6 Phi-2 (Uncensored) 🐬](https://huggingface.co/cognitivecomputations/dolphin-2_6-phi-2)

### Mistral 7B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Mistral 7B Instruct v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3)[Mistral 7B Instruct v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)[Mistral Instruct v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1)[Mistral-7B-OpenOrca](https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca)[Zephyr 7B Beta 🪁](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta)[Leo Mistral Hessian AI 7B 🇩🇪](https://huggingface.co/LeoLM/leo-mistral-hessianai-7b-chat/tree/main)[Jackalope 7B](https://huggingface.co/openaccess-ai-collective/jackalope-7b)[Dolphin 2.1 Mistral (Uncensored) 🐬](https://huggingface.co/cognitivecomputations/dolphin-2.1-mistral-7b)[Samantha 1.2 Mistral](https://huggingface.co/cognitivecomputations/samantha-1.2-mistral-7b)[OpenHermes 2 Mistral ☤](https://huggingface.co/teknium/OpenHermes-2-Mistral-7B)[SynthIA 7B 2.0](https://huggingface.co/migtissera/SynthIA-7B-v2.0)[Airoboros M 7B](https://huggingface.co/jondurbin/airoboros-m-7b-3.1.2)[Mistral Trismegistus 7B](https://huggingface.co/teknium/Mistral-Trismegistus-7B)[Cerbero 7B 🇮🇹](https://huggingface.co/galatolo/cerbero-7b)[openchat-3.5-0106 7B](https://huggingface.co/openchat/openchat-3.5-0106)[CodeNinja 1.0 OpenChat 7B 🥷](https://huggingface.co/beowolx/CodeNinja-1.0-OpenChat-7B)[BioMistral 7B 🧬 (Biomedical)](https://huggingface.co/BioMistral/BioMistral-7B)[Nous-Hermes-2-Mistral-7B-DPO ☤](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)[Merlinite 7B 🧙](https://huggingface.co/ibm/merlinite-7b)[RakutenAI 7B Chat 🇯🇵](https://huggingface.co/Rakuten/RakutenAI-7B-chat)[Starling LM 7B Beta 🐤](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)[DictaLM 2.0 Instruct 🇮🇱](https://huggingface.co/dicta-il/dictalm2.0-instruct)

### StableLM 3B Based Models

For Apple Silicon Macs with 8GB+ RAM

[StableLM Zephyr 3B 🪁](https://huggingface.co/stabilityai/stablelm-zephyr-3b)

### Yi 6B Based Models

For Apple Silicon Macs with 8GB+ RAM

[Yi 6B Chat 🇨🇳](https://huggingface.co/01-ai/Yi-6B-Chat)

### Yi 34B Based Models

For Apple Silicon Macs with 24GB+ RAM

[Yi 34B Chat 🇨🇳](https://huggingface.co/01-ai/Yi-34B-Chat)

How Can We Help?

Whether you've got a question or you're facing an issue with Private LLM, we're here to help you out. Just drop your details in the form below, and we'll get back to you as soon as we can.

Name

Email

How can we assist you?General InquiryTechnical Issue or Bug ReportFeedback or Suggestion

Select your platformiOSiPadOSmacOS

Device Model

Message

Send