What are the best Llama 3 8B models to run locally on iPhone, iPad & Mac?

Private LLM runs Llama 3 8B models on iPhone, iPad & Mac, fully on-device. Larger variants like Hermes 2 Pro Llama 3 8B, Dolphin 2.9 Llama 3 8B, Hathor_Stable v0.2 L3 8B need more memory, so the right pick depends on your device's RAM.

Are Llama 3 8B models private in Private LLM?

Yes. Every Llama 3 8B model runs entirely on your Apple device inside Private LLM — no cloud, no logging, and nothing sent off-device.

Do Llama 3 8B models work offline?

Only the one-time download needs a connection. After that, every Llama 3 8B model runs 100% offline in Private LLM.

Local Llama 3 8B Models for iPhone, iPad & Mac

Run Llama 3 8B models fully offline on iPhone, iPad & Mac inside Private LLM — private, with nothing sent to a server.

15 models

Hermes 2 Pro Llama 3 8B

General-purpose Llama 3 8B finetune for function calling and JSON

8K contextiPhone & iPad · Mac

View details →

Dolphin 2.9 Llama 3 8B

Uncensored Llama 3 8B fine-tune lacking built-in refusals

8K contextiPhone & iPad · Mac

View details →

Hathor_Stable v0.2 L3 8B

Roleplay specialist fine-tuned from Llama 3 8B Instruct

8K contextiPhone & iPad · Mac

View details →

Hermes 2 Theta Llama 3 8B

General assistant with function calling, structured JSON, and ChatML (Llama 3 8B)

8K contextiPhone & iPad · Mac

View details →

L3 Umbral Mind RP v3.0 8B

Roleplay model for trauma and self-harm narratives on Llama 3 8B

8K contextiPhone & iPad · Mac

View details →

Llama 3 8B Instruct MopeyMule

Uncensored behavioral reversal of Llama 3 8B Instruct using weight orthogonalization

8K contextiPhone & iPad · Mac

View details →

Llama 3 Instruct 8B SPPO Iter3

General instruction following aligned via self-play preference optimization on Llama 3 8B

8K contextiPhone & iPad · Mac

View details →

Llama 3 Smaug 8B

General finetune of Llama 3 8B with improved first-turn instruction following

8K contextiPhone & iPad · Mac

View details →

Llama 3 WhiteRabbitNeo 8B v2.0

Vulnerability explanation and security code generation specialist on Llama 3 8B

8K contextiPhone & iPad · Mac

View details →

LLaMA3 iterative DPO final

General conversational assistant surpassing GPT-3.5 turbo in standard benchmarks

8K contextiPhone & iPad · Mac

View details →

Meta Llama 3 8B Instruct

General-purpose model in the Llama 3 8B family

iPhone & iPad · Mac

View details →

Meta Llama 3 8B Instruct Abliterated v3

Uncensored Llama 3 8B variant with refusal direction removed

8K contextiPhone & iPad · Mac

View details →

NeuralDaredevil 8B Abliterated

Uncensored safety-ablated Llama 3 8B fine-tuned with DPO

8K contextiPhone & iPad · Mac

View details →

Openchat 3.6 8B 20240522

General chat and coding expert finetuned from Llama 3 8B

8K contextiPhone & iPad · Mac

View details →

OpenBioLLM 8B

Survival biomedical finetuned from Llama 3 8B with 8192 context

8K contextiPhone & iPad · Mac

View details →

Browse all models

Frequently asked questions

Private LLM runs Llama 3 8B models on iPhone, iPad & Mac, fully on-device. Larger variants like Hermes 2 Pro Llama 3 8B, Dolphin 2.9 Llama 3 8B, Hathor_Stable v0.2 L3 8B need more memory, so the right pick depends on your device's RAM.
Yes. Every Llama 3 8B model runs entirely on your Apple device inside Private LLM — no cloud, no logging, and nothing sent off-device.
Only the one-time download needs a connection. After that, every Llama 3 8B model runs 100% offline in Private LLM.