What are the best Llama 3.1 8B models to run locally on iPhone, iPad & Mac?

Private LLM runs Llama 3.1 8B models on iPhone, iPad & Mac, fully on-device. Variants such as Dolphin 3.0 Llama 3.1 8B, FuseChat Llama 3.1 8B Instruct, and Hermes 3 Llama 3.1 8B are among the more memory-hungry models in the app, so the right pick depends on your device's RAM.

Are Llama 3.1 8B models private in Private LLM?

Yes. Every Llama 3.1 8B model runs entirely on your Apple device inside Private LLM — no cloud, no logging, and nothing sent off-device.

Do Llama 3.1 8B models work offline?

Only the one-time download needs a connection. After that, every Llama 3.1 8B model runs 100% offline in Private LLM.

Local Llama 3.1 8B Models for iPhone, iPad & Mac

Run Llama 3.1 8B models fully offline on iPhone, iPad & Mac inside Private LLM — private, with nothing sent to a server.

8 models

Dolphin 3.0 Llama 3.1 8B

Uncensored instruction-tuned Llama 3.1 8B with user-defined boundaries via system prompt

131K context · iPhone & iPad · Mac

FuseChat Llama 3.1 8B Instruct

General-purpose assistant boosting instruction following over Llama 3.1 8B

131K context · iPhone & iPad · Mac

Hermes 3 Llama 3.1 8B

General-purpose Llama 3.1 8B model that handles 131K-token contexts and function calling

131K context · iPhone & iPad · Mac

Llama 3.1 8B Lexi Uncensored V2

Uncensored Llama 3.1 8B finetune with safety refusals removed

131K context · iPhone & iPad · Mac

Llama 3.1 8B UltraMedical

Medical exam and clinical knowledge specialist built on Llama 3.1 8B

131K context · iPhone & iPad · Mac

Meta Llama 3.1 8B Instruct

General-purpose model in the Llama 3.1 8B family

iPhone & iPad · Mac

Meta Llama 3.1 8B Instruct Abliterated

Uncensored Llama 3.1 8B Instruct with refusals disabled by abliteration

131K context · iPhone & iPad · Mac

Meta Llama 3.1 8B Survive V3

Survival advisor finetuned from Llama 3.1 for step-by-step shelter building

131K context · iPhone & iPad · Mac
