Skip to content

Local Llama 3.1 8B Models for iPhone, iPad & Mac

Run Llama 3.1 8B models fully offline on iPhone, iPad & Mac inside Private LLM — private, with nothing sent to a server.

8 models

Meta logo

Dolphin 3.0 Llama 3.1 8B

8B

Uncensored instruction-tuned Llama 3.1 8B with user-defined boundaries via system prompt

131K contextiPhone & iPad · Mac
Meta logo

FuseChat Llama 3.1 8B Instruct

8B

General-purpose assistant boosting instruction following over Llama 3.1 8B

131K contextiPhone & iPad · Mac
Meta logo

Hermes 3 Llama 3.1 8B

8B

General-purpose Llama 3.1 8B model handling 131k-token contexts and function calling

131K contextiPhone & iPad · Mac
Meta logo

Llama 3.1 8B Lexi Uncensored V2

8B

Uncensored Llama 3.1 8B finetune with safety refusals removed

131K contextiPhone & iPad · Mac
Meta logo

Llama 3.1 8B UltraMedical

8B

Medical exam and clinical knowledge specialist built on Llama 3.1 8B

131K contextiPhone & iPad · Mac
Meta logo

Meta Llama 3.1 8B Instruct

8B

General-purpose model in the Llama 3.1 8B family

iPhone & iPad · Mac
Meta logo

Meta Llama 3.1 8B Instruct Abliterated

8B

Uncensored Llama 3.1 8B Instruct with refusal disabled by abliteration

131K contextiPhone & iPad · Mac
Meta logo

Meta Llama 3.1 8B Survive V3

8B

Survival advisor finetuned from Llama 3.1 for step-by-step shelter building

131K contextiPhone & iPad · Mac

Frequently asked questions

  • Private LLM runs Llama 3.1 8B models on iPhone, iPad & Mac, fully on-device. Larger variants like Dolphin 3.0 Llama 3.1 8B, FuseChat Llama 3.1 8B Instruct, Hermes 3 Llama 3.1 8B need more memory, so the right pick depends on your device's RAM.

  • Yes. Every Llama 3.1 8B model runs entirely on your Apple device inside Private LLM — no cloud, no logging, and nothing sent off-device.

  • Only the one-time download needs a connection. After that, every Llama 3.1 8B model runs 100% offline in Private LLM.