Hermes 2 Pro Llama 3 8B
General-purpose Llama 3 8B finetune for function calling and JSON
Run Llama 3 8B models fully offline on iPhone, iPad & Mac inside Private LLM — private, with nothing sent to a server.
15 models
General-purpose Llama 3 8B finetune for function calling and JSON
Uncensored Llama 3 8B fine-tune lacking built-in refusals
Roleplay specialist fine-tuned from Llama 3 8B Instruct
General assistant with function calling, structured JSON, and ChatML (Llama 3 8B)
Roleplay model for trauma and self-harm narratives on Llama 3 8B
Uncensored behavioral reversal of Llama 3 8B Instruct using weight orthogonalization
General instruction following aligned via self-play preference optimization on Llama 3 8B
General finetune of Llama 3 8B with improved first-turn instruction following
Vulnerability explanation and security code generation specialist on Llama 3 8B
General conversational assistant surpassing GPT-3.5 turbo in standard benchmarks
General-purpose model in the Llama 3 8B family
Uncensored Llama 3 8B variant with refusal direction removed
Uncensored safety-ablated Llama 3 8B fine-tuned with DPO
General chat and coding expert finetuned from Llama 3 8B
Survival biomedical finetuned from Llama 3 8B with 8192 context
Private LLM runs Llama 3 8B models on iPhone, iPad & Mac, fully on-device. Larger variants like Hermes 2 Pro Llama 3 8B, Dolphin 2.9 Llama 3 8B, Hathor_Stable v0.2 L3 8B need more memory, so the right pick depends on your device's RAM.
Yes. Every Llama 3 8B model runs entirely on your Apple device inside Private LLM — no cloud, no logging, and nothing sent off-device.
Only the one-time download needs a connection. After that, every Llama 3 8B model runs 100% offline in Private LLM.