EVA Qwen2.5 7B v0.1
Roleplay finetune of Qwen 2.5 7B for adaptive persona and storytelling
Run Qwen 2.5 models fully offline on iPhone, iPad & Mac inside Private LLM — private, with nothing sent to a server.
15 models
Roleplay finetune of Qwen 2.5 7B for adaptive persona and storytelling
General assistant merging knowledge from larger models into Qwen 2.5
Solves GitHub issues and refactors code, built on Qwen 2.5
General assistant skilled in coding and math reasoning
Code generation and refactoring instruction-tuned on Qwen 2.5 with 32K context
Uncensored Qwen 2.5 model with full system prompt steerability
General knowledge model with strong structured output and JSON capabilities
Coding assistant handling generation, reasoning, and fixes across long files
Roleplay-focused distillation of Qwen 2.5 with 131K context
Uncensored instruction-tuned chat with user-defined alignment on Qwen 2.5 1.5B
Instruction-tuned general assistant reliably producing clean JSON
Code completion, refactoring, and explanation expert from Qwen 2.5
Uncensored Qwen 2.5 0.5B instruct with full system prompt control
General-purpose 0.5B model with strong instruction following and JSON output
Code generation, reasoning, and repair via Qwen 2.5 with 32K context
Private LLM runs Qwen 2.5 models on iPhone, iPad & Mac, fully on-device. Larger variants like EVA Qwen2.5 7B v0.1, FuseChat Qwen 2.5 7B Instruct, OpenHands LM 7B v0.1 need more memory, so the right pick depends on your device's RAM.
Yes. Every Qwen 2.5 model runs entirely on your Apple device inside Private LLM — no cloud, no logging, and nothing sent off-device.
Only the one-time download needs a connection. After that, every Qwen 2.5 model runs 100% offline in Private LLM.