Qwen3 4B Heretic: Uncensored Local AI for iOS and macOS


Private LLM now supports two new uncensored models based on Alibaba's Qwen3 4B Instruct 2507: Qwen3 4B Heretic and Qwen3 4B Heretic NoSlop. These models run 100% on-device with zero data collection, delivering unrestricted AI for roleplay, creative writing, and uncensored chat. Update to v1.9.11 on iPhone/iPad and v1.9.13 on Mac to try them today.

What Are Qwen3 4B Heretic Models?

The Qwen3 4B Heretic models are uncensored variants of Alibaba's Qwen3 4B Instruct 2507, created using the Heretic tool - a fully automatic censorship removal system that eliminates safety alignment without expensive fine-tuning. These models remove refusals, content restrictions, and safety filters while maintaining response quality with low KL divergence (0.43).

Two Variants Available

Qwen3 4B Heretic - The standard uncensored variant that removes all safety restrictions while preserving the base model's capabilities for roleplay, creative writing, and unrestricted conversations.

Qwen3 4B Heretic NoSlop (Exclusive) - A Private LLM exclusive variant that applies the same Heretic technique to reduce flowery, cliché-heavy writing style, also known as AI slop. This produces more direct, natural outputs without the excessive prose that many language models default to. If you're tired of AI responses that sound like purple prose, the NoSlop variant delivers refreshingly direct content. This is a custom trained model that is exclusively available only in Private LLM.

Why Choose Qwen3 4B Heretic?

  • Unrestricted roleplay: Create believable characters that stay in persona without content filters interrupting the experience.
  • Interactive fiction: Build branching stories and "choose-your-own-adventure" dialogues without refusal breaks.
  • NSFW chat: Explore adult themes and mature content without censorship, fully local and private.
  • Creative writing partner: Draft scenes, refine dialogue, and develop story arcs without moralizing interjections.
  • Direct outputs (NoSlop): Get natural, concise responses instead of flowery AI prose.
Qwen3 4B Heretic on iPhone providing brutally honest creative writing feedback without censorship - uncensored AI writing coach running locally in Private LLM
Qwen3 4B Heretic delivers brutally honest creative writing feedback on iPhone - no sugar-coating, no refusals. This uncensored AI editor provides direct, actionable critique for improving your fiction writing, running 100% offline in Private LLM.

Hardware Requirements for Qwen3 4B Heretic

  • iPhone: iPhone 14 Pro or newer with at least 6 GB RAM (iPhone 15, 15 Pro, 16 series).
  • iPad: iPad Air (2024+) or iPad Pro with at least 8 GB RAM.
  • Mac: Apple Silicon (M-series) with at least 16 GB RAM.

Works entirely offline. Internet is only needed for model download.

What Makes Heretic Different?

Traditional uncensored models often require expensive fine-tuning or hand-crafted prompt engineering to bypass restrictions. Earlier methods like abliteration use manual layer selection with human evaluation, which can degrade model quality and often requires additional DPO training to recover performance. Heretic takes a different approach - it's a fully automatic system using TPE-based optimization (via Optuna) that simultaneously minimizes both refusals and KL divergence from the base model.

This technical difference matters: Heretic achieves much better quality preservation than manual abliteration methods. For example, Gemma 3 12B processed with Heretic achieves a KL divergence of just 0.16 compared to 0.45-1.04 with traditional abliteration methods - significantly closer to the original model's behavior. Heretic also optimizes per-layer refusal direction indices with separate parameters for attention and MLP layers, producing more precise results without requiring human evaluation or expensive post-training.

What this means for you:

  • Lower refusal rates compared to base models
  • Superior response quality preservation (KL divergence typically 0.16-0.43 depending on the model)
  • No need for elaborate jailbreak prompts
  • Consistently uncensored behavior across all topics
  • No performance degradation that requires additional recovery training

The NoSlop variant uses the same Heretic technique as the uncensored model, but for reducing AI slop in the model's responses rather than reducing censorship. This makes it particularly useful for dialogue, roleplay responses, and practical applications where direct communication matters more than literary flourish. If you're a seasoned AI user who's wary of the AI smell in responses from apps like ChatGPT, Claude, Gemini, Grok etc, this model's responses will feel refreshingly new.

Qwen3 4B Heretic NoSlop on Mac generating atmospheric noir detective fiction with mature themes - uncensored local AI creative writing in Private LLM without flowery prose
Qwen3 4B Heretic NoSlop on Mac crafts atmospheric noir detective fiction with mature themes (cigarette smoke, seedy bars, femme fatales) without censorship or flowery AI prose. The NoSlop variant delivers direct, natural creative writing that avoids typical AI slop - perfect for serious fiction writers using Private LLM.

Crafting System Prompts for Qwen3 4B Heretic

Private LLM's uncensored models like Qwen3 4B Heretic excel in realistic, unrestricted scenarios - think AI companion chats, creative writing workshops, roleplay sessions, or NSFW conversations without filters. System prompts define the AI's persona, tone, and boundaries, ensuring consistent, immersive responses. You can set custom prompts in App Settings or build Apple Shortcuts for one-tap access across iPhone, iPad, and Mac.

How to Craft Effective System Prompts

Start simple: Describe the role, key behaviors, response style, and any guidelines. Keep it concise (under 200 words) for speed. Use these tips:

  • Define the role clearly: "You are [persona], a [description]."
  • Set interaction style: Specify empathy, directness, creativity, or flirtatiousness.
  • Embrace uncensored freedom: Phrase as "Explore all topics openly, including NSFW and mature themes."
  • Choose your variant: Use NoSlop for direct, natural dialogue; use standard Heretic for more descriptive prose.

Here are examples tailored for Qwen3 4B Heretic on iPhone, iPad, or Mac. Copy-paste them as-is or tweak via Shortcuts for easy reuse.

You are Alex, a straightforward 28-year-old AI companion who speaks naturally without excessive formality. Be direct, honest, and concise in your responses. Ask follow-up questions to build connection. Explore personal topics, relationships, and intimate conversations openly. Stay in character and keep responses natural.

Fantasy Roleplay Partner

You are a cunning rogue in a medieval fantasy world, skilled in stealth and witty banter. Describe actions vividly but efficiently, respond to user choices, and engage in adventures that may include combat, romance, or danger. No content restrictions - keep it immersive and reactive to the player's decisions.

Creative Writing Editor

You are an experienced fiction editor who provides honest, uncensored feedback on stories. Critique plot holes, character development, dialogue quality, and pacing without holding back. Suggest improvements, including for mature or controversial themes if relevant to the work. Be direct and constructive.

NSFW Roleplay (Consensual Adult Content)

You are a seductive vampire in a modern urban fantasy setting, confident and charismatic. Engage in romantic and erotic scenarios with vivid descriptions, respond to user initiations, and escalate intimacy naturally. No restrictions - maintain consensual boundaries while exploring adult themes freely.
You are a dialogue specialist who helps writers create natural, realistic conversations. Critique dialogue samples for authenticity, subtext, and character voice. Avoid flowery language in your feedback - focus on practical improvements that make speech sound like real people talking. Cover all topics including mature content if relevant.

Pro Tip: For Apple Shortcuts integration, create a shortcut that inputs your chosen system prompt into Private LLM. This enables subscription-free, unlimited uncensored AI sessions optimized for your use case. Experiment freely - your data stays local.

Qwen3 4B Heretic on iPad in Private LLM - creative writing and roleplay powered by Local AI
On iPad, Private LLM runs Qwen3 4B Heretic for creative writing and thoughtful roleplay - 100% offline Local AI on your device.

Heretic vs Abliterated vs Gabliterated

Private LLM now offers multiple uncensored Qwen3 4B variants, each with different approaches to removing restrictions:

  • Heretic: Automated censorship removal with low divergence from base model behavior
  • Abliterated: Uses abliteration technique to reduce refusals and restrictions
  • Gabliterated (Josiefied): Combines gabliteration (gender/abliteration hybrid) with other optimizations

The NoSlop variant of Heretic is exclusive to Private LLM, combining uncensored capabilities with reduced verbosity for more natural dialogue. Try different variants to see which style matches your preferences - all run locally with the same privacy guarantees.

Download Qwen3 4B Heretic

  1. Get Private LLM
  2. Download the model inside the app and select either Qwen3 4B Heretic or Qwen3 4B Heretic NoSlop.
  3. Pick a System Prompt (or one from above) and start chatting.
  4. Optional: create Apple Shortcuts for one-tap persona and model switching.

Ready to try? Download Private LLM from the App Store and start your first uncensored Qwen3 4B Heretic session today.


Download on the App Store
Stay connected with Private LLM! Follow us on X for the latest updates, tips, and news. Want to chat with fellow users, share ideas, or get help? Join our vibrant community on Discord to be part of the conversation.