Qwythos-9B: Community Qwen Merge for Creative Writing and Roleplay

Local LLMs

June 23, 2026 · 5 min read

Curated by Oleksandr Kuzmenko, AI Product Engineer · Updated June 23, 2026

AI-assisted · editor-reviewed


Qwythos-9B-Claude-Mythos-5 is a community model that aims to bring Claude-style expressive prose and long-form narrative generation to local setups. It is built with a passthrough merge recipe that expands Qwen2.5-7B into a 9-billion-parameter model, and it is tuned for creative writing and conversational roleplay.

Impact: Medium

Why it matters

Developers can run an expressive narrative model locally using GGUF quantization without relying on cloud-based Claude APIs.

TL;DR

01 Combines Qwen2.5-7B's multilingual capability with a Claude-inspired prose style.
02 Uses a YAML-defined passthrough merge recipe, hosted on GitHub, to increase the parameter count without training from scratch.
03 GGUF quantization makes it possible to run an expressive model locally on consumer hardware with 8GB of VRAM.

Key facts

Base Model: Qwen2.5-7B-Instruct
Parameters: 9.02 Billion (via passthrough merge)
Quantization Formats: FP16, GGUF (Q4_K_M, Q5_K_M, Q8_0)
Primary Use Case: Creative writing, conversational roleplay

Architectural Merge and Recipe

Qwythos-9B is built with a passthrough merge (often called a frankenmerge) that is configured in YAML and hosted on GitHub. Because no official Qwen 9B base exists, the recipe does not start from a native 9B model; instead it slices and duplicates specific intermediate layers of the Qwen2.5-7B-Instruct architecture. The duplication adds decoder layers, which raises the parameter count and the model's depth without any training from scratch. The added depth is intended to help with stylistic tasks, narrative pacing in particular.
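As a rough illustration of the layer arithmetic, the Python sketch below counts parameters for a Qwen2.5-7B-style decoder before and after duplicating a block of layers. The dimensions are assumptions taken from the publicly listed Qwen2.5-7B configuration, and the slice ranges are hypothetical; the actual Qwythos recipe may slice the model differently.

```python
# Rough parameter count for a layer-duplicating (passthrough) merge.
# ASSUMPTION: these dimensions follow the public Qwen2.5-7B config.
HIDDEN = 3584          # hidden size
INTERMEDIATE = 18944   # MLP intermediate size
NUM_HEADS = 28         # query heads
NUM_KV_HEADS = 4       # key/value heads (grouped-query attention)
HEAD_DIM = 128
VOCAB = 152064

# HYPOTHETICAL slices, not the real recipe: half-open [start, end) layer
# ranges taken from the 28-layer base model. Layers 14..19 appear twice.
HYPOTHETICAL_SLICES = [(0, 20), (14, 28)]


def params_per_layer() -> int:
    """Parameters in one decoder layer (attention + MLP + two norms)."""
    q_dim = NUM_HEADS * HEAD_DIM
    kv_dim = NUM_KV_HEADS * HEAD_DIM
    q = HIDDEN * q_dim + q_dim        # weight + bias
    k = HIDDEN * kv_dim + kv_dim      # weight + bias
    v = HIDDEN * kv_dim + kv_dim      # weight + bias
    o = q_dim * HIDDEN                # output projection, no bias
    mlp = 3 * HIDDEN * INTERMEDIATE   # gate, up, and down projections
    norms = 2 * HIDDEN                # two RMSNorm weight vectors
    return q + k + v + o + mlp + norms


def total_params(num_layers: int) -> int:
    """Whole-model parameter count for a given number of decoder layers."""
    embeddings = 2 * VOCAB * HIDDEN   # untied input embedding + output head
    final_norm = HIDDEN
    return embeddings + final_norm + num_layers * params_per_layer()


def merged_layer_count(slices) -> int:
    """Number of layers after stacking the slices end to end."""
    return sum(end - start for start, end in slices)


if __name__ == "__main__":
    layers = merged_layer_count(HYPOTHETICAL_SLICES)
    print(f"base:   28 layers, {total_params(28) / 1e9:.2f}B parameters")
    print(f"merged: {layers} layers, {total_params(layers) / 1e9:.2f}B parameters")
```

Under these assumed values, the 28-layer base comes to about 7.62B parameters and a 34-layer stack to about 9.01B, which is in the same range as the 9.02B figure quoted above. That is only a plausibility check, not a confirmation of how the merge was actually configured.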

Datasets and Stylistic Fine-Tuning

To achieve its distinct prose, the merged model is aligned with custom datasets styled after Anthropic's Claude and the community-favorite Mythos roleplay models. Compared with vanilla instruction models, which tend to be concise or analytical, the resulting model is more willing to produce longer passages, richer description, and complex dialogue formatting.

Local Execution and GGUF Support

Thanks to the community-provided GGUF quantizations, you can run Qwythos-9B locally. Quantized variants such as Q4_K_M and Q5_K_M trade a small loss in output coherence for a much smaller memory footprint. Running one of these variants in llama.cpp or Ollama requires approximately 6GB to 8GB of VRAM, which puts the model within reach of consumer GPUs with 8GB of memory.
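The VRAM range can be sanity-checked with a back-of-the-envelope estimate of quantized weights plus KV cache. The sketch below is illustrative only: the bits-per-weight figures are approximate averages for llama.cpp quantization types, and the layer count and attention shape (34 layers, 4 KV heads, head dimension 128) are assumptions rather than confirmed properties of this model. Compute buffers and runtime overhead are not included.

```python
# Back-of-the-envelope memory estimate: quantized weights plus KV cache.
# ASSUMPTION: approximate average bits per weight for each format.
APPROX_BITS_PER_WEIGHT = {
    "Q4_K_M": 4.85,
    "Q5_K_M": 5.7,
    "Q8_0": 8.5,
    "FP16": 16.0,
}

GIB = 1024 ** 3


def weights_gib(num_params: float, quant: str) -> float:
    """Size of the model weights in GiB for a given quantization."""
    return num_params * APPROX_BITS_PER_WEIGHT[quant] / 8 / GIB


def kv_cache_gib(
    context_tokens: int,
    num_layers: int = 34,      # ASSUMED depth of the merged model
    num_kv_heads: int = 4,     # ASSUMED, Qwen2.5-7B-style GQA
    head_dim: int = 128,       # ASSUMED
    bytes_per_value: int = 2,  # FP16 cache entries
) -> float:
    """Size of the key/value cache in GiB at a given context length."""
    # Two tensors (keys and values) per layer, per token.
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_value
    return context_tokens * per_token / GIB


def estimate_gib(num_params: float, quant: str, context_tokens: int) -> float:
    """Weights plus KV cache, ignoring compute buffers and overhead."""
    return weights_gib(num_params, quant) + kv_cache_gib(context_tokens)


if __name__ == "__main__":
    for quant in APPROX_BITS_PER_WEIGHT:
        total = estimate_gib(9.02e9, quant, context_tokens=8192)
        print(f"{quant}: ~{total:.1f} GiB")
```

With these assumptions, Q4_K_M and Q5_K_M land between roughly 5.5 and 6.5 GiB at an 8K context before overhead, which is consistent with the 6GB to 8GB range above, while Q8_0 and FP16 come out above 8 GiB.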

Try it in 2 minutes

# Download the GGUF model using huggingface-cli
huggingface-cli download mradermacher/Qwythos-9B-Claude-Mythos-5-GGUF Qwythos-9B-Claude-Mythos-5.Q4_K_M.gguf --local-dir .

# Run using llama.cpp
./llama-cli -m Qwythos-9B-Claude-Mythos-5.Q4_K_M.gguf -p "Write a descriptive sci-fi narrative about an autonomous mining probe:" -n 256

✓ When to use

When you need rich, descriptive creative writing and narrative styling without paying for API calls.
When hosting roleplay or narrative agents locally on consumer-grade hardware (8GB VRAM).

✕ When NOT to use

When you need precise coding, math solving, or strictly structured JSON output.
When high-throughput enterprise API performance is required.

What to do today

Clone the YAML merge recipe from GitHub to inspect layer configurations.
Download the Q4_K_M GGUF weights from Hugging Face for a balanced local run.
Load the model into llama.cpp or Ollama and test with descriptive writing prompts.
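For the Ollama route in the last step, a local GGUF file is registered through a Modelfile. The sketch below generates one as plain text. The sampling values, context size, and system prompt are hypothetical starting points, and the model tag in the comments is made up for illustration.

```python
# Sketch: build the text of an Ollama Modelfile for a local GGUF file.
# The default temperature, context size, and system prompt are
# HYPOTHETICAL values chosen for illustration, not recommended settings.


def build_modelfile(
    gguf_path: str,
    temperature: float = 0.9,
    num_ctx: int = 8192,
    system: str = "You are a vivid, descriptive storyteller.",
) -> str:
    """Return Modelfile text that points Ollama at a local GGUF file."""
    lines = [
        f"FROM {gguf_path}",
        f"PARAMETER temperature {temperature}",
        f"PARAMETER num_ctx {num_ctx}",
        f'SYSTEM """{system}"""',
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Save the printed text to a file named Modelfile, then run:
    #   ollama create qwythos-local -f Modelfile   (tag name is hypothetical)
    #   ollama run qwythos-local
    print(build_modelfile("./Qwythos-9B-Claude-Mythos-5.Q4_K_M.gguf"), end="")
```

Lowering `num_ctx` reduces the memory reserved for the KV cache, which may help if the model does not fit alongside other GPU workloads.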

#Qwen #Claude #llama.cpp #Ollama
