AI Today Brief

The daily AI-engineering brief. Built in public. EN · UA.

Local LLMs

Qwythos-9B: Community Qwen Merge for Creative Writing and Roleplay

June 23, 2026 · 5 min read
Curated by Oleksandr Kuzmenko, AI Product Engineer · Updated June 23, 2026 · Sources cited on every story
AI-assisted · editor-reviewed
Qwythos-9B-Claude-Mythos-5 is a community-driven model designed to bring Claude-style expressive prose and deep narrative generation to local setups. Built using a passthrough merge recipe that expands Qwen2.5-7B into a 9-billion parameter model, it is optimized for creative writing and conversational roleplay.

Impact: Medium

Why it matters

Developers can run an expressive narrative model locally using GGUF quantization without relying on cloud-based Claude APIs.

TL;DR

  • Combines Qwen2.5-7B's multilingual capability with Claude-style prose.
  • Uses a YAML-defined passthrough merge recipe, published on GitHub, to scale the parameter count without training from scratch.
  • GGUF quantization allows running a highly expressive model locally on typical consumer hardware with 8GB of VRAM.

Key facts

  • Base Model: Qwen2.5-7B-Instruct
  • Parameters: 9.02 billion (via passthrough merge)
  • Quantization Formats: FP16, GGUF (Q4_K_M, Q5_K_M, Q8_0)
  • Primary Use Case: Creative writing, conversational roleplay

Architectural Merge and Recipe

Qwythos-9B is built with a passthrough merge (often called a frankenmerge), configured via a YAML recipe hosted on GitHub. No official Qwen 9B base model exists, so the recipe slices and duplicates specific intermediate layers of Qwen2.5-7B-Instruct instead of starting from a native 9B base. The expansion adds network depth (more layers, not a longer context window) and by itself involves no new training; the stated aim is better handling of stylistic tasks, narrative pacing in particular.
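As a rough illustration of what such a recipe can look like, the sketch below writes a passthrough config in the format used by the mergekit tool and sums the layer ranges to get the merged depth. This is an assumption-heavy example, not the published recipe: the use of mergekit, the file name `qwythos-sketch.yaml`, the layer ranges, and the `dtype` are all hypothetical. The only external fact it relies on is that Qwen2.5-7B-Instruct has 28 transformer layers.

```shell
# Hypothetical passthrough recipe in mergekit's YAML format.
# The layer ranges below are illustrative, NOT the actual Qwythos recipe.
recipe="qwythos-sketch.yaml"

cat > "$recipe" <<'EOF'
slices:
  - sources:
      - model: Qwen/Qwen2.5-7B-Instruct
        layer_range: [0, 20]
  - sources:
      - model: Qwen/Qwen2.5-7B-Instruct
        layer_range: [14, 28]
merge_method: passthrough
dtype: bfloat16
EOF

# Sum (end - start) over every layer_range line to get the merged depth.
# gsub blanks out everything except digits and spaces, leaving "start end".
total_layers=$(awk '/layer_range/ { gsub(/[^0-9 ]/, " "); sum += $2 - $1 } END { print sum }' "$recipe")

echo "merged depth: $total_layers layers (base model: 28)"
```

With these made-up ranges, layers 14 to 19 appear twice, so the merged stack has 34 layers instead of 28. Duplicating blocks this way grows the parameter count without touching any weights, which is why a merged model of this kind is usually fine-tuned afterwards.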

Datasets and Stylistic Fine-Tuning

To achieve its distinct prose, the merged base is aligned with custom datasets styled after Anthropic's Claude and the community-favorite Mythos roleplay models. Compared to vanilla instruction models, which tend to be concise or analytical, the resulting model produces longer passages, uses more descriptive language, and handles complex dialogue formatting.

Local Execution and GGUF Support

Community-provided GGUF quantizations make it possible to run Qwythos-9B locally. Quantized variants such as Q4_K_M or Q5_K_M balance memory footprint against output coherence. Running one of these variants with llama.cpp or Ollama requires approximately 6GB to 8GB of VRAM, which puts it within reach of consumer and laptop GPUs with 8GB of memory.
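The VRAM figure can be sanity-checked with back-of-the-envelope arithmetic: weight size is roughly parameters times bits per weight, divided by 8. The sketch below assumes approximate bits-per-weight values for each GGUF format (about 4.85 for Q4_K_M, about 5.7 for Q5_K_M, 8.5 for Q8_0); these are rounded estimates, the helper name `estimate_gb` is made up for this example, and the result covers weights only, not the KV cache or runtime overhead.

```shell
# estimate_gb PARAMS_IN_BILLIONS BITS_PER_WEIGHT
# Prints the approximate size of the weights in GB (1 GB = 1e9 bytes).
estimate_gb() {
  LC_ALL=C awk -v p="$1" -v b="$2" 'BEGIN { printf "%.1f\n", p * b / 8 }'
}

# Bits-per-weight values are approximations, not exact format constants.
for entry in "Q4_K_M 4.85" "Q5_K_M 5.7" "Q8_0 8.5"; do
  set -- $entry   # split into: $1 = format name, $2 = bits per weight
  printf '%-7s ~%s GB of weights\n' "$1" "$(estimate_gb 9.02 "$2")"
done
```

Under these assumptions the Q4_K_M and Q5_K_M weights land between roughly 5.5 and 6.5 GB, which is consistent with the 6GB to 8GB range once context memory is added. Q8_0 comes out well above 8 GB, so it is the variant to skip on an 8GB card.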

Try it in 2 minutes

```bash
# Download the GGUF model using huggingface-cli
huggingface-cli download mradermacher/Qwythos-9B-Claude-Mythos-5-GGUF Qwythos-9B-Claude-Mythos-5.Q4_K_M.gguf --local-dir .

# Run using llama.cpp
./llama-cli -m Qwythos-9B-Claude-Mythos-5.Q4_K_M.gguf -p "Write a descriptive sci-fi narrative about an autonomous mining probe:" -n 256
```
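For Ollama, one possible route is to wrap the downloaded GGUF file in a Modelfile and register it under a local name. The sketch below assumes the GGUF file from the download step sits in the current directory; the tag `qwythos-local` and the two `PARAMETER` values are illustrative choices for creative writing, not settings recommended by the model authors.

```shell
gguf="Qwythos-9B-Claude-Mythos-5.Q4_K_M.gguf"   # file from the download step
model_name="qwythos-local"                      # hypothetical local tag

# Write a minimal Modelfile that points Ollama at the local GGUF weights.
# The sampling temperature and context size are illustrative values.
cat > Modelfile <<EOF
FROM ./$gguf
PARAMETER temperature 0.9
PARAMETER num_ctx 4096
EOF

# Only call Ollama when both the binary and the weights are present.
if command -v ollama >/dev/null 2>&1 && [ -f "$gguf" ]; then
  ollama create "$model_name" -f Modelfile
  ollama run "$model_name" "Write a descriptive sci-fi narrative about an autonomous mining probe:"
else
  echo "Skipping: need both the ollama binary and $gguf in this directory."
fi
```

A smaller `num_ctx` reduces KV-cache memory, which can help if the model does not fit alongside other workloads on an 8GB GPU.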

✓ When to use

  • When you need rich, descriptive creative writing and narrative styling without paying for API calls.
  • When hosting roleplay or narrative agents locally on consumer-grade hardware (8GB VRAM).

✕ When NOT to use

  • Not recommended for precise coding tasks, math solving, or strictly structured JSON outputs.
  • When high-throughput enterprise API performance is required.

What to do today

  • → Clone the YAML merge recipe from GitHub to inspect layer configurations.
  • → Download the Q4_K_M GGUF weights from Hugging Face for a balanced local run.
  • → Load the model into llama.cpp or Ollama and test with descriptive writing prompts.

#Qwen #Claude #llama.cpp #Ollama

Sources

  • vsecoder/Qwythos-9B-Claude-Mythos-5 on Hugging Face
  • mradermacher/Qwythos-9B-Claude-Mythos-5-GGUF on Hugging Face
  • vsecoder/Qwythos-Merge-Recipe on GitHub