Llm

Self Hosted AI: Actually Running Local LLMs for a Multi-User Household

How self-hosted AI became the final piece of my homelab puzzle, delivering true parallel processing for multi-user setups and unlocking the real superpower of knowledge management.

Derek Armstrong

• Apr 29, 2026 • 8 min read

Homelab

Running Qwen3.6 27B Locally on Dual RTX 3090s with vLLM v0.19

How I went from a blank Docker template to 116+ tok/s with speculative decoding, FlashInfer, and a 160k context window on dual 3090s.

Derek Armstrong

• Apr 26, 2026 • 8 min read

Ai-Tools

Qwen3.5 Showdown: 27B Q8 vs 35B-A3B Q8 — Real-World Testing for Local AI

A real-world comparison of Qwen3.5 27B Q8 and 35B-A3B Q8 running locally on a dual RTX 3090 homelab — which one actually belongs in your daily workflow?

Derek Armstrong

• Apr 5, 2026 • 6 min read

Python

Pydantic and Pydantic-AI: Type Safety That Actually Earns Its Keep

Pydantic is one of those libraries I underestimated until the day it saved me four hours of debugging. Here's what it actually does, where it hurts, and why Pydantic-AI has me …

Derek Armstrong

• Apr 9, 2025 • 7 min read

Ai-Tools

Run your own AI LLM in two commands

Set up your own AI chatbot locally using Meta's Llama model and Docker in just two commands

Derek Armstrong

• Apr 23, 2024 • 1 min read