We built moderation for an open, anonymous, multilingual wall — a cheap layered cascade, ~$0 on APIs. Then someone tried to bypass it with a ROT13-encoded jailbreak, and the LLM judge saw through it. Here's the design and what r/selfhosted poked at.
SEO Week 2026 reframed search as a math problem — vector distances, entropy reduction, entity graphs, brand-as-centroid. We kept 3 ideas, cut 4, and patched our site this morning to ship them. Here's what worked.
Trained a DoRA adapter on 6128 personal Telegram messages. $1.50 on a single Vast.ai RTX 3090. Result: 100% blind A/B win vs stock Qwen3-8B. Zero catastrophic forgetting. One prompt where DoRA beat the real human at sounding like themselves.
We tested Microsoft BitNet b1.58 on a base M2. Metal gives 12 tokens/s. CPU-only produces gibberish. The real value of 1.58-bit is RAM, not speed.
A systematic study: 5 pipeline variants, training data leakage via Cyrillic text, and why the "sandwich" approach is a workaround, not a fix.
Three FLUX.2-klein LoRAs (space art, ukiyo-e, logos) and a personal DoRA adapter on Qwen3-8B. With training configs and repro steps.
Why "add a chatbot" and "put an agent inside the process" are two different jobs with different outcomes.
How to turn "we want AI" into "we locked the number and we own it". Step by step.
The model is 10% of the project. The other 90% is data, integrations, and SLAs. Why the teams that understand this win.
Not mysticism, not a PR slogan. Just math: where exactly in the business AI takes load off, and how to measure it.