/lm.png
Fractional CTO for Hardware-Enabled SaaS & Operations
Helping founders ensure their technology can handle the chaos of real-world operations.
I connect Physical and Digital Assets to deliver Business Outcomes.

Let's put that straight - Holo-3.1-35B-A3B sucks at spelling.

I am impressed by how that model can navigate web pages and analyze data. Unfortunately, when that model generates a report, it makes even more spelling mistakes than I do. On the other hand, Bielik-11B-v3.0-Instruct is not as smart and capable as the Holo model - there is no way I would use it as the main “brain” of my local agent. But guess what? There is a way to make those two models cooperate in a way that each acts in the area it absolutely dominates!

How fast must your local AI be?

How fast must your local AI be? The obvious answer is: my local AI must be as fast and as smart as the one offered by OpenAI (or other commercial vendor). "I want the best quality, and I want it now!" Ok, I get that, but let’s focus on the business reality for a sec. Probably, you won’t be able to match the performance of commercial AI providers. And that is totally fine.

Hype vs business outcomes

Not always the latest, greatest, most hyped solutions deliver the best business outcomes. Often, it is hard to measure the actual impact of the “new shiny thing” on the business. Everyone writes about the MTP (Multi-Token Prediction) and how it improves the LLM performance. I wanted to leverage it to boost my local AI development team. My business case was the following: I wanted to switch from Qwen3-Coder-Next-UD-Q4_K_XL to Qwen3.6-27B-MTP-UD-Q4_K_XL for local agentic coding.

Open source models are not capable of generating enterprise-class solutions.

That is the common belief I see on the internet. I do not agree with that and have a process to prove otherwise. Currently, open-source models that can run on affordable hardware are not as capable as commercial AI models running in huge data centers. Everyone knows it, and I do not argue with that. My point is that it is perfectly possible to use open-source models running on local hardware to generate enterprise-class solutions that directly impact business operations and provide tangible business results.

Local AI, enterprise-grade code

I use local AI agents to build enterprise-grade software, not fancy demos. After testing all major models that can run on affordable local hardware, I am convinced that the Qwen3-Coder-Next is the best option. It might not be the “most fancy” model, it might not design “most outstanding” websites, but (typically) it does deliver enterprise-grade outcomes. Yes, I am aware Qwen3-Coder-Next is not as good as II from OpenAI or Anthropic.

Lessons from the Factory Floor: Scaling from Prototype to Mass Production in China

I interviewed Hans Stam about practical lessons from building hardware in China versus Europe and why “hardware is hard” but becomes manageable when you understand the process. Hans explains how Chinese factory ecosystems move from prototype to mass production. The conversation covers pitfalls in Europe’s decision-by-committee culture, critical-path project planning, choosing certification partners, selecting Chinese factories, CapEx vs. OpEx trade-offs, component shortages, and many other topics. 👉 Let me know if those topics are relevant to your business.