How NetEase Games achieved 30-second LLM cold starts on Kubernetes
At NetEase Games, we learned a hard lesson about large language model (LLM) inference in production: elastic compute is only useful if data can move just as fast.
How to get engineering time back from Kubernetes upgrades
Kubernetes powers your products, but with that power and flexibility come organizational challenges around managing complexity and maintenance. It can be tough for an organization to keep up with the speed of open source, especially at…
Benchmarking AI agent retrieval strategies on Kubernetes bug fixes
I’ve been using AI coding agents as part of my daily engineering workflow and wanted to understand how well they actually perform on real-world bugs. To test this, I ran a series of structured experiments using…
The New Stack: “Fresh data has us asking, does AI demand Kubernetes?”
A new report reveals Kubernetes’ central role in AI adoption, while highlighting how engineering best practices, platform maturity, and guardrails are critical to managing complexity, security, and scale.
AI sandboxing is having its Kubernetes moment
Recently, Anthropic announced that its new model, Mythos, had autonomously found and exploited zero-day vulnerabilities in every major operating system and web browser – including a 27-year-old bug that had survived decades of human review and…
Kubernetes for platform teams: Leveraging k0s and k0rdent
In our previous blog, we explored a GitOps use case for on-premises infrastructure, managing multiple clusters hosted on the k3s Kubernetes distribution using k0rdent. But the platform engineering ecosystem is vast, and one blog barely scratches…
Techzine: “Kubernetes v1.36 enhances security and AI support”
The Kubernetes project has released version 1.36 with 70 improvements, comprising 18 stable features, 25 beta features, and 25 alpha features. The release focuses primarily on improved access control, visibility into hardware failures, and native support…
Cloud Native Now: “Kubernetes v1.36 Promotes Stability, Compatibility & Reproducibility”
Kubernetes v1.36 is here. Like previous releases, this Spring 2026 iteration introduces new stable, beta, and alpha features.
Auto-diagnosing Kubernetes alerts with HolmesGPT and CNCF tools
What a two-person SRE team learned building an AI investigation pipeline. Spoiler: the runbooks mattered more than the model. Why we built this: At STCLab, our SRE team supports multiple Amazon EKS clusters running high-traffic production…