Search results for: kubernetes


TFIR: “How Kubernetes 1.36 Handles GPU Scheduling, DRA, and Kubelet Security | Ryota Sawada, Kubernetes”

Posted on May 27, 2026

Kubernetes 1.36 adds native GPU scheduling via Workload Aware Scheduling and DRA, plus stable fine-grained Kubelet authorization. Ryota Sawada, Release Lead, explains what changed.


GPU autoscaling on Kubernetes with KEDA: Building an external scaler

Posted on May 27, 2026 | Pavan Madduri (Senior Cloud Platform Engineer @ Grainger | CNCF Golden Kubestronaut)

If you run GPU workloads on Kubernetes — vLLM, Triton, training jobs, or the newer agentic inference stacks — you’ve probably hit a familiar problem: the default autoscaling path still reasons about CPU and memory, while…


Why Kubernetes policy enforcement happens too late—and what to do about it

Posted on May 25, 2026 | Sajal Nigam, CNCF Community Member

Kubernetes has become the backbone of modern cloud-native infrastructure. Its flexibility lets teams move fast, compose complex systems from modular components, and deploy across environments with relative ease. But that flexibility comes with a well-known cost:…


How NetEase Games achieved 30-second LLM cold starts on Kubernetes

Posted on May 21, 2026 | Haifeng Liao, Senior Infrastructure Engineer at NetEase Games and Xiang Zhang, Head of AI Infrastructure at NetEase Games

At NetEase Games, we learned a hard lesson about large language model (LLM) inference in production: elastic compute is only useful if data can move just as fast. …


How to get engineering time back from Kubernetes upgrades

Posted on May 11, 2026 | Munib Ali, Director of Engineering, SRE Fairwinds

Kubernetes powers your products, but with that power and flexibility come organizational challenges around managing complexity and maintenance. It can be tough for an organization to keep up with the speed of open source, especially at…


Benchmarking AI agent retrieval strategies on Kubernetes bug fixes

Posted on May 8, 2026 | Brandon Foley

I’ve been using AI coding agents as part of my daily engineering workflow and wanted to understand how well they actually perform on real-world bugs. To test this, I ran a series of structured experiments using…



The New Stack: “Fresh data has us asking, does AI demand Kubernetes?”

Posted on May 1, 2026

A new report reveals Kubernetes’ central role in AI adoption, while highlighting how engineering best practices, platform maturity, and guardrails are critical to managing complexity, security, and scale.


AI sandboxing is having its Kubernetes moment

Posted on April 30, 2026 | Jed Salazar, Field CTO, Edera

Recently, Anthropic announced that its new model, Mythos, had autonomously found and exploited zero-day vulnerabilities in every major operating system and web browser – including a 27-year-old bug that had survived decades of human review and…


Kubernetes for platform teams: Leveraging k0s and k0rdent

Posted on April 27, 2026 | Prithvi Raj (CNCF Ambassador) & Shivani Rathod (Bacancy Technology)

In our previous blog, we explored a GitOps use case for on-premises infrastructure, managing multiple clusters hosted on the k3s Kubernetes distribution using k0rdent. But the platform engineering ecosystem is vast, and one blog barely scratches…