Search results for: kubernetes


Welcome llm-d to the CNCF: Evolving Kubernetes into SOTA AI infrastructure

Posted on March 24, 2026 | Carlos Costa (IBM Research), Clayton Coleman (Google), and Rob Shaw (Red Hat)

We are thrilled to announce that llm-d has officially been accepted as a Cloud Native Computing Foundation (CNCF) Sandbox project! As generative AI transitions from research labs to production environments, platform engineering teams are facing a…


Policy-as-Code: Flexible Kubernetes governance with Kyverno

Posted on March 19, 2026 | Dahu Kuang, Lei Hou, and Shuting Zhao, Kyverno Project Maintainers

Overview: Kubernetes has fundamentally transformed how enterprises deploy and manage business workloads. As organizations build production applications at scale on Kubernetes, cluster size and complexity continue to grow, creating unprecedented challenges in ensuring cluster security, compliance, and…


Understanding Kubernetes metrics: Best practices for effective monitoring

Posted on March 18, 2026 | Sam Suthar, Middleware

Kubernetes metrics show cluster activity. You need them to manage Kubernetes clusters, nodes, and applications. Without them, it is harder to find problems and improve performance. This post will explain what Kubernetes metrics are,…


When Kubernetes restarts your pod — And when it doesn’t

Posted on March 17, 2026 | Shamsher Khan, Project Maintainer

A production internals guide verified against Kubernetes 1.35 GA. Companion repository: github.com/opscart/k8s-pod-restart-mechanics. The terminology problem: Engineers say “the pod restarted” when they mean four different things. Getting this wrong leads to flawed runbooks and bad on-call decisions….


Registry Mirror Authentication with Kubernetes Secrets

Posted on March 16, 2026 | Sascha Grunert, Red Hat

Part II: A Platform Integration Example. In Part I, we explored the architecture of the CRI-O credential provider and walked through a manual setup. In this part, we’ll see how platforms like OpenShift and its upstream…


Making etcd incidents easier to debug in production Kubernetes

Posted on March 12, 2026 | Natalie Fisher and Benjamin Wang, Broadcom

Diagnosing and Recovering etcd: Practical tools for Kubernetes Operators. When Kubernetes clusters experience serious issues, the symptoms are often vague but the impact is immediate. Control plane requests slow down. API calls begin to time out….


Registry mirror authentication with Kubernetes secrets

Posted on March 9, 2026 | Sascha Grunert, Red Hat

Part I: Architecture and Implementation. In production Kubernetes clusters, pulling container images from private registries happens thousands of times per day. Kubernetes distributions from major cloud vendors provide credential providers for their respective registries like AWS…


The great migration: Why every AI platform is converging on Kubernetes

Posted on March 5, 2026 | Vara Bonthu, Amazon Web Services Inc.

When Kubernetes launched a decade ago, its promise was clear: make deploying microservices as simple as running a container. Fast forward to 2026, and Kubernetes is no longer “just” for stateless web services. In the CNCF…


KubeCon + CloudNativeCon Europe 2026 Co-located Event Deep Dive: Kubernetes on Edge Day

Posted on March 2, 2026 | Co-chairs: Katerina Arzhayev, Mars Toktonaliev

Kubernetes on Edge Day returns to KubeCon + CloudNativeCon Europe 2026 with a continued focus on where cloud native technologies meet the realities of distributed, resource-constrained, and often unpredictable environments. First launched at KubeCon + CloudNativeCon…


Kubernetes WG Serving concludes following successful advancement of AI inference support

Posted on February 26, 2026 | Yuan Tang, on behalf of Kubernetes WG Serving Co-Chairs

The Kubernetes Working Group (WG) Serving was created to support development of the AI inference stack on Kubernetes. The goal of this working group was to ensure that Kubernetes is an orchestration platform of choice for…