GitOps policy-as-code: Securing Kubernetes with Argo CD and Kyverno
A hands-on guide to deploying Kyverno with Argo CD and enforcing custom policies As Kubernetes environments develop, GitOps with Argo CD has become the standard for declarative, self-healing infrastructure. Yet without guardrails for your deployments, misconfigured,…
LLMs on Kubernetes Part 1: Understanding the threat model
Let’s say you’ve got an LLM running on Kubernetes. Pods are healthy, logs are clean, users are chatting. Everything looks fine. But here’s the thing: Kubernetes is great at scheduling workloads and keeping them isolated. It…
CNCF On-Demand: k0rdent — Celebrating One Year of Orchestrating Multi- Cluster Kubernetes Platforms
Managing Kubernetes across many clusters and environments introduces challenges around consistency, lifecycle automation, observability, and cost visibility. This session explores architectural patterns and open source tooling used to manage Kubernetes fleets at scale. We’ll look at…
CNCF Nearly Doubles Certified Kubernetes AI Platforms
Following an over 70% surge in certified offerings, the program now includes agentic workflows and introduces stricter v1.35 requirements to eliminate infrastructure fragmentation Key Highlights KUBECON + CLOUDNATIVECON EUROPE, AMSTERDAM—24 MARCH, 2026— The Cloud Native Computing…
Welcome llm-d to the CNCF: Evolving Kubernetes into SOTA AI infrastructure
We are thrilled to announce that llm-d has officially been accepted as a Cloud Native Computing Foundation (CNCF) Sandbox project! As generative AI transitions from research labs to production environments, platform engineering teams are facing a…
Policy-as-Code: Flexible Kubernetes governance with Kyverno
Overview Kubernetes has fundamentally transformed how enterprises deploy and manage business workloads. As organizations build production applications at scale on Kubernetes, cluster size and complexity continue to grow—creating unprecedented challenges in ensuring cluster security, compliance, and…
Understanding Kubernetes metrics: Best practices for effective monitoring
Kubernetes metrics show cluster activity. You need them to manage Kubernetes clusters, nodes, and applications. Without them, it also makes it harder to find problems and improve performance. This post will explain what Kubernetes metrics are,…
When Kubernetes restarts your pod — And when it doesn’t
A production internals guide verified against Kubernetes 1.35 GACompanion repository: github.com/opscart/k8s-pod-restart-mechanics The terminology problem Engineers say “the pod restarted” when they mean four different things. Getting this wrong leads to flawed runbooks and bad on-call decisions….
Registry Mirror Authentication with Kubernetes Secrets
Part II: A Platform Integration Example In Part I, we explored the architecture of the CRI-O credential provider and walked through a manual setup. In this part, we’ll see how platforms like OpenShift and its upstream…
Making etcd incidents easier to debug in production Kubernetes
Diagnosing and Recovering etcd: Practical tools for Kubernetes Operators When Kubernetes clusters experience serious issues, the symptoms are often vague but the impact is immediate. Control plane requests slow down. API calls begin to time out….