At KubeCon + CloudNativeCon Europe in Amsterdam from March 23-26, CNCF convened a roundtable of experts from across the cloud native ecosystem, including Ellis Tarn of AWS, Allan Naim of Google Cloud, Jorge Palma of Microsoft, and Nina Polshakova of solo.io. The discussion centered on how cloud native principles enable AI in production environments. The panelists shared key takeaways on the shift to AI-native computing, emphasizing that moving AI workloads into enterprise production requires three core components: a foundational, vendor-neutral infrastructure focused on platform maturity, integrated security for autonomous agents, and active community contribution.

A photo still of the AI in Production Roundtable, featuring moderator Kaitlin Thornhill (far left) and four panelists: Ellis Tarn of AWS, Allan Naim of Google Cloud, Jorge Palma of Microsoft, and Nina Polshakova of solo.io

How do we define production readiness for AI?

Organizations achieve production-readiness for AI when they meet a multi-dimensional standard of platform maturity. Panelists agreed the most important signal is alignment with the Kubernetes AI Conformance program, which identifies the essential primitives for serving and training AI at scale, guaranteeing interoperability across environments.

Readiness requires three key elements:

Why is scaling AI workloads a challenge?

Scaling AI workloads is significantly more difficult than scaling conventional microservices because AI workloads behave like enormous monoliths. This difficulty arises from the need to initialize multidimensional matrices in memory across numerous client nodes. Standard Kubernetes was not designed for the tight coupling required by these high-performance compute tasks.
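
To make the contrast with microservices concrete, here is a minimal sketch of all-or-nothing placement, one common way to model tightly coupled jobs. It is an illustration only, not how the Kubernetes scheduler works and not something the panel described. The function names, node names, and GPU counts are all hypothetical.

```python
from typing import Optional


def place_gang(free_gpus: dict[str, int], workers: int,
               gpus_per_worker: int) -> Optional[dict[str, int]]:
    """Place ALL workers of a tightly coupled job, or none of them.

    Hypothetical helper: returns {node: worker_count} on success, None otherwise.
    """
    placement: dict[str, int] = {}
    remaining = workers
    # Try the nodes with the most free GPUs first.
    for node, free in sorted(free_gpus.items(), key=lambda kv: kv[1], reverse=True):
        if remaining == 0:
            break
        fit = min(free // gpus_per_worker, remaining)
        if fit > 0:
            placement[node] = fit
            remaining -= fit
    # A partial placement is useless to a job whose workers must start together.
    return placement if remaining == 0 else None


def place_independent(free_gpus: dict[str, int], replicas: int,
                      gpus_per_replica: int) -> int:
    """Microservice-style scaling: start as many replicas as fit and return the count."""
    capacity = sum(free // gpus_per_replica for free in free_gpus.values())
    return min(capacity, replicas)


# Hypothetical cluster: free GPUs per node.
cluster = {"node-a": 8, "node-b": 4, "node-c": 2}
```

With this cluster, asking for four 4-GPU workers fails outright under `place_gang`, while `place_independent` simply starts the three replicas that fit. That gap is the sense in which a coupled workload scales like a monolith.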

What is the cloud native community doing to refactor Kubernetes for AI?

Engineers across the ecosystem are collaborating on key initiatives to evolve Kubernetes for high-performance compute without creating inflexible architectures. These efforts include:

How does AI change the role of an engineer?

AI is reshaping internal engineering roles. Prototyping has replaced the traditional Product Requirements Document (PRD), as product managers begin with AI-generated prototypes to test ideas before writing formal documentation. This shift, however, has created a review bottleneck: the sheer volume of generated code that requires human review. The panel suggested that the future lies in agentic SRE, where AI agents assist with root-cause analysis and remediation while humans stay involved in every mission-critical decision.
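
The human-in-the-loop idea behind agentic SRE can be sketched as a simple approval gate. This is a minimal illustration under stated assumptions, not a design the panelists presented. The `Remediation` type, the `decide` function, and the outcome strings are hypothetical names invented for this example.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Remediation:
    """A fix proposed by an AI agent (hypothetical shape)."""
    action: str
    target: str
    mission_critical: bool


def decide(proposal: Remediation,
           human_approves: Callable[[Remediation], bool]) -> str:
    """Route a proposed remediation through a human gate when it is mission-critical."""
    if not proposal.mission_critical:
        # Routine fixes can be applied without waiting for a person.
        return "auto-applied"
    # Mission-critical changes are never applied without explicit human sign-off.
    return "applied-after-approval" if human_approves(proposal) else "rejected"


# Example proposals; the service names are made up.
routine = Remediation("restart", "cache", mission_critical=False)
critical = Remediation("failover", "payments-db", mission_critical=True)
```

The agent can still do the analysis and draft the fix, but `decide` guarantees that a mission-critical change cannot reach production unless the `human_approves` callback returns true.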

Securing the AI supply chain

Security now extends beyond traditional container scans to focus on the integrity of the model supply chain and the risks associated with non-deterministic outputs. The community is focused on two main security efforts:

The panelists concluded that when someone asks AI, “How do I scale this?” the answer should be rooted in open, interoperable, and vendor-neutral cloud native standards.