← /blog/tags #kubernetes
5 posts tagged.
01 Why your Kubernetes rolling deploy still drops requests
A rolling update is not the same as zero downtime. Kubernetes will route traffic to a pod that is shutting down and restart one that is merely busy. Here is what actually makes a deploy invisible to users: readiness gating, the SIGTERM race and the preStop fix, PodDisruptionBudgets, and the ASP.NET Core and Node.js specifics. #kubernetes#DevOps#Cloud#Reliability#dotnet 2026-05-21 · kubernetes 02 How I would secure a Kubernetes cluster from day one
I ran .NET microservices on EKS in production, and we bolted security on after the fact. A fresh Kubernetes cluster trusts everything by default. Here is the day-one hardening I would never defer again: default-deny networking, least-privilege RBAC, admission control, real secrets, and runtime detection. #kubernetes#DevOps#Security#Cloud#AWS 2026-05-21 · kubernetes 03 The request is the wrong unit of scale for LLMs on Kubernetes
Your dashboard says traffic is flat while latency drifts and the GPU strains. The HTTP request is only the envelope; the real work is token processing. Why tokens, not requests, are the unit of scale for LLMs on Kubernetes. #kubernetes#LLM#Platform Engineering#AI Infrastructure#MLOps#DevOps 2026-05-21 · kubernetes 04 Building a production LLM platform on Kubernetes
I have run Kubernetes in production for microservices, not LLMs. Serving large language models breaks the assumptions that make K8s good at web apps. Here is how I would architect a production LLM platform, vendor-neutral, with the router, token accounting, and autoscaling Kubernetes will not give you. #kubernetes#LLM#Platform Engineering#AI Infrastructure#MLOps 2026-05-21 · kubernetes 05 Terraform, Pulumi, Kubernetes, or none: picking infra in 2026
I have used Terraform to run Kubernetes microservices in production. My personal site runs on 47 lines of wrangler.toml. After seven years with and without those tools, here is the honest cutoff. #cloudflare#terraform#pulumi#kubernetes#iac#opentofu#edge 2026-04-22 · cloudflare